Selection of summary statistics for network model choice

We here provide a Python package named cost_based_selection associated with our manuscript

L. Raynal, T. Hoffmann, and J.-P. Onnela. "Selection of summary statistics for network model choice." arXiv, 2101.07766, 2021.

Description

This project focuses on cost-based selection of features for distinguishing between different (mechanistic network) models. The computational cost of features can vary significantly, and, in computationally demanding settings such as approximate Bayesian computation, selecting low-cost yet informative features is desirable. We consider cost-based adaptations of a range of feature selection methods as well as using pilot simulations based on smaller networks to identify informative features.

Installation

Clone the git repository or download the code as an archive.
Set up a new Python virtual environment, e.g., using pyenv. This code has been tested on python 3.9.9.
Install all python requirements by running pip install -r requirements.txt.
Ensure an R interpreter is installed (see https://www.r-project.org for installation instructions).
Install the ranger package for random forests, e.g., by running R -e 'if (!require("ranger")) install.packages("ranger", version="0.14.1", repos="http://cran.r-project.org/")'.
Run pytest -v to verify your installation.

Reproducing the results

You can reproduce all results presented in our paper by running the following commands

# Ignore figures until we've generated all other results.
doit ignore figures
# Run the analysis (this will take some time ...).
doit -n [number of cores]
# Forget that we ignored figures ...
doit forget figures
# ... and generate them.
doit figures

Your results may differ slightly from ours because computation times differ across machines and depend on other processes running on your machine. For reference, the results presented in the manuscript were obtained with doit -n 6 on a M1 Macbook Pro (2020) with 16GB of memory.

Name		Name	Last commit message	Last commit date
Latest commit History 130 Commits
.github/workflows		.github/workflows
cost_based_selection		cost_based_selection
figures		figures
scripts		scripts
tests		tests
.gitignore		.gitignore
DESCRIPTION.md		DESCRIPTION.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
dodo.py		dodo.py
requirements.in		requirements.in
requirements.txt		requirements.txt
scrartcl.mplstyle		scrartcl.mplstyle
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Selection of summary statistics for network model choice

Table of contents

Description

Installation

Reproducing the results

About

Releases

Packages

Contributors 3

Languages

License

onnela-lab/net-summary-selection

Folders and files

Latest commit

History

Repository files navigation

Selection of summary statistics for network model choice

Table of contents

Description

Installation

Reproducing the results

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages