Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
requirements_pymc.yml		requirements_pymc.yml
requirements_pymc_mo_linux.lock		requirements_pymc_mo_linux.lock
requirements_pytorch.yml		requirements_pytorch.yml
requirements_pytorch_mo_linux.lock		requirements_pytorch_mo_linux.lock
requirements_sklearn.yml		requirements_sklearn.yml
requirements_sklearn_mo_linux.lock		requirements_sklearn_mo_linux.lock
requirements_tensorflow-probability.yml		requirements_tensorflow-probability.yml
requirements_tensorflow-probability_mo_linux.lock		requirements_tensorflow-probability_mo_linux.lock
requirements_tensorflow.yml		requirements_tensorflow.yml
requirements_tensorflow_mo_linux.lock		requirements_tensorflow_mo_linux.lock
requirements_xgboost.yml		requirements_xgboost.yml
requirements_xgboost_mo_linux.lock		requirements_xgboost_mo_linux.lock

README.md

Setting up environments for The Data Science Community of Practice scripts/notebooks

The environments for code found in this repository, either in scripts or in notebooks, depending on the platform and type of work you are doing, are all primarily provided by conda environments.

These files provide pre-checked base environments that are a starting point for project environments into which other more project specific libraries can be added.

To create an environment from any of the .yml files, run the following command:

conda env create --file requirement.yml

Where you replace requirements.yml with the particular requirements file in the folder required for the files you want to run. There several different environments as different packages are required for different activities or demonstrations, some of which are mutually incompatible or make for slow/difficult to resolve installations, so we have tried to keep each environment as simple as possible to improve performance, as well as fixing compatible major and minor package versions to help the evironment solve.

The installation of environments with the yml files in conda can take a significant amount of memory, for this reason we present two methods to overcome this issue: Install the provided lock files which are system dependant, or Use mamba to install yml files.

If you are using a system supported by the existing lock files, this is the recommended method of install. Lock files specify versions and builds for all included libraries which completely avoid solving the evironment

Installing environments via provided lock files

To create an environment from any of the .lock files, run the following command which also requires a name for the environment to be specified:

conda create --name myenv --file requirements.lock

Lock file are generated on and for specific systems, and will fail to install on systems that are significantly different from the target. Each lock file includes a reference to the target system as follows: *_mo_linux.lock - Met Office internal linux "desktop" systems.
A significantly different system in this case would be a system not using both the Linux-64 architecture and the Met Office conda channels. Generation of the lock files was performed in the Met Office VDI.

Environment name suggestions can be found at the top of all .yml files

Installing environments via mamba

To create an environment using mamba, run the following set of commands:
First, using the desktop conda, create an environment that contains mamba

conda create --name mamba --channel conda-forge mamba

Now activate that environment

conda activate mamba

Now create your data science environment

mamba env create --file requirements.yml

Relevant Environment Links

Each environment has a focus on a particular technology, below lists the existing environments and links to the relevant page for each technology.

requirements_pytorch.yml: Pytorch
requirements_sklearn.yml: Scikit Learn
requirements_tensorflow-probability.yml: Tensorflow's Probability
requirements_tensorflow.yml: Tensorflow
requirements_xgboost.yml: XGBoost
requirements_pymc.yml: PyMC and ArviZ

Use on Met Office systems

Use on VDI

To use on VDI, just create the conda environment as above, activate and then either run the script required, or start a jupyter lab server. Eg:

jupyter lab

SPICE

To use a script on spice, you can simply activate the environment inside your SPICE script. To run the notebooks on spice, use of the Met Office Jupyter Hub installation is advised. Any conda environments you have installed in your homespace will automatically become available as kernels in JupyterHub.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

env

env

README.md

Setting up environments for The Data Science Community of Practice scripts/notebooks

Installing environments via provided lock files

Installing environments via mamba

Relevant Environment Links

Use on Met Office systems

Use on VDI

SPICE

Files

env

Directory actions

More options

Directory actions

More options

Latest commit

History

env

Folders and files

parent directory

README.md

Setting up environments for The Data Science Community of Practice scripts/notebooks

Installing environments via provided lock files

Installing environments via mamba

Relevant Environment Links

Use on Met Office systems

Use on VDI

SPICE