Skip to content

Latest commit

 

History

History
52 lines (35 loc) · 1.34 KB

NOTES_Polyaxon.md

File metadata and controls

52 lines (35 loc) · 1.34 KB

Experiments tracking with Polyaxon

Requirements

In this case we assume, user has Polyaxon installed on a machine/cluster/cloud and can schedule experiments with polyaxon-cli.

Usage

Setup Polyaxon project

Create project on the cluster

polyaxon project create --name=imagenet --description="Classification on ImageNet"

Initialize local project

polyaxon init imagenet

Please rename and modify experiments/plx/xp_training.yml.tmpl to experiments/plx/xp_training.yml to adapt to your cluster configuration.

Download ImageNet dataset

Since 10/2019, we need to register an account in order to download the dataset. To download the dataset, use the following form : http://www.image-net.org/download.php

Single node with multiple GPUs

polyaxon run -u -f experiments/plx/xp_training.yml --name="baseline_resnet50" --tags=train,resnet50

Training tracking

Please, see Polyaxon dashboard usage at https://docs.polyaxon.com/

Implementation details

Files tree description:

code
configs  
experiments/plx : Polyaxon related files
notebooks 

Experiments

File xp_training.yml.tmpl defines all configurations and dependencies necessary for our experimentations. Part run.cmd starts single-node multi-GPU training script.