This repository provides a PyTorch framework for comparing different active learning (AL) methods under different setups. We additionally propose to exploit knowledge acquired from unlabeled data by adding unsupervised and semi-supervised methods. Among other things, this code allows you to reproduce the following results:
These results were published in the paper:
Rethinking deep active learning: Using unlabeled data at model training Siméoni O., Budnik M., Avrithis Y., Gravier G. ICPR 2020 [arXiv]
- Python3 (tested on version 3.6)
- PyTorch (tested on version 1.4)
- FAISS
If using conda, you can run the following commands:
conda create -n rethinkingAL python=3.6
conda activate rethinkingAL
And then install the python packages using
pip install -r requirements.txt
conda install -c pytorch faiss-gpu
In order to download and format the CIFAR10, CIFAR100, MNIST, and SVHN datasets, please run the following script:
sh data/install_datasets.sh
You can create additional splits of different sizes by running the script below. By default, the script creates 5 splits of 100 balanced labels each. For example, to create 10 splits of size 1000 for CIFAR10:
python data/generate_splits.py --dataset cifar10 --nr-splits 10 --split-size 1000
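The balanced-split idea can be sketched as follows. This is a hypothetical simplification, not the repository's actual `generate_splits.py`: it samples an equal number of labeled indices per class.

```python
# Minimal sketch of balanced split generation (hypothetical helper, not the
# repo's generate_splits.py): sample split_size indices, equally many per class.
import random
from collections import defaultdict

def make_balanced_split(labels, split_size, seed=0):
    """Return split_size indices with an equal count for every class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    per_class = split_size // len(by_class)
    chosen = []
    for idxs in by_class.values():
        chosen.extend(rng.sample(idxs, per_class))
    rng.shuffle(chosen)
    return chosen

# CIFAR10-like labels: 10 classes, a 1000-label split gives 100 per class
labels = [i % 10 for i in range(50000)]
split = make_balanced_split(labels, 1000)
```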
In our paper, we propose to test different AL baseline methods with the addition of unsupervised pre-training and semi-supervised methods.
The following command launches learning for one split (here split 0; for more repetitions, run different splits) with the method uncertainty_entropy on the dataset cifar10. The --dataset argument currently accepts cifar10, cifar100, svhn, or mnist.
python main_al.py --dataset cifar10 --al-method uncertainty_entropy --split 0 --al-budget 100
It is possible to add unsupervised pretraining (--add-unsupervised-pretraining True). We follow the unsupervised method DeepCluster.
python main_al.py --dataset cifar10 --al-method uncertainty_entropy --split 0 --al-budget 100 --add-unsupervised-pretraining True
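The core DeepCluster idea can be sketched like this. It is a deliberately simplified, hypothetical illustration (the real method trains a ConvNet and clusters PCA-reduced features with FAISS): cluster the current features with k-means and reuse the cluster assignments as pseudo-labels for a supervised loss.

```python
# Hedged sketch of the DeepCluster pseudo-labeling step (simplified; the
# actual implementation uses FAISS k-means on ConvNet features): cluster
# feature vectors and return cluster ids as classification targets.
import numpy as np

def kmeans_pseudo_labels(features, k, iters=10, seed=0):
    rng = np.random.default_rng(seed)
    # initialize centers on k distinct data points
    centers = features[rng.choice(len(features), k, replace=False)]
    for _ in range(iters):
        # assign each point to its nearest center (squared Euclidean)
        d = ((features[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        assign = d.argmin(axis=1)
        # recompute centers; keep the old center if a cluster is empty
        for j in range(k):
            pts = features[assign == j]
            if len(pts):
                centers[j] = pts.mean(axis=0)
    return assign  # used as pseudo-labels for the next training epochs
```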
We also implemented the addition of the semi-supervised label-propagation method (--add-lp True), following Iscen et al.
python main_al.py --dataset cifar10 --al-method uncertainty_entropy --split 0 --al-budget 100 --add-lp True --b 128 --labeled-batch-size 50
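A tiny sketch of label propagation in the spirit of Iscen et al. may help. This is an assumed simplification (the repository builds kNN affinity graphs with FAISS and solves the diffusion more efficiently): labels are diffused over a symmetrically normalized affinity matrix.

```python
# Hedged sketch of graph label propagation (simplified illustration, not the
# repo's exact solver): iterate F = alpha * S @ F + (1 - alpha) * Y, where S
# is the symmetrically normalized affinity matrix.
import numpy as np

def propagate(W, Y, alpha=0.99, iters=50):
    """W: [N, N] symmetric affinity with zero diagonal.
    Y: [N, C] one-hot rows for labeled points, zero rows for unlabeled."""
    d = W.sum(axis=1)
    Dinv = np.diag(1.0 / np.sqrt(np.maximum(d, 1e-12)))
    S = Dinv @ W @ Dinv              # symmetric normalization
    F = Y.copy()
    for _ in range(iters):
        F = alpha * S @ F + (1 - alpha) * Y
    return F.argmax(axis=1)          # pseudo-labels for all points
```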
The --al-method argument can take any of the following four values, corresponding to the different methods we have implemented:
- random
- uncertainty_entropy
- coreset
- jlp (our AL acquisition function based on label-propagation)
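The uncertainty_entropy acquisition step above can be sketched as follows. This is the assumed standard formulation, not necessarily the repository's exact code: rank unlabeled samples by predictive entropy and query the most uncertain ones.

```python
# Hedged sketch of entropy-based acquisition (standard formulation, assumed):
# pick the `budget` unlabeled samples with the highest predictive entropy.
import numpy as np

def entropy_acquire(probs, budget):
    """probs: [N, C] softmax outputs on the unlabeled pool.
    Returns the indices of the `budget` most uncertain samples."""
    ent = -(probs * np.log(np.clip(probs, 1e-12, None))).sum(axis=1)
    return np.argsort(-ent)[:budget]
```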
The semi-supervised method CEAL can be added on top of any of the previous acquisition functions using the --add-ceal parameter. For example:
python main_al.py --dataset cifar10 --al-method uncertainty_entropy --split 0 --al-budget 100 --add-ceal True
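The CEAL idea can be sketched briefly. This is an assumed simplification of the method (thresholds and scheduling differ in the original): in addition to the queried uncertain samples, unlabeled samples whose predictions fall below an entropy threshold receive their predicted class as a pseudo-label for training.

```python
# Hedged sketch of CEAL pseudo-labeling (simplified, assumed behavior): keep
# only high-confidence (low-entropy) predictions and use their argmax class.
import numpy as np

def ceal_pseudo_labels(probs, threshold):
    """probs: [N, C] softmax outputs. Returns (indices, pseudo_labels) for
    samples whose predictive entropy is below `threshold` (in nats)."""
    ent = -(probs * np.log(np.clip(probs, 1e-12, None))).sum(axis=1)
    confident = np.where(ent < threshold)[0]
    return confident, probs[confident].argmax(axis=1)
```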
If you use our work, please cite us using:
@conference{SMAG20,
title = {Rethinking deep active learning: Using unlabeled data at model training},
author = {O. Sim\'eoni and M. Budnik and Y. Avrithis and G. Gravier},
booktitle = {Proceedings of International Conference on Pattern Recognition (ICPR)},
month = {12},
address = {Virtual},
year = {2020}
}
The code is based on the Mean Teacher PyTorch, LabelProp-SSDL, and DeepCluster implementations.