Skip to content

Latest commit

 

History

History
 
 

dask

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

RAPIDS Hyperparameter Optimization (HPO) with Dask ML

Dask-ML provides machine learning utilities built on top of the scalable Dask platform. Dask already offers first-class integration with RAPIDS, and Dask-ML is no exception.

The Dask-ML hyperparameter search tools make it easy to take advantage of grid search, randomized search, or hyperband HPO algorithms. It particularly excels at incorporating cross-validation into the HPO process for more stable accuracy estimates and at allowing intelligent reuse of intermediate results from the Dask task graph.

RAPIDS + Dask-ML sample notebooks

This sample notebook shows how to use Ray Tune to optimize XGBoost and cuML Random Forest classifiers over a large dataset of airline arrival times. By design, it is very similar to the RAPIDS examples provided for other cloud and bring-your-own-cloud HPO offerings.

You need both Jupyter and RAPIDS 0.13 or later installed to begin. See https://rapids.ai/start.html for instructions. We recommend using 0.14 nightly packages for the latest updates. Dask-ML can be installed via conda or pip, following the instructions from: https://ml.dask.org/install.html.