Skip to content

roarjn/cloud-ml-examples

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

 RAPIDS Cloud Machine Learning Services Integration

RAPIDS is a suite of open-source libraries that bring GPU acceleration to data science pipelines. Users building cloud-based hyperparameter optimization experiments can take advantage of this acceleration throughout their workloads to build models faster, cheaper, and more easily on the cloud platform of their choice.

This repository provides example notebooks and "getting started" code samples to help you integrate RAPIDS with the hyperparameter optimization services from Azure ML, AWS Sagemaker, Google Cloud, and Databricks. The directory for each cloud contains a step-by-step guide to launch an example hyperparameter optimization job.

Each example job will use RAPIDS cuDF to load and preprocess 20 million rows of airline arrival and departure data and build a model to predict whether or not a flight will arrive on time. It demonstrates both cuML Random Forests and GPU-accelerated XGBoost modeling.

Microsoft Azure ML

Azure ML Step-by-step.

AWS SageMaker

Amazon SageMaker Step-by-step.

Google Cloud AI Platform

Google Cloud AI Step-by-step

Databricks

Databricks Step-by-step

MLflow

Local Step-by-step

Databricks Step-by-step

Kubernetes Step-by-step

Bring Your Own Cloud (Dask and Ray)

In addition to public cloud HPO options, the respository also includes "BYOC" sample notebooks that can be run on the public cloud or private infrastructure of your choice. These leverage Ray Tune or Dask-ML for distributed infrastructure, while demonstrating the same airline classifier HPO workload.

Logo

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 89.2%
  • Python 9.6%
  • Other 1.2%