
[Webinar] An introduction to Ray for scaling machine learning (ML) workloads

Modern machine learning (ML) workloads, such as deep learning and large-scale model training, are compute-intensive and require distributed execution. Ray was created in the UC Berkeley RISELab to make it easy for every engineer to scale their applications, without requiring any distributed systems expertise.

Join Robert Nishihara, co-creator of Ray, and Bill Chambers, product lead for Ray, for an introduction to Ray for scaling your ML workloads. Learn how Ray libraries (e.g., Ray Tune and Ray Serve) help you easily scale every step of your ML pipeline, from model training and hyperparameter search to production serving.

Highlights include:
* Ray overview & core concepts
* Library ecosystem and use cases
* Demo: Ray for scaling ML workflows
* Getting started resources


Anyscale

August 18, 2021

Transcript

  1. Introduction to Ray for scaling machine learning. Robert Nishihara, co-founder of Anyscale and co-creator of Ray; Bill Chambers, product lead, Anyscale
  2. Why Ray? - Machine learning is pervasive in every domain - Distributed machine learning is becoming a necessity - Distributed computing is notoriously hard
  3. Why Ray? - Machine learning is pervasive in every domain - Distributed machine learning is becoming a necessity - Distributed computing is notoriously hard
  4. Apps increasingly incorporate AI/ML

  5. Why Ray? - Machine learning is pervasive in every domain - Distributed machine learning is becoming a necessity - Distributed computing is notoriously hard
  6. Compute demand growing faster than supply. 35x every 18 months (GPT-3, 2020) vs. Moore's Law (2x every 18 months, CPU). https://openai.com/blog/ai-and-compute/
  7. Specialized hardware is also not enough. 35x every 18 months (GPT-3, 2020) vs. Moore's Law (2x every 18 months); CPU, GPU*, TPU*. https://openai.com/blog/ai-and-compute/
  8. Specialized hardware is also not enough. 35x every 18 months (GPT-3, 2020) vs. Moore's Law (2x every 18 months); CPU, GPU*, TPU*. https://openai.com/blog/ai-and-compute/ No way out but to distribute!
  9. Why Ray? - Machine learning is pervasive in every domain - Distributed machine learning is becoming a necessity - Distributed computing is notoriously hard
  10. Existing solutions have many tradeoffs: generality vs. ease of development

  11. Existing solutions have many tradeoffs: generality vs. ease of development

  12. Existing solutions have many tradeoffs: generality vs. ease of development

  13. Existing solutions have many tradeoffs: generality vs. ease of development

  14. Why Ray? - Machine learning is pervasive in every domain - Distributed machine learning is becoming a necessity - Distributed computing is notoriously hard. Ray's vision: make distributed computing accessible to every developer
  15. The Ray Ecosystem

  16. Rich ecosystem for scaling ML workloads. Native libraries: easily scale common bottlenecks in ML workflows (examples: Ray Tune for hyperparameter optimization, RLlib for reinforcement learning, Ray Serve for serving, etc.). Integrations: scale popular frameworks with Ray with minimal changes (examples: XGBoost, TensorFlow, JAX, PyTorch, etc.)
  17. Rich ecosystem for scaling ML workloads. [Diagram: data processing, training, hyperparameter tuning, model serving, and reinforcement learning libraries built on Ray Core + Datasets.] * A small subset of the Ray ecosystem in ML
  18. Rich ecosystem for scaling ML workloads. [Diagram: Ray ML libraries built on Ray Core + Datasets.] Integrate Ray only based on your needs!
  19. Challenges in scaling hyperparameter tuning? [Diagram: Ray ML libraries built on Ray Core + Datasets.]
  20. Rich ecosystem for scaling ML workloads. [Diagram: Ray ML libraries built on Ray Core + Datasets.] Integrate Ray Tune! No need to adopt the entire Ray framework.
  21. Generality vs. ease of development: stitching together different frameworks to go end-to-end?
  22. Rich ecosystem for scaling ML workloads. [Diagram: Ray ML libraries built on Ray Core + Datasets.] A unified, distributed toolkit to go end-to-end
  23. Companies scaling ML with Ray

  24. Companies scaling ML with Ray. [Diagram: Ray ML libraries built on Ray Core + Datasets.]
  25. Scaling Ecosystem Restoration Dendra Systems

  26. Making Boats Fly with AI McKinsey | QuantumBlack Australia

  27. Large Scale ML Platforms Uber, Shopify, Robinhood, and more

  28. Demo

  29. Start scaling your ML workloads. Getting started: Documentation (docs.ray.io): quick start examples, reference guides, etc. Forums (discuss.ray.io): learn and share with the broader Ray community, including the core team. Ray Slack: connect with the Ray team and community
  30. Thank you