Upgrade to Pro — share decks privately, control downloads, hide ads and more …

State of Ray (Robert Nishihara, Anyscale)

State of Ray (Robert Nishihara, Anyscale)

Robert Nishihara, co-founder and CEO, Anyscale, kicks off Ray Summit 2021 with a look back at the past year, celebrating the growth of the Ray project, ecosystem, and community.

Anyscale

July 14, 2021
Tweet

More Decks by Anyscale

Other Decks in Technology

Transcript

  1. Native Libraries 3rd Party Libraries Your app here! Ecosystem -

    the big picture! Universal framework for distributed computing Run anywhere Library + app ecosystem
  2. “Ray will play an increasingly important role in bringing much

    needed common infrastructure and standardization to the production machine learning ecosystem, both within Uber and the industry at large.” Horovod and more ...
  3. Driver Driver Ray cluster import tensorflow import pandas @ray.remote def

    f(): # app logic if __name__ == ‘__main__’: main() import tensorflow import pandas @ray.remote def f(): # app logic if __name__ == ‘__main__’: main() ray.client() Streamlined Workflows NEW
  4. Driver import tensorflow import pandas @ray.remote def f(): # app

    logic if __name__ == ‘__main__’: main() Ray cluster ray.client() Streamlined Workflows NEW
  5. ray.client().env( { 'pip': ['tensorflow', 'pandas'], 'working_dir': '/home/my-project' } ) ray.client().env(

    { 'pip': ['tensorflow', 'pandas'], } ) Environments Streamlined Workflows
  6. Expand your knowledge 50+ breakout sessions Go deeper Tutorials (day

    3) Connect Slack and Gather.town Welcome to Ray Summit!
  7. Get Involved Meet Ray users and developers (today after the

    break) Keynote: Ion Stoica Find the talks online youtube.com/anyscale Thank you!
  8. Ray Summit 2021 Ray Core • Ray internals: object management

    (today) • Deep dive into Ray’s scheduling (tomorrow) Ecosystem • Patterns of ML models in production (today) • Distributed XGBoost on Ray (today) • A bridge for preprocessing and training (today) • Data processing on Ray (today) • The ML ecosystem (tomorrow)
  9. Ecosystem • Airflow • XGBoost • PyTorch • Horovod We

    believe that Ray will continue to play an increasingly important role in bringing much needed common infrastructure and standardization to the production machine learning ecosystem, both within Uber and the industry at large.
  10. Ecosystem • Airflow • XGBoost • PyTorch • Horovod •

    Hugging Face • MLflow • Scikit-learn
  11. Ecosystem • Airflow • XGBoost • PyTorch • Horovod •

    Hugging Face • MLflow • Scikit-learn • Dask
  12. Ecosystem • Airflow • XGBoost • PyTorch • Horovod •

    Hugging Face • MLflow • Scikit-learn • Dask • And more… Ray is the tool of choice for scaling libraries
  13. Native Libraries 3rd Party Libraries Your app here! Ecosystem universal

    framework for distributed computing run anywhere library + app ecosystem
  14. What’s New - Dev & Prod Improved workflows for development

    & production • Ray client • Environments
  15. What’s New - Dev & Prod Ray Client Driver Driver

    Ray cluster import tensorflow import pandas @ray.remote def f(): # app logic if __name__ == ‘__main__’: main() import tensorflow import pandas @ray.remote def f(): # app logic if __name__ == ‘__main__’: main()
  16. What’s New - Dev & Prod Ray Client Driver import

    tensorflow import pandas @ray.remote def f(): # app logic if __name__ == ‘__main__’: main() Ray cluster
  17. What’s New - Dev & Prod Environments ray.client().env( ) Environments

    ray.client().env( { 'pip': ['tensorflow', 'pandas'], } )
  18. What’s New - Dev & Prod Environments ray.client().env( { 'pip':

    ['tensorflow', 'pandas'], } ) Environments ray.client().env( { 'pip': ['tensorflow', 'pandas'], 'working_dir': '/home/my-project' } )
  19. User Stories - Uber Ray will play an increasingly important

    role in bringing much needed common infrastructure and standardization to the production machine learning ecosystem, both within Uber and the industry at large.
  20. We believe that Ray will continue to play an increasingly

    important role in bringing much needed common infrastructure and standardization to the production machine learning ecosystem, both within Uber and the industry at large. Horovod and more ...
  21. Native Libraries 3rd Party Libraries Your app here! Ecosystem -

    the big picture! Universal framework for distributed computing Run anywhere Library + app ecosystem
  22. What’s New - Robustness & Scale • Data Processing on

    Ray ◦ SangBin Cho • Ray Internals: Object Management with the Ownership Model ◦ Stephanie Wang, Yi Cheng • A Deep Dive into Ray’s Scheduling Policy ◦ Alex Wu • MLDataset: A Ray Bridge for Data Preprocessing and Distributed Training ◦ Clark Zinzow • Building High Availability and Scalability Online Computing Applications on Ray ◦ Tengwei Cai • Improving Ray for Large-scale Applications ◦ Hao Chen