Using RLLib in an Enterprise Scale Reinforcemen...

July 21, 2021

260

Using RLLib in an Enterprise Scale Reinforcement Learning Solution (Jeroen Bédorf & Ishaan Sood, minds.ai)

DeepSim is an optimization platform that can use advanced Reinforcement Learning (RL) methods to develop neural network-based controller software. DeepSim supports various RL libraries, including RLLib. In this talk, we discuss how RLLib, as well as the Tune hyperparameter optimizer, are used to develop controller software. Next, to the default set of features that RLLib offers, DeepSim offers its users a set of custom loggers, actions distributions and network architectures for improved performance of the controllers. The training runs, required to train the neural network, are executed on a Kubernetes based Ray cluster and can be monitored via command line interface tools as well as via TensorBoard. Finally, we show how the trained neural network can be exported, for example via Keras, to be deployed on target hardware.

All the above is demonstrated using two concrete examples, in the first the fuel efficiency of a Hybrid Electric Vehicle is optimized and in the second we develop cruise control software using the Ansys VRXPERIENCE autonomous driving simulator.

Anyscale

July 21, 2021

Tweet

More Decks by Anyscale

See All by Anyscale

Ray in 2023: Ray in Reflection

0

150

Evaluating LLM Applications is hard

0

4.2k

Developing and serving RAG-Based LLM applications in production

0

140

Ray_Essentials__Introduction_to_Ray_for_machine_learning.pdf

0

160

How to build a serverless database cloud service

0

110

Multi-Region/Cloud Ray Pipeline with Distributed Caching

0

170

Modern Compute Stack for Scaling Large AI/ML/LLM Workloads

0

110

5 Painful Lessons using LLMs

0

140

How continuous batching enables 23x throughput in LLM inference

0

1.3k

Other Decks in Technology

See All in Technology

「良さそう」と「とても良い」の間には「良さそうだがホンマか」がたくさんある / 2025.07.01 LLM品質Night

1

420

Yamla: Rustでつくるリアルタイム性を追求した機械学習基盤 / Yamla: A Rust-Based Machine Learning Platform Pursuing Real-Time Capabilities

4

170

生成AIで小説を書くためにプロンプトの制約や原則について学ぶ / prompt-engineering-for-ai-fiction

4

3.2k

mrubyと micro-ROSが繋ぐロボットの世界

2

380

How Community Opened Global Doors

1

130

Connect 100+を支える技術

0

140

PHPでWebブラウザのレンダリングエンジンを実装する

0

220

LangSmith×Webhook連携で実現するプロンプトドリブンCI/CD

1

140

開発生産性を組織全体の「生産性」へ！部門間連携の壁を越える実践的ステップ

0

270

Amazon Bedrockで実現する新たな学習体験

2

670

250627 関西Ruby会議08 前夜祭 RejectKaigi「DJ on Ruby Ver.0.1」

2

370

Beyond Kaniko: Navigating Unprivileged Container Image Creation

0

100

Featured

See All Featured

Building Better People: How to give real-time feedback that sticks.

367

19k

Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End

252

21k

Cheating the UX When There Is Nothing More to Optimize - PixelPioneers

stephaniewalter

281

13k

Improving Core Web Vitals using Speculation Rules API

sergeychernyshev

17

950

What's in a price? How to price your products and services

246

12k

Into the Great Unknown - MozCon

39

1.9k

Unsuck your backbone

671

58k

How to Think Like a Performance Engineer

24

1.7k

455

42k

Adopting Sorbet at Scale

77

9.4k

The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024

26

2.9k

We Have a Design System, Now What?

53

7.7k

Transcript

Using RLlib in an enterprise scale reinforcement learning solution Ray
Summit 2021 Jeroen Bédorf, [email protected] Ishaan Sood, [email protected]
©minds.ai Problem Statements Integration and usage of RLlib and Tune
DeepSim Platform Adaptive Cruise Control Demo Hybrid Electric Vehicle Demo Outline
©minds.ai Trend: Exploding complexity and proliferation of smart systems DeepSim:
Bring RL to Subject Matter Experts Electrification Autonomy Automation Renewables
©minds.ai DeepSim: Bring RL to Subject Matter Experts Controllers: Brains
behind complex systems Reinforcement Learning Controllers: Trained for operating complex systems PID Controller Process Feedback Input Output RL Agent (neural network) Environment Input Output
©minds.ai DeepSim: Platform Overview Environment Integration & Scenario support Training
libraries Data Analysis & Visualization Toolkit HPO & NAS Neural Network Models & definition method Front end TFAgents RLlib Ray Internal MPI Horovod Tune Internal Public Cloud Backend Algorithms Distribution method
©minds.ai DeepSim: Usage of Ray, RLlib and Tune Custom Action
Distributions Easy Model Definition Method Custom Logging Custom Models Export Methods Analysis Tools RLlib Tune Ray Inference Methods
©minds.ai Typical end-user workflow Configure simulation, reward, etc. Status &
Progress information Export trained Agent 1 2 3 Set up training runs (HPO & NAS) Tune Train
©minds.ai Optimizer (ONNX, TensorRT, etc.) Trained Agent Ray Serve Embedded
Laptop/Workstation Inference System / Controller RLlib Checkpoint Inference Library Deployment
©minds.ai. Use Cases