Deep Learning Labs #7 Warsaw

Video link: https://www.youtube.com/watch?v=JHX87iv8YJA

Who we are and what we do?

Deep Learning Labs / Warsaw Season #01 Episode #07

Reinforcement Learning (RL) Basics By Misha Zanka

What is RL?

Policy A policy is an agent's strategy. https://towardsdatascience.com/self-learning-ai-agents-iv-stochastic-policy-gradients-b53f088fce20

Stable-baselines • Stable Baselines is a set of improved implementations
of Reinforcement Learning (RL) algorithms based on OpenAI Baselines. • Main feature is unified interface for all models. • You are free to use other frameworks, but this one is the most user-friendly

Grading Today the ranking includes the following tasks : •
CartPole - 1 pt. • LunarLander - 4 pt. • Hopper - 6 pt. • HalfCheetah - 12 pt. • BipedalWalker - 24 pt. Extra 20% points to each task in case when team will make something special, like good presentation with insights or non-standard solution of the problem. We will maintain the leaderboard on our page.

Submission You will have the link to the google form
where you will need to send: • If you used stable-baselines .zip with a model and name of the algorithm used. • If you used smth else, send trained model and instruction how to extract actions from your policy

Template for submitting results will be available

Deep Learning Labs #7 Warsaw

Deep Learning Labs #7 Warsaw

Mathias Åsberg

More Decks by Mathias Åsberg

Other Decks in Programming

Featured

Transcript

Video link: https://www.youtube.com/watch?v=JHX87iv8YJA

Who we are and what we do?

Deep Learning Labs / Warsaw Season #01 Episode #07

Reinforcement Learning (RL) Basics By Misha Zanka

What is RL?

Policy A policy is an agent's strategy. https://towardsdatascience.com/self-learning-ai-agents-iv-stochastic-policy-gradients-b53f088fce20

Stable-baselines • Stable Baselines is a set of improved implementations

Grading Today the ranking includes the following tasks : •

Submission You will have the link to the google form

Template for submitting results will be available