Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
DeepRacer for learning RL
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
貞松政史
April 06, 2019
Technology
1.4k
0
Share
DeepRacer for learning RL
2019.4.6 Developers.IO at OKAYAMA.
貞松政史
April 06, 2019
More Decks by 貞松政史
See All by 貞松政史
Amazon Forecast亡き今、我々がマネージドサービスに頼らず時系列予測を実行する方法
sadynitro
0
1.3k
今日のハイライトをシステマティックに
sadynitro
1
86
はじめてのレコメンド〜Amazon Personalizeを使った推薦システム超超超入門〜
sadynitro
2
2.6k
予知保全利用を目指した外観検査AIの開発 〜画像処理AIを用いた外観画像に対する異常検知〜
sadynitro
0
1.3k
20230904_GoogleCloudNext23_Recap_AI_ML
sadynitro
0
950
Foundation Model全盛時代を生きるAI/MLエンジニアの生存戦略
sadynitro
0
1.1k
Amazon SageMakerが存在しない世界線 のAWS上で実現する機械学習基盤
sadynitro
0
320
Amazon SageMakerが存在しない世界線のAWS上で実現する機械学習基盤
sadynitro
0
2.1k
みんな大好き強化学習
sadynitro
0
1.3k
Other Decks in Technology
See All in Technology
Modernizing Your HCL Connections Experience: Visual Report to chain, Profile Enhancements, and AI Integration
wannesrams
0
280
Building Production-Ready Agents Microsoft Agent Framework
_mertmetin
0
160
Digital Independence: Why, When and How
wannesrams
0
280
The 7 pitfalls of AI
ufried
0
190
世界の中心でApp Runnerを叫ぶ FINAL
tsukuboshi
0
230
VespaのParent Childを用いたフィードパフォーマンスの改善
taking
0
260
知ってた?JavaScriptの"正しさ"を検証するテストが5万以上もあること(Test262)
riyaamemiya
1
140
需要創出(Chatwork)×供給(BPaaS) フライホイールとMoat 実行能力の最適配置とAI戦略
kubell_hr
0
2k
「QA=テスト」「シフトレフト=スクラムイベントの参加者の一員」の呪縛を解く。アジャイルな開発を止めないために、10Xで挑んだ「右側のしわ寄せ」解消記 #scrumniigata
nihonbuson
PRO
3
820
音声言語モデル手法に関する発表の紹介
kzinmr
0
160
Fabric MCPの紹介と使い分け
ryomaru0825
1
120
「誰一人取り残されない」 AIエージェント時代のプロダクト設計思想 Product Management Summit 2026
mizushimac
1
2.9k
Featured
See All Featured
How to optimise 3,500 product descriptions for ecommerce in one day using ChatGPT
katarinadahlin
PRO
1
3.6k
The #1 spot is gone: here's how to win anyway
tamaranovitovic
2
1k
Leo the Paperboy
mayatellez
7
1.7k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
254
22k
What does AI have to do with Human Rights?
axbom
PRO
1
2.1k
Odyssey Design
rkendrick25
PRO
2
610
Performance Is Good for Brains [We Love Speed 2024]
tammyeverts
12
1.6k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
122
21k
Effective software design: The role of men in debugging patriarchy in IT @ Voxxed Days AMS
baasie
0
330
Ecommerce SEO: The Keys for Success Now & Beyond - #SERPConf2024
aleyda
1
2k
The Invisible Side of Design
smashingmag
302
52k
Agile Actions for Facilitating Distributed Teams - ADO2019
mkilby
0
180
Transcript
4 D 29 26 1 . 0 I 1
& .-* (2 ,0'/4"# 51 83;7 +)
!&%$9( 6: Attention
3 #cmdevio2019
4 os t m ( L @S g E b
i _d L rMI D @ E ( ( ( ) ( e a n k AWS E
5 DeepRacer
6 D
7 ) (
8 …
9 DeepRacer 4 D 26 9 01 .
10 DeepRacer A A A
11
1 2 3
12 DeepRacer
13
14 DeepRacer 1/18
3D AWS DeepRacer League
15 DeepRacer https://aws.amazon.com/jp/deepracer/
16 DeepRacer ! &%$ +)*2 1 '/*2
(#-, 0. "
17 3D AWS RoboMaker Robot Operating System (ROS) Gazebo rqt
18 AWS DeepRacer League ⁻ 0 1 : 9 A
⁻ 9 2 R ⁻ ⁻ D I ⁻ 1 2 ⁻ https://aws.amazon.com/jp/deepracer/league/
19
20 (Artificial Intelligence, AI) (Machine Learning, ML)
NeuralNetwork DeepLearning
21
22 = 1 (
) ( (
23 L N - ) ( - D Q
24 DeepRacer Cliped PPO PPO (Proximal
Policy Optimization) OpenAI2017
25 ( ( )
)
26 1
27 ) () (
28 DeepRacer
29 DeepRacer + + +
30 DeepRacer
31 DeepRacer …
32 DeepRacer
33 orz
34 DeepRacer + + +
35 DeepRacer D A D
36 $ ' + (# &!
%"
37 ( ) ) https://docs.aws.amazon.com/ja_jp /deepracer/latest/developerguide/ deepracer-train-models-define- reward-function.html
38
39 ⁻ 10 ⁻
:
40 SageMaker RL + RoboMaker
41 SageMeker RLRoboMakerGA
42 SageMaker “RL” ⁻ ⁻ M ⁻ M M
⁻ M ⁻ ⁻ ⁻ J M S
43 DeepRacer ) D ( ) ) ( )
44 SageMaker
https://dev.classmethod.jp/machine -learning/sagemaker-robomaker- deepracer-sample/
45 $# "
! https://github.com/awslabs/amazon-sagemaker-examples
46 Jupyter !
47 ( ( )
48
49 2 1
50 2 2
51 (. ( )(
52 $2# " /1(+ $2#,- ! https://docs.aws.amazon.com/ja_jp/deepracer/latest/developerguide/deepracer -iteratively-enhance-reward-functions.html *
$2%) '.0& )
53 Best Practices when training with PPO
(Unity Technologies) https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Training-PPO.md
54 DeepRacer "% ! "%
$# ! !
55 DeepRacer
56 DeepRacer
57 DeepRacer
58 DeepRacer
59 DeepRacer
60
61 • g • + + • M D c
• R S D a k • b • LL e
62 DeepRacer
None