DeepRacer DeepDive
jogannaoki
December 05, 2018
Transcript
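Naoki Jogan, Classmethod, Inc., AWS Business Unit, Consulting Dev.
2018/12/5 DeepRacer Deep Dive, re:Growth 2018
Self-introduction: Naoki Jogan, AWS Business Unit, Consulting Department
• Solutions Architect
• Joined Classmethod after working as an application engineer at a systems integrator (SIer)
• Services I have been working with recently: EKS, Sumerian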
re:Invent 2018 Keynote
Today's topics
• What can you do with DeepRacer?
• How do you get DeepRacer running?
• How does DeepRacer learn?
What can you do with DeepRacer?
• You can learn reinforcement learning
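Background to DeepRacer's creation: "How can we put Reinforcement Learning in the hands of all developers?"
DeepRacer is a means of learning reinforcement learning
• An exciting way to put reinforcement learning in developers' hands
• Provides hands-on experience with reinforcement learning
What is reinforcement learning?
Types of machine learning (UCL Course on RL, Lecture 1: Introduction to Reinforcement Learning)
• Supervised learning
• Unsupervised learning
• Reinforcement learning (DeepRacer)
Autonomous driving
Go and shogi
How is reinforcement learning done?
Reinforcement learning (example): reward, RL algorithm
• Eat plenty of food
• Move toward places with no enemies
• Simulation
• Maximize the reward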
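The food-and-enemies example can be written down as a per-step reward plus the total the agent tries to maximize. This is only a sketch: the function names and the values +1.0 for food and -10.0 for touching an enemy are assumptions made for illustration and do not come from the slides.

```python
def step_reward(ate_food: bool, hit_enemy: bool) -> float:
    """Hypothetical per-step reward for the food-and-enemies game."""
    reward = 0.0
    if ate_food:
        reward += 1.0   # assumed bonus for eating food
    if hit_enemy:
        reward -= 10.0  # assumed penalty for running into an enemy
    return reward


def episode_return(steps) -> float:
    """Sum of step rewards; this is the quantity the agent maximizes."""
    return sum(step_reward(ate_food, hit_enemy) for ate_food, hit_enemy in steps)
```

Each element of `steps` is an `(ate_food, hit_enemy)` pair observed during one simulated step. An RL algorithm would compare such totals across many simulated episodes.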
Trying out DeepRacer
Steps to get DeepRacer running
• Step 1: Create the reward function
• Step 2: Simulation
• Step 3: Drive it in the physical world
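Step 1: Reward function
Reward function (Python): input → output
• distance_from_center: distance from the center line
• on_track: whether the front of the vehicle is outside the white line
• throttle: the car's speed, from stopped up to maximum
• track_width: width of the track
and so on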
Step 1: Reward function
marker_1 = … * track_width
marker_2 = … * track_width
marker_3 = … * track_width
reward = …e…
if distance_from_center … and distance_from_center … marker_1:
    reward = …
elif distance_from_center … marker_2:
    reward = …
elif distance_from_center … marker_3:
    reward = …
else:
    reward = …e…  # likely crashed / close to off track
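A runnable sketch of a reward function with this shape follows. The marker fractions (0.1, 0.25, 0.5) and the reward values (1.0, 0.5, 0.1, 1e-3) are illustrative assumptions, not values taken from the slide. Passing the inputs as a single `params` dictionary is also an assumption, since the slide shows only the function body.

```python
def reward_function(params):
    """Reward the car more the closer it stays to the center line."""
    track_width = params["track_width"]
    distance_from_center = params["distance_from_center"]

    # Assumed band edges, expressed as fractions of the track width.
    marker_1 = 0.1 * track_width
    marker_2 = 0.25 * track_width
    marker_3 = 0.5 * track_width

    if 0.0 <= distance_from_center <= marker_1:
        reward = 1.0   # assumed: very close to the center line
    elif distance_from_center <= marker_2:
        reward = 0.5   # assumed: drifting away from the center
    elif distance_from_center <= marker_3:
        reward = 0.1   # assumed: near the edge of the track
    else:
        reward = 1e-3  # likely crashed / close to off track
    return float(reward)
```

For example, `reward_function({"track_width": 1.0, "distance_from_center": 0.2})` falls in the second band. Widening or narrowing the bands changes how strongly the car is pulled toward the center line.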
Step 2: Create the model
Step 3: Drive it in the physical world (trained model)
Architecture on AWS: SageMaker, RoboMaker, S3, Kinesis Video Streams, CloudWatch Logs, Client
How does DeepRacer learn?
How is training performed? (https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018)
• STATE: the view from DeepRacer's front camera
• REWARD: good if the car is close to the center line; good if it is carrying speed
• ACTION: speed up, turn right, turn left
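The STATE, ACTION, REWARD cycle can be sketched as a loop over a toy track. This does not use the DeepRacer simulator. `ToyTrack`, `run_episode`, the action names and the reward formula are all hypothetical stand-ins, and the state is reduced to a lateral offset instead of a camera image.

```python
ACTIONS = ("speed_up", "straight", "turn_right", "turn_left")


class ToyTrack:
    """Hypothetical environment: the state is the offset from the center line."""

    def __init__(self, half_width=2):
        self.half_width = half_width
        self.offset = 0  # S0: the car starts on the center line

    def step(self, action):
        # In this toy, only turning changes the state; speed is not modeled.
        if action == "turn_right":
            self.offset += 1
        elif action == "turn_left":
            self.offset -= 1
        off_track = abs(self.offset) > self.half_width
        # Assumed reward: higher near the center line, zero once off the track.
        reward = 0.0 if off_track else 1.0 / (1 + abs(self.offset))
        return self.offset, reward, off_track


def run_episode(policy, max_steps=10):
    """Run one episode: observe state, pick action, collect reward."""
    env = ToyTrack()
    state, total = env.offset, 0.0
    for _ in range(max_steps):
        action = policy(state)                  # ACTION chosen from STATE
        state, reward, done = env.step(action)  # next STATE and REWARD
        total += reward
        if done:
            break
    return total
```

A policy is any function from state to action, so `run_episode(lambda s: "straight")` keeps the car centered, while always turning right leaves the track after three steps and earns far less.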
How is training performed? ACTION: speed up / go straight / turn right / turn left
t=0: state S0
t=1: state S1, reward R1: no reward, because the car left the road
How is training performed? ACTION: speed up / go straight / turn right / turn left
t=0: state S0
t=1: state S1, reward R1: a reward that depends on the situation
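One way to picture how these two outcomes shape later choices is a small value table that averages the rewards seen for each state and action. The deck does not say which learning algorithm DeepRacer uses, so this sketch swaps in a simple incremental average. Which action was taken in each episode, and the reward of 1.0, are assumptions made for illustration.

```python
from collections import defaultdict


def update_value(values, counts, state, action, reward):
    """Move the running average for (state, action) toward the new reward."""
    key = (state, action)
    counts[key] += 1
    values[key] += (reward - values[key]) / counts[key]


def best_action(values, state, actions):
    """Pick the action with the highest estimated reward in this state."""
    return max(actions, key=lambda a: values[(state, a)])


values = defaultdict(float)
counts = defaultdict(int)

# Episode 1 (assumed action): turning at S0 left the road, so R1 = 0.
update_value(values, counts, "S0", "turn_right", 0.0)
# Episode 2 (assumed action and reward): going straight stayed on the road.
update_value(values, counts, "S0", "straight", 1.0)

ACTIONS = ["speed_up", "straight", "turn_right", "turn_left"]
preferred = best_action(values, "S0", ACTIONS)
```

After these two episodes the table favors the action that earned a reward, which is the basic feedback loop the slides illustrate.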
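How is training performed? (https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018)
Summary
• DeepRacer is a means of learning reinforcement learning
• Reinforcement learning aside, DeepRacer is simply fun
• There are racing events too, so let's all take part!! In America?
"Learn reinforcement learning with me!! Now only $249"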
Let's start reinforcement learning with DeepRacer