Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
DeepRacer DeepDive
Search
jogannaoki
December 05, 2018
0
830
DeepRacer DeepDive
jogannaoki
December 05, 2018
Tweet
Share
More Decks by jogannaoki
See All by jogannaoki
エンジニアの越境
jogannaoki
0
2.6k
[re:Growth 2019] Fargate for EKS
jogannaoki
0
900
Kubernetesを使うことによるメリットと注意点
jogannaoki
0
1.8k
Developers.IO 2018 BaaSでゲームインフラをスピード構築
jogannaoki
1
800
Featured
See All Featured
Designing Experiences People Love
moore
135
23k
What's new in Ruby 2.0
geeforr
335
31k
Typedesign – Prime Four
hannesfritz
36
2k
Building Better People: How to give real-time feedback that sticks.
wjessup
350
18k
Unsuck your backbone
ammeep
660
56k
Code Review Best Practice
trishagee
54
15k
A better future with KSS
kneath
230
16k
Art, The Web, and Tiny UX
lynnandtonic
288
19k
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
8
8.2k
The Language of Interfaces
destraynor
150
22k
Building Flexible Design Systems
yeseniaperezcruz
317
37k
Side Projects
sachag
451
41k
Transcript
Naoki Jogan 1 Classmethod, Inc. AWS Business Unit Consulting Dev.
2018/12/5 %FFQ3BDFS%FFQ %JWF re:Growth 2018
ࣗݾհ • ιϦϡʔγϣϯΞʔΩςΫτ • ݄ೖࣾ ؒ4*FSͰ"1ΤϯδχΞ • ࠷ۙ͞Θ͍ͬͯΔαʔϏε &,4
4VNFSJBO ؛رʢδϣΨϯφΦΩʣ "84ࣄۀຊ෦ίϯαϧςΟϯά෦
re:Invent 2018 KeyNote
ࠓ͢͜ͱ w%FFQ3BDFSͰԿ͕Ͱ͖Δͷ͔ɹʙNJO w%FFQ3BDFSΛͲ͏ͬͯಈ͔͢ͷ͔ɹʙNJO w%FFQ3BDFSͲ͏ֶͬͯΜͰ͍Δͷ͔ɹʙNJO
%FFQ3BDFSͰԿ͕Ͱ͖Δͷʁ 5
%FFQ3BDFSͰԿ͕Ͱ͖Δͷʁ ڧԽֶशΛֶΔ
%FFQ3BDFSੜͷഎܠ How can we put Reinforcement Learning in the
hands of all developers?
%FFQ3BDFSڧԽֶशΛֶͿͨΊͷखஈ wڧԽֶशΛ։ൃऀͷखʹ͢ ΤΩαΠςΟϯάͳखஈ wڧԽֶशͷϋϯζΦϯΛఏڙ
ڧԽֶशͱʁ
ػցֶशͷछྨ UCL Course on RL Lecture 1: ntroduction to
Reinforcement Learning wڭࢣ͋Γֶश wڭࢣͳֶ͠श wڧԽֶशʢ%FFQ3BDFSʣ
ࣗಈӡస
ғޟকع
ڧԽֶशͬͯͲ͏Δͷʁ
ڧԽֶशʢྫʣ ใु RLΞϧΰϦζϜ wΤαΛͨ͘͞Μ৯ͯ wఢ͕͍ͳ͍ͱ͜ΖʹਐΜͰ wγϛϡϨʔγϣϯ wใुΛ࠷େԽ
%FFQ3BDFSΛಈ͔ͯ͠ΈΔ 15
%FFQ3BDFSΛಈ͔͢·Ͱ wεςοϓ̍ɿ3FXBSEؔ࡞ wεςοϓ̎ɿγϛϡϨʔγϣϯ wεςοϓ̏ɿཧͰΒͤΔ
%FFQ3BDFSΛಈ͔͢·Ͱ wεςοϓ̍ɿ3FXBSEؔ࡞ wεςοϓ̎ɿγϛϡϨʔγϣϯ wεςοϓ̏ɿཧͰΒͤΔ
εςοϓ̍ɿ3FXBSEؔ ใुؔʢPythonʣ ɾEJTUBODF@GSPN@DFOUFS ɹηϯλʔϥΠϯ͔Βͷڑ ɾPO@USBDL ɹं྆ͷલ෦͕നઢͷ֎ଆʹ͋Δ͔Ͳ͏͔ ɾUISPUUMF ɹंͷɹఀࢭΛࣔ͠ɺ࠷ߴ ɾUSBDL@XJEUI
ɹτϥοΫ෯ ɹ ɹͳͲͳͲ Πϯϓοτ Ξτϓοτ
εςοϓ̍ɿ3FXBSEؔ NBSLFS@ USBDL@XJEUI NBSLFS@ USBDL@XJEUI NBSLFS@ USBDL@XJEUI SFXBSEF JGEJTUBODF@GSPN@DFOUFSBOEEJTUBODF@GSPN@DFOUFSNBSLFS@
SFXBSE FMJGEJTUBODF@GSPN@DFOUFSNBSLFS@ SFXBSE FMJGEJTUBODF@GSPN@DFOUFSNBSLFS@ SFXBSE FMTF SFXBSEFMJLFMZDSBTIFEDMPTFUPPGGUSBDL
%FFQ3BDFSΛಈ͔͢·Ͱ wεςοϓ̍ɿ3FXBSEؔ࡞ wεςοϓ̎ɿγϛϡϨʔγϣϯ wεςοϓ̏ɿཧͰΒͤΔ
εςοϓ̎ɿϞσϧ࡞
εςοϓ̎ɿϞσϧ࡞
%FFQ3BDFSΛಈ͔͢·Ͱ wεςοϓ̍ɿ3FXBSEؔ࡞ wεςοϓ̎ɿγϛϡϨʔγϣϯ wεςοϓ̏ɿཧͰΒͤΔ
εςοϓ̏ɿཧͰΒͤΔ ֶशϞσϧ
εςοϓ̏ɿཧͰΒͤΔ
"84্ͷΞʔΩςΫνϟ SageMaker RoboMaker S3 kinesis video streams CloudWatch Logs
Client
%FFQ3BDFSͲ͏ֶͬͯΜͰ͍Δͷ͔ 27
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018 ɾ45"5&ɿঢ়ଶ ɹ%FFQ3BDFSͷϑϩϯτΧϝϥ ɾ3&8"3%ɿใु ɹηϯλʔϥΠϯʹ͚ۙΕ(PPE ɹεϐʔυ͕ग़͍ͯΕ(PPE ɾ"$5*0/ɿߦಈ ɹεϐʔυΛ্͛Δ
ɹӈʹۂ͕Δɺࠨʹۂ͕Δ
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ t=0 S0
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ t=0 S0
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ t=1 S "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ 1
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ t=1 S "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ R
:ಓ֎ΕͨͷͰใुͳ͠ 1
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ t=0 S0
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ t=0 S0
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ t=1 S "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ 1
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ t=1 S "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ R
:ঢ়گʹԠͨ͡ใु 1
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018
·ͱΊ 38
·ͱΊ w%FFQ3BDFSڧԽֶशΛֶͿͨΊͷखஈ w%FFQ3BDFSڧԽֶशͱ͔ؔͳ͘୯७ʹָ͍͠ wϨʔεେձ͋ΔͷͰΈΜͳࢀՃ͠Α͏ʂʂΞϝϦΧͰʁ
40 ԶͰڧԽֶशΛ ֶΜͰ͘Εʂʂ ࠓͳΒˈ249
Let's start reinforcement learning with DeepRacer
42