DeepRacer DeepDive
jogannaoki
December 05, 2018
Transcript
Naoki Jogan, Classmethod, Inc., AWS Business Unit, Consulting Dev.
2018/12/5 DeepRacer DeepDive, re:Growth 2018
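Self-introduction: Naoki Jogan, AWS Business Unit, Consulting Department
• Solutions Architect
• Joined in […]; before that, […] as an application engineer at a systems integrator (SIer)
• Services I have been working with lately: EKS, Sumerian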
re:Invent 2018 Keynote
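Today's topics
• What can you do with DeepRacer? (~[…] min)
• How do you get DeepRacer running? (~[…] min)
• How does DeepRacer learn? (~[…] min)
What can you do with DeepRacer?
What can you do with DeepRacer? You can learn reinforcement learning.
Background to DeepRacer's creation: How can we put Reinforcement Learning in the hands of all developers?
DeepRacer is a means of learning reinforcement learning
• An exciting way to put reinforcement learning into developers' hands
• Provides hands-on experience with reinforcement learning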
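What is reinforcement learning?
Types of machine learning (source: UCL Course on RL, Lecture 1: Introduction to Reinforcement Learning)
• Supervised learning
• Unsupervised learning
• Reinforcement learning (DeepRacer)
Autonomous driving
Go and shogi
How is reinforcement learning done?
Reinforcement learning (example)
Reward
• Eat lots of food
• Move to where there are no enemies
RL algorithm
• Simulation
• Maximize the reward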
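The example above (rewards for eating food and avoiding enemies, learned through simulation so as to maximize reward) can be sketched in a few lines. The slide does not name an algorithm, so this sketch swaps in tabular Q-learning on a made-up five-cell corridor with an enemy at one end and food at the other. The layout, the constants, and the function names `step`, `train`, and `greedy_policy` are all assumptions made for illustration.

```python
import random

FOOD, ENEMY, START = 4, 0, 2        # corridor cells 0..4 (hypothetical layout)
ACTIONS = (-1, +1)                  # move left or move right
ALPHA, GAMMA = 0.5, 0.9             # learning rate and discount (assumed values)


def step(state, action):
    """Simulate one move and return (next_state, reward, done)."""
    nxt = state + action
    if nxt == FOOD:
        return nxt, 1.0, True       # ate the food
    if nxt == ENEMY:
        return nxt, -1.0, True      # ran into the enemy
    return nxt, 0.0, False


def train(episodes=500, seed=0):
    """Run simulated episodes and learn action values with Q-learning."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in ACTIONS} for s in range(ENEMY, FOOD + 1)}
    for _ in range(episodes):
        state, done = START, False
        while not done:
            action = rng.choice(ACTIONS)              # explore at random
            nxt, reward, done = step(state, action)
            best_next = 0.0 if done else max(q[nxt].values())
            # Move the estimate toward reward + discounted best future value.
            q[state][action] += ALPHA * (reward + GAMMA * best_next - q[state][action])
            state = nxt
    return q


def greedy_policy(q):
    """For each non-terminal cell, pick the action with the highest value."""
    return {s: max(q[s], key=q[s].get) for s in range(ENEMY + 1, FOOD)}


if __name__ == "__main__":
    print(greedy_policy(train()))
```

After training, the greedy policy heads toward the food from every cell, which is the "maximize the reward" bullet in miniature.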
Trying out DeepRacer
Steps to get DeepRacer running
• Step 1: Write the reward function
• Step 2: Simulation
• Step 3: Run on the physical car
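Step 1: Reward function
Reward function (Python)
Input
• distance_from_center: distance from the center line
• on_track: whether the front of the vehicle is outside the white line
• throttle: the car's speed, ranging from stopped to maximum
• track_width: width of the track
• and so on
Output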
Step 1: Reward function
marker_… = … track_width
marker_… = … track_width
marker_… = … track_width
reward = …e…
if distance_from_center … and distance_from_center … marker_…:
    reward = …
elif distance_from_center … marker_…:
    reward = …
elif distance_from_center … marker_…:
    reward = …
else:
    reward = …e…  # likely crashed / close to off track
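The reward logic on this slide is banded: the track is split into zones around the center line, and the reward drops as the car moves outward. The numeric values did not survive in the transcript, so the sketch below uses illustrative thresholds and rewards that are assumptions, not the slide's numbers. It is written as a plain function taking two of the inputs named earlier (`distance_from_center`, `track_width`); the name `reward_function` and this calling convention are also assumptions, and how the DeepRacer console actually passes inputs is not shown here.

```python
def reward_function(distance_from_center: float, track_width: float) -> float:
    """Banded reward: the closer to the center line, the higher the reward."""
    # Assumed band edges, as fractions of the track width (not from the slide).
    marker_1 = 0.1 * track_width
    marker_2 = 0.25 * track_width
    marker_3 = 0.5 * track_width

    if 0.0 <= distance_from_center <= marker_1:
        return 1.0      # assumed reward for hugging the center line
    elif distance_from_center <= marker_2:
        return 0.5      # assumed reward for the middle band
    elif distance_from_center <= marker_3:
        return 0.1      # assumed reward for the outer band
    else:
        return 1e-3     # likely crashed / close to off track


if __name__ == "__main__":
    for d in (0.05, 0.2, 0.4, 0.6):
        print(d, reward_function(d, track_width=1.0))
```

Returning a small positive value rather than zero in the last branch mirrors the `…e…` notation that survives in the transcript, but the exact constant is a guess.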
Step 2: Create the model
Step 3: Run on the physical car (trained model)
Architecture on AWS: SageMaker, RoboMaker, S3, Kinesis Video Streams, CloudWatch Logs, Client
How does DeepRacer learn?
How is training carried out? https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018
• STATE: DeepRacer's front camera
• REWARD: good if the car is close to the center line; good if it is carrying speed
• ACTION: speed up; turn right, turn left
How is training carried out?
ACTION
• Speed up
• Go straight
• Turn right
• Turn left
First attempt: at t=0 the car is in state S0 and picks an action; at t=1 it is in state S1 and receives reward R1: it left the road, so no reward.
Second attempt: at t=0 the car is in state S0 and picks an action; at t=1 it is in state S1 and receives reward R1: a reward that depends on the situation.
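The two sequences above (one action that takes the car off the road and earns nothing, another that earns a reward depending on where the car ends up) can be mimicked with a toy rollout. Everything here is hypothetical: the state is reduced to a single lateral offset from the center line instead of a camera image, only the steering actions are modeled, and the names `STEER`, `reward_for`, and `rollout` along with all constants are invented for this sketch.

```python
from dataclasses import dataclass

TRACK_HALF_WIDTH = 1.0   # assumed distance from center line to track edge
# Hypothetical lateral shift caused by each steering action in one time step.
STEER = {"turn_left": -0.5, "go_straight": 0.0, "turn_right": 0.5}


@dataclass
class Transition:
    t: int
    state: float        # S_t: offset from the center line before acting
    action: str         # A_t
    reward: float       # R_{t+1}
    next_state: float   # S_{t+1}


def reward_for(offset: float) -> float:
    """No reward off the road; otherwise more reward nearer the center."""
    if abs(offset) > TRACK_HALF_WIDTH:
        return 0.0
    return 1.0 - abs(offset) / TRACK_HALF_WIDTH


def rollout(start_offset: float, actions: list[str]) -> list[Transition]:
    """Apply the actions in order, logging each (S, A, R, S') step."""
    state, log = start_offset, []
    for t, action in enumerate(actions):
        nxt = state + STEER[action]
        log.append(Transition(t, state, action, reward_for(nxt), nxt))
        if abs(nxt) > TRACK_HALF_WIDTH:
            break                      # the car left the road: stop here
        state = nxt
    return log


if __name__ == "__main__":
    print(rollout(0.75, ["turn_right"]))   # drifts off the road
    print(rollout(0.75, ["turn_left"]))    # steers back toward the center
```

Starting right of center, turning right ends the attempt with zero reward, while turning left is rewarded in proportion to how close the car gets to the center line.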
How is training carried out? https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018
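Summary
• DeepRacer is a means of learning reinforcement learning
• DeepRacer is simply fun, reinforcement learning aside
• There is a racing competition, so let's all take part!! In America?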
Learn reinforcement learning with me!! Now only $249
Let's start reinforcement learning with DeepRacer