Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
DeepRacer DeepDive
Search
jogannaoki
December 05, 2018
0
970
DeepRacer DeepDive
jogannaoki
December 05, 2018
Tweet
Share
More Decks by jogannaoki
See All by jogannaoki
エンジニアの越境
jogannaoki
0
5.7k
[re:Growth 2019] Fargate for EKS
jogannaoki
0
1.1k
Kubernetesを使うことによるメリットと注意点
jogannaoki
0
2.1k
Developers.IO 2018 BaaSでゲームインフラをスピード構築
jogannaoki
1
890
Featured
See All Featured
BBQ
matthewcrist
89
9.7k
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
46
9.6k
Documentation Writing (for coders)
carmenintech
72
4.9k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
PRO
20
1.3k
jQuery: Nuts, Bolts and Bling
dougneiner
63
7.8k
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
18
970
Building Better People: How to give real-time feedback that sticks.
wjessup
367
19k
A better future with KSS
kneath
238
17k
Measuring & Analyzing Core Web Vitals
bluesmoon
7
510
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
29
9.6k
KATA
mclloyd
30
14k
The Invisible Side of Design
smashingmag
301
51k
Transcript
Naoki Jogan 1 Classmethod, Inc. AWS Business Unit Consulting Dev.
2018/12/5 %FFQ3BDFS%FFQ %JWF re:Growth 2018
ࣗݾհ • ιϦϡʔγϣϯΞʔΩςΫτ • ݄ೖࣾ ؒ4*FSͰ"1ΤϯδχΞ • ࠷ۙ͞Θ͍ͬͯΔαʔϏε &,4
4VNFSJBO ؛رʢδϣΨϯφΦΩʣ "84ࣄۀຊ෦ίϯαϧςΟϯά෦
re:Invent 2018 KeyNote
ࠓ͢͜ͱ w%FFQ3BDFSͰԿ͕Ͱ͖Δͷ͔ɹʙNJO w%FFQ3BDFSΛͲ͏ͬͯಈ͔͢ͷ͔ɹʙNJO w%FFQ3BDFSͲ͏ֶͬͯΜͰ͍Δͷ͔ɹʙNJO
%FFQ3BDFSͰԿ͕Ͱ͖Δͷʁ 5
%FFQ3BDFSͰԿ͕Ͱ͖Δͷʁ ڧԽֶशΛֶΔ
%FFQ3BDFSੜͷഎܠ How can we put Reinforcement Learning in the
hands of all developers?
%FFQ3BDFSڧԽֶशΛֶͿͨΊͷखஈ wڧԽֶशΛ։ൃऀͷखʹ͢ ΤΩαΠςΟϯάͳखஈ wڧԽֶशͷϋϯζΦϯΛఏڙ
ڧԽֶशͱʁ
ػցֶशͷछྨ UCL Course on RL Lecture 1: ntroduction to
Reinforcement Learning wڭࢣ͋Γֶश wڭࢣͳֶ͠श wڧԽֶशʢ%FFQ3BDFSʣ
ࣗಈӡస
ғޟকع
ڧԽֶशͬͯͲ͏Δͷʁ
ڧԽֶशʢྫʣ ใु RLΞϧΰϦζϜ wΤαΛͨ͘͞Μ৯ͯ wఢ͕͍ͳ͍ͱ͜ΖʹਐΜͰ wγϛϡϨʔγϣϯ wใुΛ࠷େԽ
%FFQ3BDFSΛಈ͔ͯ͠ΈΔ 15
%FFQ3BDFSΛಈ͔͢·Ͱ wεςοϓ̍ɿ3FXBSEؔ࡞ wεςοϓ̎ɿγϛϡϨʔγϣϯ wεςοϓ̏ɿཧͰΒͤΔ
%FFQ3BDFSΛಈ͔͢·Ͱ wεςοϓ̍ɿ3FXBSEؔ࡞ wεςοϓ̎ɿγϛϡϨʔγϣϯ wεςοϓ̏ɿཧͰΒͤΔ
εςοϓ̍ɿ3FXBSEؔ ใुؔʢPythonʣ ɾEJTUBODF@GSPN@DFOUFS ɹηϯλʔϥΠϯ͔Βͷڑ ɾPO@USBDL ɹं྆ͷલ෦͕നઢͷ֎ଆʹ͋Δ͔Ͳ͏͔ ɾUISPUUMF ɹंͷɹఀࢭΛࣔ͠ɺ࠷ߴ ɾUSBDL@XJEUI
ɹτϥοΫ෯ ɹ ɹͳͲͳͲ Πϯϓοτ Ξτϓοτ
εςοϓ̍ɿ3FXBSEؔ NBSLFS@ USBDL@XJEUI NBSLFS@ USBDL@XJEUI NBSLFS@ USBDL@XJEUI SFXBSEF JGEJTUBODF@GSPN@DFOUFSBOEEJTUBODF@GSPN@DFOUFSNBSLFS@
SFXBSE FMJGEJTUBODF@GSPN@DFOUFSNBSLFS@ SFXBSE FMJGEJTUBODF@GSPN@DFOUFSNBSLFS@ SFXBSE FMTF SFXBSEFMJLFMZDSBTIFEDMPTFUPPGGUSBDL
%FFQ3BDFSΛಈ͔͢·Ͱ wεςοϓ̍ɿ3FXBSEؔ࡞ wεςοϓ̎ɿγϛϡϨʔγϣϯ wεςοϓ̏ɿཧͰΒͤΔ
εςοϓ̎ɿϞσϧ࡞
εςοϓ̎ɿϞσϧ࡞
%FFQ3BDFSΛಈ͔͢·Ͱ wεςοϓ̍ɿ3FXBSEؔ࡞ wεςοϓ̎ɿγϛϡϨʔγϣϯ wεςοϓ̏ɿཧͰΒͤΔ
εςοϓ̏ɿཧͰΒͤΔ ֶशϞσϧ
εςοϓ̏ɿཧͰΒͤΔ
"84্ͷΞʔΩςΫνϟ SageMaker RoboMaker S3 kinesis video streams CloudWatch Logs
Client
%FFQ3BDFSͲ͏ֶͬͯΜͰ͍Δͷ͔ 27
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018 ɾ45"5&ɿঢ়ଶ ɹ%FFQ3BDFSͷϑϩϯτΧϝϥ ɾ3&8"3%ɿใु ɹηϯλʔϥΠϯʹ͚ۙΕ(PPE ɹεϐʔυ͕ग़͍ͯΕ(PPE ɾ"$5*0/ɿߦಈ ɹεϐʔυΛ্͛Δ
ɹӈʹۂ͕Δɺࠨʹۂ͕Δ
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ t=0 S0
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ t=0 S0
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ t=1 S "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ 1
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ t=1 S "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ R
:ಓ֎ΕͨͷͰใुͳ͠ 1
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ t=0 S0
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ t=0 S0
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ t=1 S "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ 1
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ t=1 S "$5*0/ ɾεϐʔυΛ্͛Δ ɾਐ͢Δ ɾӈʹۂ͕Δ ɾࠨʹۂ͕Δ R
:ঢ়گʹԠͨ͡ใु 1
τϨʔχϯάͲͷΑ͏ʹͯ͠ߦΘΕΔ͔ʁ https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018
·ͱΊ 38
·ͱΊ w%FFQ3BDFSڧԽֶशΛֶͿͨΊͷखஈ w%FFQ3BDFSڧԽֶशͱ͔ؔͳ͘୯७ʹָ͍͠ wϨʔεେձ͋ΔͷͰΈΜͳࢀՃ͠Α͏ʂʂΞϝϦΧͰʁ
40 ԶͰڧԽֶशΛ ֶΜͰ͘Εʂʂ ࠓͳΒˈ249
Let's start reinforcement learning with DeepRacer
42