DeepRacer DeepDive
jogannaoki
December 05, 2018
Transcript
Naoki Jogan, Classmethod, Inc., AWS Business Unit, Consulting Dev.
2018/12/5 DeepRacer Deep Dive, re:Growth 2018
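Self-introduction
Naoki Jogan, AWS Business Unit, Consulting Department
• Solutions Architect
• Joined in month …; before that, … years as an application engineer at a systems integrator (SIer)
• Services I have been working with recently: EKS, Sumerian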
re:Invent 2018 KeyNote
Today's topics
• What can you do with DeepRacer? (~… min)
• How do you get DeepRacer running? (~… min)
• How does DeepRacer learn? (~… min)
What can you do with DeepRacer?
• You can learn reinforcement learning
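Background to DeepRacer's creation
"How can we put Reinforcement Learning in the hands of all developers?"
DeepRacer is a means of learning reinforcement learning
• An exciting way to put reinforcement learning in developers' hands
• Provides hands-on experience with reinforcement learning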
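What is reinforcement learning?
Types of machine learning (source: UCL Course on RL, Lecture 1: Introduction to Reinforcement Learning)
• Supervised learning
• Unsupervised learning
• Reinforcement learning (DeepRacer)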
Autonomous driving
Go and shogi
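How do you do reinforcement learning?
Reinforcement learning (example): reward and RL algorithm
• Eat as much food as possible
• Move to where there are no enemies
• Simulation
• Maximize the reward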
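The example above (seek food, avoid enemies, simulate, maximize reward) can be sketched in a few lines. The slide does not name a specific RL algorithm, so this sketch swaps in tabular Q-learning as one simple choice. The one-dimensional world, the reward values, and all function names are hypothetical illustrations, not taken from the talk.

```python
import random

# Hypothetical 1-D world: cell 0 holds an enemy, the last cell holds food.
N_CELLS = 6
ENEMY, FOOD = 0, N_CELLS - 1
ACTIONS = (-1, +1)  # move left, move right


def step(state, action):
    """Simulate one move and return (next_state, reward, done)."""
    nxt = state + action
    if nxt == FOOD:
        return nxt, 1.0, True    # ate the food
    if nxt == ENEMY:
        return nxt, -1.0, True   # ran into the enemy
    return nxt, 0.0, False


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning: learn action values from simulated episodes."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_CELLS) for a in ACTIONS}
    for _ in range(episodes):
        state, done = rng.randrange(1, N_CELLS - 1), False
        while not done:
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)                        # explore
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])  # exploit
            nxt, reward, done = step(state, action)
            best_next = 0.0 if done else max(q[(nxt, a)] for a in ACTIONS)
            # Move the estimate toward reward + discounted future value.
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state = nxt
    return q


def greedy_policy(q):
    """Best learned action for each non-terminal cell."""
    return {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(1, N_CELLS - 1)}
```

Calling `greedy_policy(train())` returns the learned move for each cell; after enough simulated episodes the agent should prefer moving toward the food and away from the enemy.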
Let's try running DeepRacer
Steps to get DeepRacer running
• Step 1: Create the reward function
• Step 2: Simulation
• Step 3: Run it on the physical car
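Step 1: Reward function
Reward function (Python): input → output
• distance_from_center: distance from the center line
• on_track: whether the front of the vehicle is outside the white line
• throttle: vehicle speed, ranging from stopped up to maximum
• track_width: width of the track
• and so on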
Step 1: Reward function
    marker_… = … * track_width
    marker_… = … * track_width
    marker_… = … * track_width
    reward = …e…
    if distance_from_center … and distance_from_center … marker_…:
        reward = …
    elif distance_from_center … marker_…:
        reward = …
    elif distance_from_center … marker_…:
        reward = …
    else:
        reward = …e…  # likely crashed / close to off track
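The tiered reward on this slide can be sketched as a runnable function. The numeric literals on the slide did not survive, so the fractions (0.1, 0.25, 0.5), the reward values, and the function name `reward_function` are assumptions chosen only to make the structure concrete. The signature is also simplified to the two inputs the visible code uses.

```python
def reward_function(distance_from_center, track_width):
    """Tiered reward: the closer to the center line, the higher the reward."""
    # ASSUMPTION: these fractions of the track width are illustrative values.
    marker_1 = 0.1 * track_width
    marker_2 = 0.25 * track_width
    marker_3 = 0.5 * track_width

    # ASSUMPTION: the reward levels below are illustrative values.
    if 0.0 <= distance_from_center <= marker_1:
        reward = 1.0
    elif distance_from_center <= marker_2:
        reward = 0.5
    elif distance_from_center <= marker_3:
        reward = 0.1
    else:
        reward = 1e-3  # likely crashed / close to off track
    return float(reward)
```

For a track one unit wide, `reward_function(0.05, 1.0)` falls in the innermost band, while `reward_function(0.6, 1.0)` falls through to the near-zero reward in the `else` branch.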
Step 2: Create the model
Step 3: Run it on the physical car
• Load the trained model onto the car
Architecture on AWS
• SageMaker
• RoboMaker
• S3
• Kinesis Video Streams
• CloudWatch Logs
• Client
How does DeepRacer learn?
How is training carried out?
Source: https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018
• STATE: the image from DeepRacer's front camera
• REWARD: good if the car is close to the center line; good if it is going fast
• ACTION: speed up; turn right; turn left
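The REWARD criteria above (near the center line is good, speed is good) can be combined into a single number. This is a minimal sketch, not the reward used in the talk. The equal weighting, the function name `combined_reward`, and the assumption that `throttle` is normalized to the range 0 to 1 are all illustrative.

```python
def combined_reward(distance_from_center, track_width, throttle):
    """Score closeness to the center line and speed, each weighted equally."""
    half_width = track_width / 2.0
    if distance_from_center > half_width:
        return 0.0  # outside the track: nothing to reward

    # 1.0 on the center line, falling linearly to 0.0 at the track edge.
    closeness = 1.0 - distance_from_center / half_width

    # ASSUMPTION: throttle is in [0, 1]; clamp in case the caller passes more.
    speed = min(max(throttle, 0.0), 1.0)

    # ASSUMPTION: equal weights; a real reward would tune this trade-off.
    return 0.5 * closeness + 0.5 * speed
```

A car on the center line at full throttle scores 1.0, and the same car at a standstill scores 0.5, so the agent is nudged toward both behaviors at once.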
How is training carried out?
ACTION: speed up, go straight, turn right, turn left
• t=0: the car is in state S0 and chooses an action.
• t=1: the car is in state S1. If it has gone off the road, R1 is no reward.
• The sequence repeats: from S0 at t=0 the car acts again, and at t=1 in S1 it receives R1, a reward that depends on the situation.
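The state, action, reward sequence above can be sketched as a loop that records one transition per time step. The real state is a camera image. Here a hypothetical toy simulator stands in for it, using only the car's lateral offset from the center line, and the step sizes and reward values are assumptions.

```python
from dataclasses import dataclass

ACTIONS = ("speed_up", "go_straight", "turn_right", "turn_left")


@dataclass
class Transition:
    t: int             # time step at which the action was taken
    state: float       # S_t: hypothetical state, lateral offset from center
    action: str        # action chosen in S_t
    reward: float      # R_{t+1}: reward received after the action
    next_state: float  # S_{t+1}


def simulate(state, action, track_half_width=1.0):
    """Toy simulator: apply the action and return (next_state, reward)."""
    shift = {"turn_right": 0.6, "turn_left": -0.6}.get(action, 0.0)
    next_state = state + shift
    if abs(next_state) > track_half_width:
        return next_state, 0.0  # went off the road: no reward
    bonus = 0.5 if action == "speed_up" else 0.0
    # Otherwise the reward depends on the situation: closeness plus speed.
    return next_state, 1.0 - abs(next_state) / track_half_width + bonus


def rollout(start_state, actions):
    """Run a sequence of actions and log every (S, A, R, S') step."""
    state, log = start_state, []
    for t, action in enumerate(actions):
        next_state, reward = simulate(state, action)
        log.append(Transition(t, state, action, reward, next_state))
        state = next_state
    return log
```

For instance, `rollout(0.8, ["turn_right"])` pushes the car off the road and logs a reward of 0.0, while `rollout(0.0, ["speed_up"])` keeps it centered and logs a positive reward. A training algorithm would consume such logs to prefer the actions that earned more.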
How is training carried out?
Source: https://www.slideshare.net/AmazonWebServices/robocar-rally-2018-aim206r20-aws-reinvent-2018
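Summary
• DeepRacer is a means of learning reinforcement learning
• Reinforcement learning aside, DeepRacer is simply fun
• There are race events, so let's all take part! (In America?)
"Learn reinforcement learning with me!! Now only $249"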
Let's start reinforcement learning with DeepRacer