Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
強化学習による振り子の制御.pdf
Search
Minagawa Kei
June 20, 2020
0
4.1k
強化学習による振り子の制御.pdf
Minagawa Kei
June 20, 2020
Tweet
Share
More Decks by Minagawa Kei
See All by Minagawa Kei
t-sne を調べてみた(実装編)
keimina
0
360
t-sne_を調べてみた_途中まで_.pdf
keimina
0
190
離散化の最適化を行うモデルの検討_実装してみた.pdf
keimina
0
270
Pandas_勉強会の紹介_20191102_機械学習勉強会.pdf
keimina
0
360
Featured
See All Featured
Evolution of real-time – Irina Nazarova, EuRuKo, 2024
irinanazarova
8
650
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
251
21k
How STYLIGHT went responsive
nonsquared
99
5.5k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
47
2.5k
The Language of Interfaces
destraynor
157
24k
The World Runs on Bad Software
bkeepers
PRO
67
11k
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
31
1.1k
The Straight Up "How To Draw Better" Workshop
denniskardys
232
140k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
135
33k
Documentation Writing (for coders)
carmenintech
69
4.7k
VelocityConf: Rendering Performance Case Studies
addyosmani
328
24k
Automating Front-end Workflow
addyosmani
1369
200k
Transcript
,FJ.JOBHBXB !LFJNJOB ڧԽֶशʹΑΔৼΓࢠͷ੍ޚ 1FOEVMVN7̌
IUUQTLFJNJOBIBUFOBCMPHKQ
ࠓճɺڧԽֶशͰ ৼΓࢠͷ੍ޚΛߦ͍·ͨ͠ɻ ͲͷΑ͏ͳ͜ͱΛߦ͍ɺ݁Ռ͕Ͳ͏ͳ͔ͬͨɺ ͬ͘͟Γͱઆ໌͠·͢
·ͣɺৼΓࢠͷಈ͖Λ1$Ͱ࠶ݱͰ͖ΔγϛϡϨʔλΛಋೖ͠·ͨ͠ɻ QJQJOTUBMMHZN ಈ͔͍ͨ͠ํೖΕ·͠ΐ͏ɻ
JNQPSUHZN JNQPSUUJNF FOWHZNNBLF 1FOEVMVNW FOWSFTFU GPS@JOSBOHF
FOWTUFQ <> FOWSFOEFS UJNFTMFFQ
JNQPSUHZN JNQPSUUJNF FOWHZNNBLF 1FOEVMVNW FOWSFTFU GPS@JOSBOHF
FOWTUFQ <> FOWSFOEFS UJNFTMFFQ τϧΫΛ༩͑ͯ̍εςοϓਐΊΔ
None
1FOEVMVNWͷύϥϝʔλҎԼʹͳΓ·͢ɻ ্͔Βॱʹɺ֯ɺ֯ɺτϧΫͰ͢ɻ ֯ɺ֯ঢ়ଶΛද͠·͢ɻ τϧΫߦಈΛද͠·͢ɻ ˞ύϥϝʔλͷऔΓ͏Δʹ͍ͭͯҎԼͷαΠτʹهࡌ͞Ε͍ͯ·͢ ɹIUUQTHJUIVCDPNPQFOBJHZNXJLJ1FOEVMVNW
ֶशΞϧΰϦζϜͷ
ڧԽֶशͰग़ͯ͘Δొਓ ঢ়ଶTͷ࣌ɺํࡦКʹै͍ߦಈBΛߦ͏ͱɺঢ়ଶભҠ֬Qʹै͍ɺ ঢ়ଶ͕TʹભҠ͠ɺใुSΛಘΔɻ ڧԽֶशͷΞϧΰϦζϜɺใुSͷ૯͕࠷େͱͳΔΑ͏ͳɺ TͱBͷΈ߹ΘͤΛݟ͚ͭग़͢Α͏ʹͳ͍ͬͯ·͢ɻ T B T Q К
ొਓΛ༻͠ɺ 1 3 W RΛࣜ ͷΑ͏ʹఆٛ͠·͢
˞ࢀߟจݙʮݱͰ͑Δʂ1ZUIPOਂڧԽֶशೖڧԽֶशͱਂֶशʹΑΔ୳ࡧͱ੍ޚʯ 1 1ΑΓҾ༻
ࠓճ༻͢ΔڧԽֶशͷֶशΞϧΰϦζϜҎԼʹͳΓ·͢ɻ લทͷࣜ ΛܭࢉͰٻΊ·͢ɻ શͯͷTʹ͍ͭͯࣜ ͷR͕࠷େͱͳΔB
ߦಈ ΛٻΊ·͢ɻ ํࡦКʹ্هT BΛ༩͑ͨ࣌ɺͦͷ͕֬ʹͳΔΑ͏ʹํࡦΛߋ৽͠·͢ɻ ্هΛ܁Γฦ͠·͢ɻ ˞্هɺֶशΞϧΰϦζϜͷҰͭͰ͢ɻ
લॲཧ ࠓճɺ֯ͳͲͷύϥϝʔλͷΛࢄԽ͠·͢ɻ ࢄԽ͢Δͱঢ়ଶભҠ֬ҎԼͷΑ͏ͳදͰදݱͰ͖·͢ɻ > > >
> > > > > > > > > > > > > > > > >
ֶशΞϧΰϦζϜΛ༻͠ɺίʔυΛ࣮͠·ͨ͠ ˣ IUUQTHJTUHJUIVCDPNLFJNJOBBGGDCCBCB
݁Ռ
None
ߟ ɾͰ͖Δ্͚ͩ෦ʹ͍͘Α͏ʹɺΕΔൣғͰؤு͍ͬͯΔΑ͏ʹݟ͑Δ ɾ্ʹͱͲ·Δͷ͕࠷దղ͔ͱࢥ͕ͬͨɺ࣮ࡍͦ͏ͳΒͳ͔ͬͨ ɹɹɾ·ͩɺ࠷దղʹͨͲΓ͍͍ͭͯͳ͍ ɹɹɾࢄԽͨ͠ঢ়ଶͰ͜Ε͕࠷దͰ͋ΔՄೳੑ ɹɹɾ্ʹͱͲ·Δͷ͕࠷దͱࢥ͍ͬͯΔਓ͕ؒؒҧ͍ͬͯΔ ɹɹɾ࣮͕ؒҧ͍ͬͯΔՄೳੑ ·ͱΊ ڧԽֶशͰৼΓࢠͷ੍ޚΛߦ͍·ͨ͠
ࢀߟจݙ ݱͰ͑Δʂ1ZUIPOਂڧԽֶशೖڧԽֶशͱਂֶशʹΑΔ୳ࡧͱ੍ޚ IUUQTXXXTIPFJTIBDPKQCPPLEFUBJM ͜Ε͔ΒͷڧԽֶश IUUQTXXXNPSJLJUBDPKQCPPLTCPPL