NeuralIPS
Wonseok Jung
December 11, 2018
Transcript
RL Invited Talk, NeuralIPS, Wonseok Jung
At the 43-minute mark, the speaker says "gradient descent method", but it should be gradient ascent. To find a policy that maximizes reward, you have to perform gradient ascent.
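The sign convention in the note above can be illustrated with a small sketch. It is not taken from the talk: the two-armed bandit, the softmax policy, the reward values, and every function name are assumptions made for illustration. The parameters are moved along the gradient of the expected reward (ascent), not against it, and the expected reward rises as a result.

```python
import math

def softmax(theta):
    """Turn preferences theta into action probabilities."""
    m = max(theta)  # subtract the max for numerical stability
    exps = [math.exp(t - m) for t in theta]
    z = sum(exps)
    return [e / z for e in exps]

def expected_reward(theta, rewards):
    """J(theta) = sum_a pi_theta(a) * r(a)."""
    return sum(p * r for p, r in zip(softmax(theta), rewards))

def grad_expected_reward(theta, rewards):
    """Exact gradient for a softmax policy: dJ/dtheta_k = pi_k * (r_k - J)."""
    pi = softmax(theta)
    j = sum(p * r for p, r in zip(pi, rewards))
    return [p * (r - j) for p, r in zip(pi, rewards)]

def gradient_ascent(theta, rewards, lr=0.5, steps=200):
    """Step along +gradient because the objective is maximized."""
    theta = list(theta)
    for _ in range(steps):
        g = grad_expected_reward(theta, rewards)
        theta = [t + lr * gi for t, gi in zip(theta, g)]  # '+' is ascent
    return theta

# Hypothetical bandit: arm 0 pays 1.0, arm 1 pays 0.0.
rewards = [1.0, 0.0]
theta0 = [0.0, 0.0]
theta_up = gradient_ascent(theta0, rewards)

print(expected_reward(theta0, rewards))    # uniform policy
print(expected_reward(theta_up, rewards))  # after ascent, closer to 1.0
```

Replacing the `+` in the update with `-` would turn this into gradient descent on the reward, which pushes the policy toward the worse arm. Descent is only the right word when the quantity being minimized is a loss such as the negative expected reward.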