Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
NeuralIPS
Search
Wonseok Jung
December 11, 2018
0
360
NeuralIPS
Wonseok Jung
December 11, 2018
Tweet
Share
More Decks by Wonseok Jung
See All by Wonseok Jung
Ai for business -self car driving
wonseokjung
0
180
reinforcement_learning_.pdf
wonseokjung
2
1.5k
원석이의 모두연에서 강화학습 보석되기
wonseokjung
0
390
Introduction Deep Reinforcement Learning
wonseokjung
0
130
Deep reinforcemenet learning -2
wonseokjung
0
170
Deep Reinforcement Learning - Introduction
wonseokjung
1
610
How to become a datascientist ?
wonseokjung
2
2.3k
Review of Taylor series
wonseokjung
1
120
꿈꾸는 Agent
wonseokjung
2
110
Featured
See All Featured
Thoughts on Productivity
jonyablonski
67
4.3k
Into the Great Unknown - MozCon
thekraken
32
1.5k
How to train your dragon (web standard)
notwaldorf
88
5.7k
A Modern Web Designer's Workflow
chriscoyier
693
190k
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
42
9.2k
実際に使うSQLの書き方 徹底解説 / pgcon21j-tutorial
soudai
169
50k
Building Flexible Design Systems
yeseniaperezcruz
327
38k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
232
17k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
38
1.8k
Speed Design
sergeychernyshev
24
610
Optimising Largest Contentful Paint
csswizardry
33
2.9k
Build your cross-platform service in a week with App Engine
jlugia
229
18k
Transcript
3-JOWJUFEUBML/FVSBM*14 8POTFPL+VOH
None
None
None
43࠙ীࢲ gradient descent methodۄҊ ೞחؘ, gradient ascent. rewardܳ maximize ೞח
policyܳ ӝ ਤ೧ࢲח gradient aascent .
43࠙ীࢲ gradient descent methodۄҊ ೞחؘ, gradient ascent. rewardܳ maximize ೞח
policyܳ ӝ ਤ೧ࢲח gradient aascent ܳ ೧ঠೠ.
None
None
None
None
None