Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
NeuralIPS
Search
Wonseok Jung
December 11, 2018
0
300
NeuralIPS
Wonseok Jung
December 11, 2018
Tweet
Share
More Decks by Wonseok Jung
See All by Wonseok Jung
Ai for business -self car driving
wonseokjung
0
170
reinforcement_learning_.pdf
wonseokjung
2
1.5k
원석이의 모두연에서 강화학습 보석되기
wonseokjung
0
390
Introduction Deep Reinforcement Learning
wonseokjung
0
120
Deep reinforcemenet learning -2
wonseokjung
0
160
Deep Reinforcement Learning - Introduction
wonseokjung
1
600
How to become a datascientist ?
wonseokjung
2
2.3k
Review of Taylor series
wonseokjung
1
110
꿈꾸는 Agent
wonseokjung
2
96
Featured
See All Featured
Designing Dashboards & Data Visualisations in Web Apps
destraynor
226
51k
The Illustrated Children's Guide to Kubernetes
chrisshort
30
46k
I Don’t Have Time: Getting Over the Fear to Launch Your Podcast
jcasabona
20
1.6k
Code Review Best Practice
trishagee
54
15k
The MySQL Ecosystem @ GitHub 2015
samlambert
242
12k
10 Git Anti Patterns You Should be Aware of
lemiorhan
647
58k
Six Lessons from altMBA
skipperchong
20
3k
Scaling GitHub
holman
457
140k
Testing 201, or: Great Expectations
jmmastey
27
6.3k
The Straight Up "How To Draw Better" Workshop
denniskardys
227
130k
Stop Working from a Prison Cell
hatefulcrawdad
266
19k
RailsConf 2023
tenderlove
2
530
Transcript
3-JOWJUFEUBML/FVSBM*14 8POTFPL+VOH
None
None
None
43࠙ীࢲ gradient descent methodۄҊ ೞחؘ, gradient ascent. rewardܳ maximize ೞח
policyܳ ӝ ਤ೧ࢲח gradient aascent .
43࠙ীࢲ gradient descent methodۄҊ ೞחؘ, gradient ascent. rewardܳ maximize ೞח
policyܳ ӝ ਤ೧ࢲח gradient aascent ܳ ೧ঠೠ.
None
None
None
None
None