Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
NeuralIPS
Search
Wonseok Jung
December 11, 2018
440
0
Share
NeuralIPS
Wonseok Jung
December 11, 2018
More Decks by Wonseok Jung
See All by Wonseok Jung
Ai for business -self car driving
wonseokjung
0
210
reinforcement_learning_.pdf
wonseokjung
2
1.5k
원석이의 모두연에서 강화학습 보석되기
wonseokjung
0
440
Introduction Deep Reinforcement Learning
wonseokjung
0
170
Deep reinforcemenet learning -2
wonseokjung
0
220
Deep Reinforcement Learning - Introduction
wonseokjung
1
670
How to become a datascientist ?
wonseokjung
2
2.3k
Review of Taylor series
wonseokjung
1
130
꿈꾸는 Agent
wonseokjung
2
160
Featured
See All Featured
Thoughts on Productivity
jonyablonski
76
5.2k
The untapped power of vector embeddings
frankvandijk
2
1.7k
Rails Girls Zürich Keynote
gr2m
96
14k
Winning Ecommerce Organic Search in an AI Era - #searchnstuff2025
aleyda
1
2k
YesSQL, Process and Tooling at Scale
rocio
174
15k
The SEO identity crisis: Don't let AI make you average
varn
0
480
How to Grow Your eCommerce with AI & Automation
katarinadahlin
PRO
1
190
Highjacked: Video Game Concept Design
rkendrick25
PRO
1
370
Introduction to Domain-Driven Design and Collaborative software design
baasie
1
810
Side Projects
sachag
455
43k
How STYLIGHT went responsive
nonsquared
100
6.1k
Max Prin - Stacking Signals: How International SEO Comes Together (And Falls Apart)
techseoconnect
PRO
0
170
Transcript
3-JOWJUFEUBML/FVSBM*14 8POTFPL+VOH
None
None
None
43࠙ীࢲ gradient descent methodۄҊ ೞחؘ, gradient ascent. rewardܳ maximize ೞח
policyܳ ӝ ਤ೧ࢲח gradient aascent .
43࠙ীࢲ gradient descent methodۄҊ ೞחؘ, gradient ascent. rewardܳ maximize ೞח
policyܳ ӝ ਤ೧ࢲח gradient aascent ܳ ೧ঠೠ.
None
None
None
None
None