Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
NeuralIPS
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Wonseok Jung
December 11, 2018
0
430
NeuralIPS
Wonseok Jung
December 11, 2018
Tweet
Share
More Decks by Wonseok Jung
See All by Wonseok Jung
Ai for business -self car driving
wonseokjung
0
210
reinforcement_learning_.pdf
wonseokjung
2
1.5k
원석이의 모두연에서 강화학습 보석되기
wonseokjung
0
430
Introduction Deep Reinforcement Learning
wonseokjung
0
170
Deep reinforcemenet learning -2
wonseokjung
0
210
Deep Reinforcement Learning - Introduction
wonseokjung
1
650
How to become a datascientist ?
wonseokjung
2
2.3k
Review of Taylor series
wonseokjung
1
120
꿈꾸는 Agent
wonseokjung
2
160
Featured
See All Featured
The SEO Collaboration Effect
kristinabergwall1
0
350
How to Get Subject Matter Experts Bought In and Actively Contributing to SEO & PR Initiatives.
livdayseo
0
65
Designing for humans not robots
tammielis
254
26k
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
31
3.1k
Embracing the Ebb and Flow
colly
88
5k
Building Better People: How to give real-time feedback that sticks.
wjessup
370
20k
Redefining SEO in the New Era of Traffic Generation
szymonslowik
1
210
Navigating Algorithm Shifts & AI Overviews - #SMXNext
aleyda
0
1.1k
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
32
2.1k
Testing 201, or: Great Expectations
jmmastey
46
8k
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
162
16k
Chasing Engaging Ingredients in Design
codingconduct
0
110
Transcript
3-JOWJUFEUBML/FVSBM*14 8POTFPL+VOH
None
None
None
43࠙ীࢲ gradient descent methodۄҊ ೞחؘ, gradient ascent. rewardܳ maximize ೞח
policyܳ ӝ ਤ೧ࢲח gradient aascent .
43࠙ীࢲ gradient descent methodۄҊ ೞחؘ, gradient ascent. rewardܳ maximize ೞח
policyܳ ӝ ਤ೧ࢲח gradient aascent ܳ ೧ঠೠ.
None
None
None
None
None