NeuralIPS
Wonseok Jung
December 11, 2018
Transcript
RL invited talk, NeuralIPS
Wonseok Jung
At the 43-minute mark the talk calls it a gradient descent method, but it is actually gradient ascent. To find a policy that maximizes reward, we have to perform gradient ascent.
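The difference between the two update directions can be illustrated with a small sketch. It is not taken from the talk: it assumes a hypothetical two-armed bandit with fixed rewards and a softmax policy, and every name in it (`softmax`, `expected_reward`, `optimize`, the reward values, the learning rate) is made up for illustration. It applies the exact gradient of the expected reward, once with a plus sign (ascent) and once with a minus sign (descent).

```python
import math


def softmax(theta):
    """Turn preferences theta into action probabilities."""
    m = max(theta)  # subtract the max for numerical stability
    exps = [math.exp(t - m) for t in theta]
    z = sum(exps)
    return [e / z for e in exps]


def expected_reward(theta, rewards):
    """J(theta) = sum_a pi(a) * r(a) for a one-step (bandit) problem."""
    pi = softmax(theta)
    return sum(p * r for p, r in zip(pi, rewards))


def grad_expected_reward(theta, rewards):
    """Exact gradient for a softmax policy: dJ/dtheta_k = pi_k * (r_k - J)."""
    pi = softmax(theta)
    j = sum(p * r for p, r in zip(pi, rewards))
    return [p * (r - j) for p, r in zip(pi, rewards)]


def optimize(theta, rewards, lr, steps, sign):
    """sign=+1 is gradient ascent, sign=-1 is gradient descent."""
    theta = list(theta)
    for _ in range(steps):
        g = grad_expected_reward(theta, rewards)
        theta = [t + sign * lr * gi for t, gi in zip(theta, g)]
    return theta


# Assumed toy problem: arm 0 pays 1.0, arm 1 pays 0.0.
rewards = [1.0, 0.0]
theta0 = [0.0, 0.0]  # uniform initial policy, J = 0.5

ascent = optimize(theta0, rewards, lr=0.5, steps=200, sign=+1)
descent = optimize(theta0, rewards, lr=0.5, steps=200, sign=-1)

print("J after ascent: ", expected_reward(ascent, rewards))
print("J after descent:", expected_reward(descent, rewards))
```

Under these assumptions, ascent moves the policy toward the rewarding arm and the expected reward rises above its starting value of 0.5, while descent on the same objective pushes it down. Many libraries only provide minimizers, so in practice the same ascent step is often implemented as descent on the negated objective.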