NeuralIPS

December 11, 2018

440

NeuralIPS

Wonseok Jung

December 11, 2018

More Decks by Wonseok Jung

See All by Wonseok Jung

Ai for business -self car driving

0

210

reinforcement_learning_.pdf

2

1.5k

원석이의 모두연에서 강화학습 보석되기

0

440

Introduction Deep Reinforcement Learning

0

180

Deep reinforcemenet learning -2

0

220

Deep Reinforcement Learning - Introduction

1

670

How to become a datascientist ?

2

2.4k

Review of Taylor series

1

130

꿈꾸는 Agent

2

170

Featured

See All Featured

Site-Speed That Sticks

13

1.3k

DevOps and Value Stream Thinking: Enabling flow, efficiency and business value

1

260

Creating an realtime collaboration tool: Agile Flush - .NET Oxford

35

2.5k

HTML-Aware ERB: The Path to Reactive Rendering @ RubyCon 2026, Rimini, Italy

2

320

How Software Deployment tools have changed in the past 20 years

0

34k

Raft: Consensus for Rubyists

141

7.6k

455

43k

Ethics towards AI in product and experience design

2

330

New Earth Scene 8

3

2.4k

Sharpening the Axe: The Primacy of Toolmaking

46

2.9k

個人開発の失敗を避けるイケてる考え方 / tips for indie hackers

123

22k

How to Think Like a Performance Engineer

28

2.7k

Transcript

3-JOWJUFEUBML/FVSBM*14 8POTFPL+VOH
None
None
None
43࠙ীࢲ gradient descent methodۄҊ ೞ৓חؘ, gradient ascent੉׮. rewardܳ maximize ೞח
policyܳ ଺ӝ ਤ೧ࢲח gradient aascent ׮.
43࠙ীࢲ gradient descent methodۄҊ ೞ৓חؘ, gradient ascent੉׮. rewardܳ maximize ೞח
policyܳ ଺ӝ ਤ೧ࢲח gradient aascent ܳ ೧ঠೠ׮.
None
None
None
None
None