Slide 8
Slide 8 text
͡Ίʹ
ڧԽֶश 1981
Q-learning, Actor-Critic, DQN, AlphaGo, . . .
Psychological Review ࢽ 1981
Psychological Review
1981, Vol. 88, No. 2, 135-170
Copyright 1981 by the American Psychological Association, Inc.
0033-295X/8I/8802-OI35$00.75
Toward a Modern Theory of Adaptive Networks:
Expectation and Prediction
Richard S. Sutton and Andrew G. Barto
Computer and Information Science Department
University of Massachusetts—Amherst
Many adaptive neural network theories are based on neuronlike adaptive elements
that can behave as single unit analogs of associative conditioning. In this article
we develop a similar adaptive element, but one which is more closely in accord
with the facts of animal learning theory than elements commonly studied in
adaptive network research. We suggest that an essential feature of classical
ߴڮ ୡೋ (౦ژిػେֶ, υϫϯΰਓೳݚڀॴ) (SS3)
ೝՊֶ͔Βͷࢹɿ ຬԽʹΑΔΤϛϡϨʔγϣϯͱɼ ఆͱͯ͠ͷڧԽֶश
2018-06-23 Sat 8 / 23