Slide 146
Slide 146 text
紹介論文リスト(1)
● Bellemare, Marc G., et al. "The Arcade Learning Environment: An evaluation
platform for general agents." J. Artif. Intell. Res.(JAIR)47 (2013): 253-279.
● Mnih, Volodymyr, et al. "Playing atari with deep reinforcement learning." arXiv
preprint arXiv:1312.5602 (2013).
● Mnih, Volodymyr, et al. "Human-level control through deep reinforcement
learning." Nature 518.7540 (2015): 529-533.
● Lin, Long-Ji. Reinforcement learning for robots using neural networks. No.
CMU-CS-93-103. Carnegie-Mellon Univ Pittsburgh PA School of Computer
Science, 1993.
● Nair, Arun, et al. "Massively parallel methods for deep reinforcement
learning." arXiv preprint arXiv:1507.04296 (2015).
● Mnih, Volodymyr, et al. "Asynchronous methods for deep reinforcement
learning." International Conference on Machine Learning. 2016.
● Babaeizadeh, Mohammad, et al. "Reinforcement learning through
asynchronous advantage actor-critic on a gpu." (2016).
146