Slide 126
Slide 126 text
参考文献 (3)
⚫ Nikolay Savinov, Anton Raichuk, Raphaël Marinier, Damien Vincent, Marc Pollefeys, Timothy
Lillicrap, Sylvain Gelly (2018). Episodic Curiosity through Reachability.
https://arxiv.org/abs/1810.02274
⚫ Ildefons Magrans de Abril, Ryota Kanai (2018). Curiosity-driven reinforcement learning with
homeostatic regulation. https://arxiv.org/abs/1801.07440
⚫ Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafal Jozefowicz, Bob McGrew, Jakub
Pachocki, Arthur Petron, Matthias Plappert, Glenn Powell, Alex Ray, Jonas Schneider, Szymon
Sidor, Josh Tobin, Peter Welinder, Lilian Weng, Wojciech Zaremba (2018). Learning Dexterous
In-Hand Manipulation. https://arxiv.org/abs/1808.00177
⚫ Nicolas Heess, Dhruva TB, Srinivasan Sriram, Jay Lemmon, Josh Merel, Greg Wayne, Yuval
Tassa, Tom Erez, Ziyu Wang, S. M. Ali Eslami, Martin Riedmiller, David Silver (2017).
Emergence of Locomotion Behaviours in Rich Environments.
https://arxiv.org/abs/1707.02286
⚫ Trapit Bansal, Jakub Pachocki, Szymon Sidor, Ilya Sutskever, Igor Mordatch (2018). Emergent
Complexity via Multi-Agent Competition. https://arxiv.org/abs/1710.03748
117