Slide 1
Slide 1 text
Survey on World Models and Reinforcement
Learning
Learning Latent Dynamics for Planning from Pixels,
Danijar Hafner, Timothy Lillicrap, Ian Fischer, Ruben Villegas, David Ha (Google) [arXiv’18]
Dream to Control: Learning Behaviors by Latent Imagination,
Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba (Google) [ICLR’20]
DayDreamer: World Models for Physical Robot Learning,
Philipp Wu* Alejandro Escontrela* Danijar Hafner* Ken Goldberg Pieter Abbeel (University of
California, Berkeley) [CoRL’22]
Mastering Diverse Domains through World Models,
Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap (DeepMind) [arXiv’23]
Mastering Atari with Discrete World Models,
Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba (Google) [ICLR’21]
1/27