Model Based Reinforcement Learning for Atari

26/04/2020

Model Based Reinforcement Learning for Atari

Łukasz Kaiser, Mohammad Babaeizadeh, Piotr Miłos, Błażej Osiński, Roy H Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Afroz Mohiuddin, Ryan Sepassi, George Tucker, Henryk Michalewski

Keywords: reinforcement learning, model based rl, video prediction model, atari

Abstract Paper Code Similar Papers

Abstract: Model-free reinforcement learning (RL) can be used to learn effective policies for complex tasks, such as Atari games, even from image observations. However, this typically requires very large amounts of interaction -- substantially more, in fact, than a human would need to learn the same games. How can people learn so quickly? Part of the answer may be that people can learn how the game works and predict which actions will lead to desirable outcomes. In this paper, we explore how video prediction models can similarly enable agents to solve Atari games with fewer interactions than model-free methods. We describe Simulated Policy Learning (SimPLe), a complete model-based deep RL algorithm based on video prediction models and present a comparison of several model architectures, including a novel architecture that yields the best results in our setting. Our experiments evaluate SimPLe on a range of Atari games in low data regime of 100k interactions between the agent and the environment, which corresponds to two hours of real-time play. In most games SimPLe outperforms state-of-the-art model-free algorithms, in some games by over an order of magnitude.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Pretraining Representations for Data-Efficient Reinforcement Learning

Max Schwarzer, Nitarshan Rajkumar, Michael Noukhovitch and
Ankesh Anand, Laurent Charlin, R Devon Hjelm, Philip Bachman, Aaron Courville

Keywords Paper

reinforcement learning and planning, self-supervised learning

0

0

0

0

14:38

03/05/2021

Mastering Atari with Discrete World Models

Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba

Keywords Paper

reinforcement learning, actor critic, model-based reinforcement learning, world models, Atari, planning

1

0

0

0

5:52

06/12/2021

Mastering Atari Games with Limited Data

Weirui Ye, Shaohuai Liu, Thanard Kurutach and
Pieter Abbeel, Yang Gao

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

14:19

12/07/2020

ConQUR: Mitigating Delusional Bias in Deep Q-Learning

DiJia Su, Jayden Ooi, Tyler Lu and
Dale Schuurmans, Craig Boutilier

Keywords Paper

Reinforcement Learning - General

0

0

0

0

15:04

12/07/2020

Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences

Daniel Brown, Scott Niekum, Russell Coleman, Ravi Srinivasan

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:11

06/12/2021

Machine versus Human Attention in Deep Reinforcement Learning Tasks

Suna (Sihang) Guo, Ruohan Zhang, Bo Liu and
Yifeng Zhu, Dana Ballard, Mary Hayhoe, Peter Stone

Keywords Paper

deep learning, reinforcement learning and planning, interpretability

0

0

0

0

13:13

18/07/2021

Generalizable Episodic Memory for Deep Reinforcement Learning

Hao Hu, Jianing Ye, Guangxiang Zhu and
Zhizhou Ren, Chongjie Zhang

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

4:51

26/04/2020

Fast Task Inference with Variational Intrinsic Successor Features

Steven Hansen, Will Dabney, Andre Barreto and
David Warde-Farley, Tom Van de Wiele, Volodymyr Mnih

Keywords Paper

Reinforcement Learning, Variational Intrinsic Control, Successor Features

0

0

0

0

14:47

06/12/2020

Non-Crossing Quantile Regression for Distributional Reinforcement Learning

Fan Zhou, Jianing Wang, Xingdong Feng

Keywords Paper

0

0

0

0

3:11

26/04/2020

Discriminative Particle Filter Reinforcement Learning for Complex Partial observations

Xiao Ma, Peter Karkus, David Hsu and
Wee Sun Lee, Nan Ye

Keywords Paper

Reinforcement Learning, Partial Observability, Differentiable Particle Filtering

0

0

0

0

5:08

03/05/2021

Data-Efficient Reinforcement Learning with Self-Predictive Representations

Max Schwarzer, Ankesh Anand, Rishab Goel and
R Devon Hjelm, Aaron Courville, Philip Bachman

Keywords Paper

Representation Learning, Self-Supervised Learning, Reinforcement Learning, Sample Efficiency

0

0

0

1

10:04

02/02/2021

Augmenting Policy Learning with Routines Discovered from a Single Demonstration

Zelin Zhao, Chuang Gan, Jiajun Wu and
Xiaoxiao Guo, Joshua B. Tenenbaum

Keywords Paper

0

0

0

0

14:48

06/12/2021

Scalable Online Planning via Reinforcement Learning Fine-Tuning

Arnaud Fickinger, Hengyuan Hu, Brandon Amos and
Stuart Russell, Noam Brown

Keywords Paper

deep learning, reinforcement learning and planning

0

0

0

0

8:04

26/04/2020

Disagreement-Regularized Imitation Learning

Kiante Brantley, Wen Sun, Mikael Henaff

Keywords Paper

imitation learning, reinforcement learning, uncertainty

0

0

0

0

4:53

02/02/2021

Planning from Pixels in Atari with Learned Symbolic Representations

Andrea Dittadi, Frederik K. Drachmann, Thomas Bolander

Keywords Paper

0

0

0

0

14:43

06/12/2021

Learning Diverse Policies in MOBA Games via Macro-Goals

Yiming Gao, Bei Shi, Xueying Du and
Liang Wang, Guangwei Chen, Zhenjie Lian, Fuhao Qiu, GUOAN HAN, Weixuan Wang, Deheng Ye, Qiang Fu, Wei Yang, Lanxiao Huang

Keywords Paper

reinforcement learning and planning

0

0

0

0

6:49

06/12/2020

Munchausen Reinforcement Learning

Nino Vieillard, Olivier Pietquin, Matthieu Geist

Keywords Paper

0

0

0

0

3:19

12/07/2020

Agent57: Outperforming the Atari Human Benchmark

Adrià Puigdomenech Badia, Bilal Piot, Steven Kapturowski and
Pablo Sprechmann, Oleksandr Vitvitskyi, Zhaohan Guo, Charles Blundell

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

10:01

06/12/2021

Online and Offline Reinforcement Learning by Planning with a Learned Model

Julian Schrittwieser, Thomas Hubert, Amol Mandhane and
Mohammadamin Barekatain, Ioannis Antonoglou, David Silver

Keywords Paper

deep learning, reinforcement learning and planning

0

0

0

0

13:52

18/07/2021

Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity

Dhruv Malik, Aldo Pacchiano, Vishwak Srinivasan, Yuanzhi Li

Keywords Paper

Deep Learning, Deep Learning, Efficient Training Methods; Deep Learning, Optimization for Deep Networks, Theory, RL, Decisions and Control Theory

0

0

0

0

5:11

12/07/2020

Off-Policy Actor-Critic with Shared Experience Replay

Simon Schmitt, Matteo Hessel, Karen Simonyan

Keywords Paper

Reinforcement Learning - Deep RL

1

0

0

1

14:38

02/02/2021

Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search

Li-Cheng Lan, Ti-Rong Wu, I-Chen Wu, Cho-Jui Hsieh

Keywords Paper

0

0

0

0

20:33

18/07/2021

Emphatic Algorithms for Deep Reinforcement Learning

Ray Jiang, Tom Zahavy, Zhongwen Xu and
Adam White, Matteo Hessel, Charles Blundell, Hado van Hasselt

Keywords Paper

Reinforcement Learning and Planning, Deep RL

1

0

0

0

5:21

06/12/2021

Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark

Stefan O'Toole, Nir Lipovetzky, Miquel Ramirez, Adrian Pearce

Keywords Paper

0

0

0

0

15:00

06/12/2020

High-Throughput Synchronous Deep RL

Adam Liu, Raymond Yeh, Alex Schwing

Keywords Paper

0

0

0

0

3:18

03/05/2021

Taming GANs with Lookahead-Minmax

Tatjana Chavdarova, Matteo Pagliardini, Sebastian Stich and
François Fleuret, Martin Jaggi

Keywords Paper

Generative Adversarial Networks, Minmax

0

0

0

0

5:25

16/11/2020

Learning Object Manipulation Skills via Approximate State Estimation from Real Videos

Vladimír Petrík, Makarand Tapaswi, Ivan Laptev, Josef Sivic

Keywords Paper

0

0

0

0

5:07

06/12/2021

Behavior From the Void: Unsupervised Active Pre-Training

Hao Liu, Pieter Abbeel

Keywords Paper

reinforcement learning and planning

0

0

0

0

10:34

06/12/2021

Deep Reinforcement Learning at the Edge of the Statistical Precipice

Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro and
Aaron Courville, Marc Bellemare

Keywords Paper

reinforcement learning and planning

0

0

0

0

19:36

14/06/2020

Improved Few-Shot Visual Classification

Peyman Bateni, Raghav Goyal, Vaden Masrani and
Frank Wood, Leonid Sigal

Keywords Paper

meta-learning, few-shot classification, transfer learning, mahalanobis metric, bergman divergences

0

0

0

0

1:01

03/05/2021

Return-Based Contrastive Representation Learning for Reinforcement Learning

Guoqing Liu, Chuheng Zhang, Li Zhao and
Tao Qin, Jinhua Zhu, Li Jian, Nenghai Yu, Tie-Yan Liu

Keywords Paper

reinforcement learning, auxiliary task, contrastive learning, representation learning

0

0

0

0

5:20

17/08/2020

Learned motion matching

Daniel Holden, Oussama Kanoun, Maksym Perepichka, Tiberiu Popa

Keywords Paper

neural networks, animation, character animation, motion matching, generative models

0

0

0

0

23:09

06/12/2021

Do Different Tracking Tasks Require Different Appearance Models?

Zhongdao Wang, Hengshuang Zhao, Ya-Li Li and
Shengjin Wang, Philip Torr, Luca Bertinetto

Keywords Paper

self-supervised learning, vision

0

0

0

0

7:20

26/04/2020

Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents

Christian Rupprecht, Cyril Ibrahim, Christopher J. Pal

Keywords Paper

Visualization, Reinforcement Learning, Safety

0

0

0

0

4:52

19/08/2021

Deep Reinforcement Learning for Navigation in AAA Video Games

Eloi Alonso, Maxim Peter, David Goumard, Joshua Romoff

Keywords Paper

Machine Learning, Deep Reinforcement Learning, Applications of Reinforcement Learning, Game Playing

0

0

0

0

12:58

06/12/2020

Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards

Yijie Guo, Jongwook Choi, Marcin Moczulski and
Shengyu Feng, Samy Bengio, Mohammad Norouzi, Honglak Lee

Keywords Paper

0

0

1

1

3:30

19/08/2021

Learning Implicit Temporal Alignment for Few-shot Video Classification

Songyang Zhang, Jiale Zhou, Xuming He

Keywords Paper

Computer Vision, Action Recognition, Deep Learning

0

0

0

0

6:20

12/07/2020

Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

Aleksei Petrenko, Zhehui Huang, Tushar Kumar and
Gaurav Sukhatme, Vladlen Koltun

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:56

02/02/2021

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate and
Tom Melham, Daniel Kroening

Keywords Paper

0

0

0

0

15:45

02/02/2021

Probabilistic Programming Bots in Intuitive Physics Game Play

Fahad Alhasoun, Sarah Alneghiemish

Keywords Paper

0

0

0

0

13:18