12/07/2020

Off-Policy Actor-Critic with Shared Experience Replay

Simon Schmitt, Matteo Hessel, Karen Simonyan

Keywords: Reinforcement Learning - Deep RL

Abstract: We investigate the combination of actor-critic reinforcement learning algorithms with uniform large-scale experience replay and propose solutions for two ensuing challenges: (a) efficient actor-critic learning with experience replay, and (b) the stability of off-policy learning where agents learn from other agents' behaviour. To this end we analyze the bias-variance tradeoffs in V-trace, a form of importance sampling for actor-critic methods. Based on our analysis, we then argue for mixing experience sampled from replay with on-policy experience, and propose a new trust region scheme that scales effectively to data distributions where V-trace becomes unstable. We provide extensive empirical validation of the proposed solutions on DMLab-30. We further show the benefits of this setup in two training regimes for Atari: (1) a single agent is trained for up to 200M environment frames per game, and (2) a population of agents is trained for up to 200M environment frames each and may share experience. While (1) is a standard regime, (2) reflects the use case of concurrently executed hyper-parameter sweeps. We demonstrate state-of-the-art data efficiency among model-free agents in both regimes.
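As background for the bias-variance analysis mentioned above, the following is a minimal NumPy sketch of the standard V-trace target computation introduced in the IMPALA paper (Espeholt et al., 2018), on which this work builds; the function and argument names are illustrative and not taken from the authors' code.

import numpy as np

def vtrace_targets(rewards, values, bootstrap_value, target_logp, behaviour_logp,
                   gamma=0.99, rho_bar=1.0, c_bar=1.0):
    # Compute V-trace targets for a single trajectory of length T.
    # rewards, values, target_logp, behaviour_logp: arrays of length T.
    # bootstrap_value: value estimate for the state after the final step.
    # rho_bar / c_bar: clipping thresholds; lowering them reduces variance
    # at the cost of bias in the fixed point (the tradeoff analyzed in the paper).
    T = len(rewards)
    rhos = np.exp(target_logp - behaviour_logp)   # importance ratios pi/mu
    clipped_rhos = np.minimum(rho_bar, rhos)      # weights the TD errors
    clipped_cs = np.minimum(c_bar, rhos)          # cuts the backward trace

    values_tp1 = np.concatenate([values[1:], [bootstrap_value]])
    deltas = clipped_rhos * (rewards + gamma * values_tp1 - values)

    # Backward recursion: v_t - V(x_t) = delta_t + gamma * c_t * (v_{t+1} - V(x_{t+1}))
    acc = 0.0
    vs_minus_v = np.zeros(T)
    for t in reversed(range(T)):
        acc = deltas[t] + gamma * clipped_cs[t] * acc
        vs_minus_v[t] = acc
    return values + vs_minus_v

Off-policy data (e.g. replayed or shared experience) drives the ratios pi/mu away from 1, which is where the clipping thresholds, and hence the bias-variance tradeoff studied in the paper, become important.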

Talk and paper published at the ICML 2020 virtual conference.
