RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments

26/04/2020

Roberta Raileanu, Tim Rocktäschel

Keywords: reinforcement learning, exploration, curiosity

Abstract: Exploration in sparse reward environments remains one of the key challenges of model-free reinforcement learning. Instead of solely relying on extrinsic rewards provided by the environment, many state-of-the-art methods use intrinsic rewards to encourage exploration. However, we show that existing methods fall short in procedurally-generated environments where an agent is unlikely to visit a state more than once. We propose a novel type of intrinsic reward which encourages the agent to take actions that lead to significant changes in its learned state representation. We evaluate our method on multiple challenging procedurally-generated tasks in MiniGrid, as well as on tasks with high-dimensional observations used in prior work. Our experiments demonstrate that this approach is more sample efficient than existing exploration methods, particularly for procedurally-generated MiniGrid environments. Furthermore, we analyze the learned behavior as well as the intrinsic reward received by our agent. In contrast to previous approaches, our intrinsic reward does not diminish during the course of training and it rewards the agent substantially more for interacting with objects that it can control.
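The core idea in the abstract, rewarding actions that significantly change the learned state representation, can be illustrated with a minimal sketch. This is only an illustration under stated assumptions, not the authors' implementation: the function name `impact_reward`, the choice of the L2 norm as the measure of change, and the hand-written embedding vectors are all hypothetical, and in practice the embeddings would come from a learned encoder.

```python
import math
from typing import Sequence


def impact_reward(phi_s: Sequence[float], phi_next: Sequence[float]) -> float:
    """Return the L2 distance between two consecutive state embeddings.

    Assumption: "change in the learned state representation" is measured
    as the Euclidean norm of the difference between the embeddings.
    """
    if len(phi_s) != len(phi_next):
        raise ValueError("embeddings must have the same dimension")
    return math.sqrt(sum((b - a) ** 2 for a, b in zip(phi_s, phi_next)))


# Hypothetical embeddings: an action that leaves the representation
# unchanged earns no intrinsic reward, while one that moves it does.
no_change = impact_reward([0.2, 0.5], [0.2, 0.5])
big_change = impact_reward([0.0, 0.0], [3.0, 4.0])
```

Under these assumptions, an action with no effect on the embedding receives zero reward, which matches the abstract's point that the agent is rewarded more for interactions that actually change its state representation.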

The talk and the accompanying paper were published at the ICLR 2020 virtual conference.

Similar Papers

19/08/2021

Don’t Do What Doesn’t Matter: Intrinsic Motivation with Action Usefulness

Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin

Keywords: Machine Learning, Reinforcement Learning, Deep Reinforcement Learning

03/05/2021

Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments

Daochen Zha, Wenye Ma, Lei Yuan, Xia Hu, Ji Liu

Keywords: Exploration, Reinforcement Learning, Self-Imitation, Generalization of Reinforcement Learning

03/05/2021

Mutual Information State Intrinsic Control

Rui Zhao, Yang Gao, Pieter Abbeel, Volker Tresp, Wei Xu

Keywords: Intrinsic Motivation, Intrinsic Reward, Intrinsically Motivated Reinforcement Learning, Deep Reinforcement Learning, Reinforcement Learning

18/07/2021

MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration

Jin Zhang, Jianhao Wang, Hao Hu, Tong Chen, Yingfeng Chen, Changjie Fan, Chongjie Zhang

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

06/12/2021

Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration

Lulu Zheng, Jiarui Chen, Jianhao Wang, Jiamin He, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao, Chongjie Zhang

Keywords: reinforcement learning and planning

06/12/2021

Contrastive Active Inference

Pietro Mazzaglia, Tim Verbelen, Bart Dhoedt

Keywords: theory, reinforcement learning and planning, generative model

12/07/2020

Implicit Generative Modeling for Efficient Exploration

Neale Ratzlaff, Qinxun Bai, Fuxin Li, Wei Xu

Keywords: Reinforcement Learning - Deep RL

06/12/2021

(Almost) Free Incentivized Exploration from Decentralized Learning Agents

Chengshuai Shi, Haifeng Xu, Wei Xiong, Cong Shen

Keywords: reinforcement learning and planning, bandits

12/07/2020

What Can Learned Intrinsic Rewards Capture?

Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh

Keywords: Reinforcement Learning - Deep RL

18/07/2021

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning

Kevin Li, Abhishek Gupta, Ashwin D Reddy, Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine

Keywords: Reinforcement Learning and Planning, Deep RL

19/08/2021

Reward-Constrained Behavior Cloning

Zhaorong Wang, Meng Wang, Jingqi Zhang, Yingfeng Chen, Chongjie Zhang

Keywords: Machine Learning, Deep Reinforcement Learning, Reinforcement Learning, Constraint Optimization

06/12/2020

Novelty Search in Representational Space for Sample Efficient Exploration

David Tao, Vincent Francois-Lavet, Joelle Pineau

18/07/2021

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Luisa Zintgraf, Leo Feng, Cong Lu, Max Igl, Kristian Hartikainen, Katja Hofmann, Shimon Whiteson

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

06/12/2021

Learning Markov State Abstractions for Deep Reinforcement Learning

Cameron Allen, Neev Parikh, Omer Gottesman, George Konidaris

Keywords: reinforcement learning and planning, contrastive learning, representation learning

26/04/2020

SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards

Siddharth Reddy, Anca D. Dragan, Sergey Levine

Keywords: Imitation Learning, Reinforcement Learning

16/11/2020

Positive-Unlabeled Reward Learning

Danfei Xu, Misha Denil

02/02/2021

Relative Variational Intrinsic Control

Kate Baumli, David Warde-Farley, Steven Hansen, Volodymyr Mnih

06/12/2021

MADE: Exploration via Maximizing Deviation from Explored Regions

Tianjun Zhang, Paria Rashidinejad, Jiantao Jiao, Yuandong Tian, Joseph Gonzalez, Stuart Russell

Keywords: reinforcement learning and planning

06/12/2020

Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration

Andrea Zanette, Alessandro Lazaric, Mykel J Kochenderfer, Emma Brunskill

06/12/2021

Discovery of Options via Meta-Learned Subgoals

Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Keywords: deep learning, reinforcement learning and planning

18/07/2021

Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices

Evan Liu, Aditi Raghunathan, Percy Liang, Chelsea Finn

Keywords: Reinforcement Learning and Planning, Deep RL

18/07/2021

Fast active learning for pure exploration in reinforcement learning

Pierre MENARD, Omar Darwiche Domingues, Anders Jonsson, Emilie Kaufmann, Edouard Leurent, Michal Valko

Keywords: Theory, RL, Decisions and Control Theory

18/07/2021

State Entropy Maximization with Random Encoders for Efficient Exploration

Younggyo Seo, Lili Chen, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee

Keywords: Reinforcement Learning and Planning, Deep RL

16/11/2020

Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments

Jun Yamada, Youngwoon Lee, Gautam Salhotra, Karl Pertsch, Max Pflueger, Gaurav Sukhatme, Joseph Lim, Peter Englert

12/07/2020

Constrained Markov Decision Processes via Backward Value Functions

Harsh Satija, Philip Amortila, Joelle Pineau

Keywords: Reinforcement Learning - General

06/12/2021

Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning

Jinxin Liu, Hao Shen, Donglin Wang, Yachen Kang, Qiangxing Tian

Keywords: reinforcement learning and planning, domain adaptation

06/12/2021

Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL

Charles Packer, Pieter Abbeel, Joseph Gonzalez

Keywords: reinforcement learning and planning

18/07/2021

Demonstration-Conditioned Reinforcement Learning for Few-Shot Imitation

Christopher Dance, Perez Julien, Théo Cachet

Keywords: Reinforcement Learning and Planning, Planning and Control

26/04/2020

Watch, Try, Learn: Meta-Learning from Demonstrations and Rewards

Allan Zhou, Eric Jang, Daniel Kappler, Alex Herzog, Mohi Khansari, Paul Wohlhart, Yunfei Bai, Mrinal Kalakrishnan, Sergey Levine, Chelsea Finn

Keywords: meta-learning, reinforcement learning, imitation learning

12/07/2020

Variational Imitation Learning with Diverse-quality Demonstrations

Voot Tangkaratt, Bo Han, Mohammad Emtiyaz Khan, Masashi Sugiyama

Keywords: Reinforcement Learning - Deep RL

06/12/2021

A Max-Min Entropy Framework for Reinforcement Learning

Seungyul Han, Youngchul Sung

Keywords: optimization, reinforcement learning and planning

03/05/2021

Plan-Based Relaxed Reward Shaping for Goal-Directed Tasks

Ingmar Schubert, Oz Oguz, Marc Toussaint

Keywords: reinforcement learning, robotics, robotic manipulation, plan-based reward shaping, reward shaping

03/05/2021

Learning What To Do by Simulating the Past

David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

Keywords: reinforcement learning, imitation learning, reward learning

02/02/2021

Sequential Generative Exploration Model for Partially Observable Reinforcement Learning

Haiyan Yin, Jianda Chen, Sinno Jialin Pan, Sebastian Tschiatschek

06/12/2021

Gradient Driven Rewards to Guarantee Fairness in Collaborative Machine Learning

Xinyi Xu, Lingjuan Lyu, Xingjun Ma, Chenglin Miao, Chuan Sheng Foo, Bryan Kian Hsiang Low

Keywords: machine learning, fairness, federated learning

26/04/2020

Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control

Tsui-Wei Weng, Krishnamurthy (Dj) Dvijotham, Jonathan Uesato, Kai Xiao, Sven Gowal, Robert Stanforth, Pushmeet Kohli

Keywords: deep learning, reinforcement learning, robustness, adversarial examples

18/07/2021

Policy Gradient Bayesian Robust Optimization for Imitation Learning

Zaynah Javed, Daniel Brown, Satvik Sharma, Jerry Zhu, Ashwin Balakrishna, Marek Petrik, Anca Dragan, Ken Goldberg

Keywords: Social Aspects of Machine Learning, AI Safety

06/12/2021

Learning to Synthesize Programs as Interpretable and Generalizable Policies

Dweep Trivedi, Jesse Zhang, Shao-Hua Sun, Joseph Lim

Keywords: deep learning, reinforcement learning and planning

18/07/2021

Interactive Learning from Activity Description

Khanh Nguyen, Dipendra Misra, Robert Schapire, Miro Dudik, Patrick Shafto

Keywords: Reinforcement Learning and Planning

18/07/2021

Contrastive Learning Inverts the Data Generating Process

Roland S. Zimmermann, Yash Sharma, Steffen Schneider, Matthias Bethge, Wieland Brendel

Keywords: Theory, Deep learning Theory