
Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Luisa Zintgraf, Leo Feng, Cong Lu, Max Igl, Kristian Hartikainen, Katja Hofmann, Shimon Whiteson

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

Abstract: To rapidly learn a new task, it is often essential for agents to explore efficiently, especially when performance matters from the first timestep. One way to learn such behaviour is via meta-learning. Many existing methods, however, rely on dense rewards for meta-training and can fail catastrophically if the rewards are sparse. Without a suitable reward signal, the need for exploration during meta-training is exacerbated. To address this, we propose HyperX, which uses novel reward bonuses for meta-training to explore in approximate hyper-state space (where hyper-states represent the environment state and the agent's task belief). We show empirically that HyperX meta-learns better task-exploration and adapts more successfully to new tasks than existing methods.
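
To illustrate the core idea of a novelty bonus over hyper-states, the sketch below shows one plausible instantiation using a random-network-distillation-style bonus: a fixed random network is mimicked by a trained predictor, and the prediction error on a hyper-state (the environment state concatenated with the task-belief vector) serves as the exploration bonus during meta-training. This is a minimal sketch under assumed shapes and names (e.g. `HyperStateBonus`, `feature_dim`, `beta`), not the authors' implementation.

```python
# Minimal sketch (not the authors' code): a random-network-distillation-style
# novelty bonus over hyper-states, i.e. the concatenation of the environment
# state and the agent's current task-belief vector. All names, shapes, and
# hyperparameters below are illustrative assumptions.
import torch
import torch.nn as nn


class HyperStateBonus(nn.Module):
    def __init__(self, state_dim: int, belief_dim: int, feature_dim: int = 64):
        super().__init__()
        hyper_dim = state_dim + belief_dim
        # Fixed, randomly initialised target network (never trained).
        self.target = nn.Sequential(
            nn.Linear(hyper_dim, 128), nn.ReLU(), nn.Linear(128, feature_dim)
        )
        for p in self.target.parameters():
            p.requires_grad_(False)
        # Predictor network, trained to match the target's outputs on
        # visited hyper-states.
        self.predictor = nn.Sequential(
            nn.Linear(hyper_dim, 128), nn.ReLU(), nn.Linear(128, feature_dim)
        )

    def forward(self, state: torch.Tensor, belief: torch.Tensor) -> torch.Tensor:
        # Hyper-state = environment state concatenated with task belief.
        h = torch.cat([state, belief], dim=-1)
        # The prediction error is large on rarely visited hyper-states and
        # shrinks as the predictor is trained on them, so it can be added
        # to the task reward as an exploration bonus during meta-training.
        return (self.predictor(h) - self.target(h)).pow(2).mean(dim=-1)


# Illustrative usage: augment the environment reward with the bonus.
# bonus_fn = HyperStateBonus(state_dim=8, belief_dim=16)
# r_total = r_env + beta * bonus_fn(state, belief)
# Minimising bonus_fn(...) on visited hyper-states makes the bonus decay
# as regions of hyper-state space become familiar.
```

In this sketch the bonus naturally rewards visiting novel combinations of state and belief, so the agent is driven to gather experience that is informative about the task even when the environment reward is sparse.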
