Goal-Conditioned Reinforcement Learning with Imagined Subgoals

18/07/2021

Goal-Conditioned Reinforcement Learning with Imagined Subgoals

Elliot Chane-Sane, Cordelia Schmid, Ivan Laptev

Keywords: Reinforcement Learning and Planning, Deep RL

Abstract Paper Similar Papers

Abstract: Goal-conditioned reinforcement learning endows an agent with a large variety of skills, but it often struggles to solve tasks that require more temporally extended reasoning. In this work, we propose to incorporate imagined subgoals into policy learning to facilitate learning of complex tasks. Imagined subgoals are predicted by a separate high-level policy, which is trained simultaneously with the policy and its critic. This high-level policy predicts intermediate states halfway to the goal using the value function as a reachability metric. We don’t require the policy to reach these subgoals explicitly. Instead, we use them to define a prior policy, and incorporate this prior into a KL-constrained policy iteration scheme to speed up and regularize learning. Imagined subgoals are used during policy learning, but not during test time, where we only apply the learned policy. We evaluate our approach on complex robotic navigation and manipulation tasks and show that it outperforms existing methods by a large margin.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

A distributional view on multi objective policy optimization

Abbas Abdolmaleki, Sandy Huang, Leonard Hasenclever and
Michael Neunert, Martina Zambelli, Murilo Martins, Francis Song, Nicolas Heess, Raia Hadsell, Martin Riedmiller

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:04

03/05/2021

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh, Huihan Liu, Gaoyue Zhou and
Albert Yu, Nicholas Rhinehart, Sergey Levine

Keywords Paper

reinforcement learning, imitation learning

0

0

0

0

14:21

26/04/2020

Automated curriculum generation through setter-solver interactions

Sebastien Racaniere, Andrew Lampinen, Adam Santoro and
David Reichert, Vlad Firoiu, Timothy Lillicrap

Keywords Paper

Deep Reinforcement Learning, Automatic Curriculum

0

0

0

0

3:55

06/12/2021

Learning to Synthesize Programs as Interpretable and Generalizable Policies

Dweep Trivedi, Jesse Zhang, Shao-Hua Sun, Joseph Lim

Keywords Paper

deep learning, reinforcement learning and planning

0

0

0

0

10:30

03/05/2021

Learning to Reach Goals via Iterated Supervised Learning

Dibya Ghosh, Abhishek Gupta, Ashwin D Reddy and
Justin Fu, Coline M Devin, Ben Eysenbach, Sergey Levine

Keywords Paper

goal reaching, reinforcement learning, goal-conditioned RL, behavior cloning

0

0

0

0

15:19

06/12/2020

Multi-Task Reinforcement Learning with Soft Modularization

Ruihan Yang, Huazhe Xu, YI WU, Xiaolong Wang

Keywords Paper

0

0

0

0

3:18

16/11/2020

Chaining Behaviors from Data with Model-Free Reinforcement Learning

Avi Singh, Albert Yu, Jonathan Yang and
Jesse Zhang, Aviral Kumar, Sergey Levine

Keywords Paper

0

0

0

0

5:01

18/07/2021

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning

Kevin Li, Abhishek Gupta, Ashwin D Reddy and
Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:16

06/12/2021

Continual Auxiliary Task Learning

Matthew McLeod, Chunlok Lo, Matthew Schlegel and
Andrew Jacobsen, Raksha Kumaraswamy, Martha White, Adam White

Keywords Paper

reinforcement learning and planning

0

0

0

0

5:36

16/11/2020

PLAS: Latent Action Space for Offline Reinforcement Learning

Wenxuan Zhou, Sujay Bajracharya, David Held

Keywords Paper

0

0

0

0

5:06

16/11/2020

Learning Dexterous Manipulation from Suboptimal Experts

Rae Jeong, Jost Tobias Springenberg, Jackie Kay and
Dan Zheng, Alexandre Galashov, Nicolas Heess, Francesco Nori

Keywords Paper

0

0

0

0

5:03

06/12/2020

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL

Saurabh Kumar, Aviral Kumar, Sergey Levine, Chelsea Finn

Keywords Paper

0

0

0

0

3:24

04/07/2020

Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

Ryuichi Takanobu, Runze Liang, Minlie Huang

Keywords Paper

pretraining, Multi-Agent Learning, Role-Aware Decomposition, reinforcement learning

0

0

0

0

13:00

06/12/2021

Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality

Songyuan Zhang, ZHANGJIE CAO, Dorsa Sadigh, Yanan Sui

Keywords Paper

reinforcement learning and planning

0

0

0

0

13:50

06/12/2021

Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning

Jinxin Liu, Hao Shen, Donglin Wang and
Yachen Kang, Qiangxing Tian

Keywords Paper

reinforcement learning and planning, domain adaptation

0

0

0

0

8:07

03/05/2021

Self-Supervised Policy Adaptation during Deployment

Nicklas Hansen, Rishabh Jangir, Yu Sun and
Guillem Alenyà, Pieter Abbeel, Alyosha Efros, Lerrel Pinto, Xiaolong Wang

Keywords Paper

reinforcement learning, self-supervised learning, robotics, sim2real, generalization

0

0

0

0

8:52

16/11/2020

Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability

Hai Nguyen, Brett Daley, Xinchao Song and
Christopher Amato, Robert Platt

Keywords Paper

0

0

0

0

5:05

18/07/2021

PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training

Kimin Lee, Laura Smith, Pieter Abbeel

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

15:02

03/05/2021

Adaptive Procedural Task Generation for Hard-Exploration Problems

Kuan Fang, Yuke Zhu, Silvio Savarese, Li Fei-Fei

Keywords Paper

reinforcement learning, task generation, procedural generation, curriculum learning

0

0

0

0

5:06

26/04/2020

Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery

Kristian Hartikainen, Xinyang Geng, Tuomas Haarnoja, Sergey Levine

Keywords Paper

reinforcement learning, semi-supervised learning, unsupervised learning, robotics, deep learning

0

0

0

0

5:07

12/07/2020

Provably Efficient Model-based Policy Adaptation

Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:49

19/08/2021

Inter-Task Similarity for Lifelong Reinforcement Learning in Heterogeneous Tasks

Sergio A. Serrano

Keywords Paper

Machine Learning, Transfer, Adaptation, Multi-task Learning, Reinforcement Learning, Incremental Learning, Learning in Robotics

0

0

0

0

11:02

12/07/2020

Growing Action Spaces

Gregory Farquhar, Laura Gustafson, Zeming Lin and
Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

16:31

06/12/2021

Program Synthesis Guided Reinforcement Learning for Partially Observed Environments

Yichen Yang, Jeevana Priya Inala, Osbert Bastani and
Yewen Pu, Armando Solar-Lezama, Martin Rinard

Keywords Paper

reinforcement learning and planning, generative model

0

0

0

0

14:56

06/12/2021

Teachable Reinforcement Learning via Advice Distillation

Olivia Watkins, Abhishek Gupta, Trevor Darrell and
Pieter Abbeel, Jacob Andreas

Keywords Paper

reinforcement learning and planning, active learning

0

0

0

0

12:45

06/12/2021

Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies

Tim Seyde, Igor Gilitschenski, Wilko Schwarting and
Bartolomeo Stellato, Martin Riedmiller, Markus Wulfmeier, Daniela Rus

Keywords Paper

reinforcement learning and planning

0

0

0

0

6:48

06/12/2021

Visual Adversarial Imitation Learning using Variational Models

Rafael Rafailov, Tianhe Yu, Aravind Rajeswaran, Chelsea Finn

Keywords Paper

theory, reinforcement learning and planning, adversarial robustness and security, representation learning

0

0

0

0

7:25

06/12/2020

Neural Dynamic Policies for End-to-End Sensorimotor Learning

Shikhar Bahl, Mustafa Mukadam, Abhinav Gupta, Deepak Pathak

Keywords Paper

0

0

0

0

3:35

12/07/2020

Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

Jie Xu, Yunsheng Tian, Pingchuan Ma and
Daniela Rus, Shinjiro Sueda, Wojciech Matusik

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:15

06/12/2021

Discovery of Options via Meta-Learned Subgoals

Vivek Veeriah, Tom Zahavy, Matteo Hessel and
Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Keywords Paper

deep learning, reinforcement learning and planning

0

0

0

0

4:13

06/12/2020

Automatic Curriculum Learning through Value Disagreement

Yunzhi Zhang, Pieter Abbeel, Lerrel Pinto

Keywords Paper

0

0

0

0

3:17

06/12/2021

The Value of Information When Deciding What to Learn

Dilip Arumugam, Benjamin Van Roy

Keywords Paper

theory, reinforcement learning and planning, bandits

0

0

0

0

9:27

18/07/2021

Representation Matters: Offline Pretraining for Sequential Decision Making

Mengjiao Yang, Ofir Nachum

Keywords Paper

Reinforcement Learning and Planning

1

0

0

0

5:06

06/12/2020

Policy Improvement via Imitation of Multiple Oracles

Ching-An Cheng, Andrey Kolobov, Alekh Agarwal

Keywords Paper

0

0

0

0

3:12

26/04/2020

Watch, Try, Learn: Meta-Learning from Demonstrations and Rewards

Allan Zhou, Eric Jang, Daniel Kappler and
Alex Herzog, Mohi Khansari, Paul Wohlhart, Yunfei Bai, Mrinal Kalakrishnan, Sergey Levine, Chelsea Finn

Keywords Paper

meta-learning, reinforcement learning, imitation learning

0

0

0

0

4:34

18/07/2021

Learning Routines for Effective Off-Policy Reinforcement Learning

Edoardo Cetin, Oya Celiktutan

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:17

26/04/2020

Intrinsic Motivation for Encouraging Synergistic Behavior

Rohan Chitnis, Shubham Tulsiani, Saurabh Gupta, Abhinav Gupta

Keywords Paper

reinforcement learning, intrinsic motivation, synergistic, robot manipulation

0

0

0

0

5:02

26/04/2020

Thinking While Moving: Deep Reinforcement Learning with Concurrent Control

Ted Xiao, Eric Jang, Dmitry Kalashnikov and
Sergey Levine, Julian Ibarz, Karol Hausman, Alexander Herzog

Keywords Paper

deep reinforcement learning, continuous-time, robotics

0

0

0

0

4:42

06/12/2020

An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

Siddharth Desai, Ishan Durugkar, Haresh Karnan and
Garrett Warnell, Josiah Hanna, Peter Stone

Keywords Paper

0

0

0

0

3:22

26/08/2020

Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration

Matteo Papini, Andrea Battistello, Marcello Restelli

Keywords Paper

0

0

0

0

12:47