Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors

06/12/2020

Karl Pertsch, Oleh Rybkin, Frederik Ebert, Shenghao Zhou, Dinesh Jayaraman, Chelsea Finn, Sergey Levine

Keywords: Applications -> Robotics; Reinforcement Learning and Planning -> Exploration; Reinforcement Learning and Planning -> Reinforcem, Algorithms -> Multitask and Transfer Learning

Abstract: The ability to predict and plan into the future is fundamental for agents acting in the world. To reach a faraway goal, we predict trajectories at multiple timescales, first devising a coarse plan towards the goal and then gradually filling in details. In contrast, current learning approaches for visual prediction and planning fail on long-horizon tasks as they generate predictions (1) without considering goal information, and (2) at the finest temporal resolution, one step at a time. In this work we propose a framework for visual prediction and planning that is able to overcome both of these limitations. First, we formulate the problem of predicting towards a goal and propose the corresponding class of latent space goal-conditioned predictors (GCPs). GCPs significantly improve planning efficiency by constraining the search space to only those trajectories that reach the goal. Further, we show how GCPs can be naturally formulated as hierarchical models that, given two observations, predict an observation between them, and by recursively subdividing each part of the trajectory generate complete sequences. This divide-and-conquer strategy is effective at long-term prediction, and enables us to design an effective hierarchical planning algorithm that optimizes trajectories in a coarse-to-fine manner. We show that by using both goal-conditioning and hierarchical prediction, GCPs enable us to solve visual planning tasks with much longer horizon than previously possible. See prediction and planning videos on the supplementary website: sites.google.com/view/video-gcp.
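The recursive subdivision described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: observations are reduced to plain floats, and `predict_between` is a hypothetical stand-in for the learned goal-conditioned predictor that simply returns the midpoint of its two inputs. The names `infill`, `predict_sequence`, and the `depth` parameter are likewise assumptions made for illustration.

```python
from typing import Callable, List

Obs = float  # assumption: a scalar stands in for an image or latent state


def predict_between(start: Obs, goal: Obs) -> Obs:
    """Hypothetical stand-in for a learned predictor: returns the midpoint."""
    return (start + goal) / 2.0


def infill(start: Obs, goal: Obs, depth: int,
           predictor: Callable[[Obs, Obs], Obs] = predict_between) -> List[Obs]:
    """Return the observations strictly between start and goal, in time order.

    Each call predicts one observation between its two endpoints, then
    recurses on the two halves, so depth d yields 2**d - 1 observations.
    """
    if depth == 0:
        return []
    mid = predictor(start, goal)
    left = infill(start, mid, depth - 1, predictor)
    right = infill(mid, goal, depth - 1, predictor)
    return left + [mid] + right


def predict_sequence(start: Obs, goal: Obs, depth: int) -> List[Obs]:
    """Full trajectory from start to goal, endpoints included."""
    return [start] + infill(start, goal, depth) + [goal]


if __name__ == "__main__":
    print(predict_sequence(0.0, 8.0, 3))
```

With the midpoint stand-in, `predict_sequence(0.0, 8.0, 3)` returns the nine evenly spaced values from 0.0 to 8.0. The first recursion level alone already gives a coarse three-point plan, which is the property a coarse-to-fine planner would exploit.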

The talk and the respective paper are published at the NeurIPS 2020 virtual conference.

Similar Papers

18/07/2021

Model-Based Reinforcement Learning via Latent-Space Collocation

Oleg Rybkin, Chuning Zhu, Anusha Nagabandi, Kostas Daniilidis, Igor Mordatch, Sergey Levine

Keywords: Reinforcement Learning and Planning, Deep RL

5:14

06/12/2021

Neural Algorithmic Reasoners are Implicit Planners

Andreea-Ioana Deac, Petar Veličković, Ognjen Milinkovic, Pierre-Luc Bacon, Jian Tang, Mladen Nikolic

Keywords: deep learning, reinforcement learning and planning, self-supervised learning, generative model, graph learning

13:10

18/07/2021

Temporal Predictive Coding For Model-Based Planning In Latent Space

Tung Nguyen, Rui Shu, Tuan Pham, Hung Bui, Stefano Ermon

Keywords: Deep Learning, Embedding and Representation learning

5:19

03/05/2021

On the role of planning in model-based deep reinforcement learning

Jessica Hamrick, Abram Friesen, Feryal Behbahani, Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Buesing, Petar Veličković, Theo Weber

Keywords: planning, MuZero, model-based RL

5:15

03/05/2021

C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian, Gabriel Loaiza-Ganem, Harry Braviner, Anthony Caterini, Jesse C Cresswell, Tong Li, Animesh Garg

Keywords: reinforcement learning, goal reaching, Q-learning

4:49

05/01/2021

Goal-Driven Long-Term Trajectory Prediction

Hung Tran, Vuong Le, Truyen Tran

5:01

18/07/2021

World Model as a Graph: Learning Latent Landmarks for Planning

Lunjun Zhang, Ge Yang, Bradly Stadie

Keywords: Applications, Computer Vision, Algorithms, Classification; Applications, Computational Social Science; Applications, Visual Scene Analysis and Interpret, Reinforcement Learning and Planning, Deep RL

12:48

02/02/2021

Synthesis of Search Heuristics for Temporal Planning via Reinforcement Learning

Andrea Micheli, Alessandro Valentini

19:00

06/12/2020

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte, Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

3:20

06/12/2020

Is Long Horizon RL More Difficult Than Short Horizon RL?

Ruosong Wang, Simon Du, Lin Yang, Sham Kakade

3:20

03/05/2021

Set Prediction without Imposing Structure as Conditional Density Estimation

David W Zhang, Gertjan J Burghouts, Cees G Snoek

Keywords: energy based models, set prediction

5:02

18/07/2021

Differentiable Spatial Planning using Transformers

Devendra Singh Chaplot, Deepak Pathak, Jitendra Malik

Keywords: Reinforcement Learning and Planning

5:19

06/12/2021

Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning

Christopher Hoang, Sungryull Sohn, Jongwook Choi, Wilka Carvalho, Honglak Lee

Keywords: reinforcement learning and planning, graph learning

14:57

06/12/2021

Automated Dynamic Mechanism Design

Hanrui Zhang, Vincent Conitzer

14:35

18/07/2021

Learning and Planning in Complex Action Spaces

Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Amin Barekatain, Simon Schmitt, David Silver

Keywords: Reinforcement Learning and Planning, Deep RL

5:22

13/04/2021

Abstract value iteration for hierarchical reinforcement learning

Kishor Jothimurugan, Osbert Bastani, Rajeev Alur

2:57

26/10/2020

Learning Neural Search Policies for Classical Planning

Pawel Gomoluch, Dalal Alrajeh, Alessandra Russo, Antonio Bucchiarone

Keywords: classical planning, policy search, machine learning, cross-entropy method

9:55

02/02/2021

Progression Heuristics for Planning with Probabilistic LTL Constraints

Ian Mallett, Sylvie Thiebaux, Felipe Trevizan

18:23

25/07/2020

Dual sequential network for temporal sets prediction

Leilei Sun, Yansong Bai, Bowen Du, Chuanren Liu, Hui Xiong, Weifeng Lv

Keywords: temporal sets prediction, set embedding, deep neural network

17:27

18/07/2021

Off-Belief Learning

Hengyuan Hu, Adam Lerer, Brandon Cui, Luis Pineda, Noam Brown, Jakob Foerster

Keywords: Reinforcement Learning and Planning

5:10

02/02/2021

Present-Biased Optimization

Fedor V. Fomin, Pierre Fraigniaud, Petr A. Golovach

19:38

02/02/2021

Improved Knowledge Modeling and Its Use for Signaling in Multi-Agent Planning with Partial Observability

Shashank Shekhar, Ronen I. Brafman, Guy Shani

18:22

06/12/2020

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Masha Itkina, Boris Ivanovic, Ransalu Senanayake, Mykel J Kochenderfer, Marco Pavone

3:39

03/05/2021

Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots

Keywords: reinforcement learning, model-predictive control

5:09

02/02/2021

Dynamic Memory based Attention Network for Sequential Recommendation

Qiaoyu Tan, Jianwei Zhang, Ninghao Liu, Xiao Huang, Hongxia Yang, Jingren Zhou, Xia Hu

15:56

26/10/2020

On timeline-based games and their complexity

Nicola Gigante, Angelo Montanari, Andrea Orlandini, Marta Cialdea Mayer, Mark Reynolds

Keywords: Timeline-based planning, Computational complexity, Games

10:50

18/07/2021

Outside the Echo Chamber: Optimizing the Performative Risk

John Miller, Juan Perdomo, Tijana Zrnic

Keywords: Theory, Statistical Learning Theory

5:05

22/11/2021

ASFormer: Transformer for Action Segmentation

Fangqiu Yi, Hongyu Wen, Tingting Jiang

Keywords: action segmentation, transformer, action detection

2:53

06/12/2020

Planning in Markov Decision Processes with Gap-Dependent Sample Complexity

Anders Jonsson, Emilie Kaufmann, Pierre Menard, Omar Darwiche Domingues, Edouard Leurent, Michal Valko

3:19

06/12/2020

Reinforcement Learning for Control with Multiple Frequencies

Jongmin Lee, Byung-Jun Lee, Kee-Eung Kim

Keywords: Algorithms -> Multitask and Transfer Learning; Deep Learning -> Supervised Deep Networks; Theory -> Learning Theory; Theory -> , Deep Learning

3:21

02/02/2021

Learning General Planning Policies from Small Examples Without Supervision

Guillem Francès, Blai Bonet, Hector Geffner

17:17

06/12/2020

Belief-Dependent Macro-Action Discovery in POMDPs using the Value of Information

Genevieve Flaspohler, Nicholas Roy, John Fisher III

3:23

26/10/2020

Probabilistic Robust Multi-Agent Path Finding

Dor Atzmon, Roni Stern, Ariel Felner, Nathan R. Sturtevant, Sven Koenig

Keywords: Probabilistic, Robust, Multi-Agent Path Finding, MAPF, Collisions, Conflicts, Planning

10:00

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng, Ben Letham, Hongzi Mao, Eytan Bakshy

3:29

03/05/2021

C-Learning: Learning to Achieve Goals via Recursive Classification

Ben Eysenbach, Ruslan Salakhutdinov, Sergey Levine

Keywords: reinforcement learning, goal reaching, density estimation, hindsight relabeling, Q-learning

5:09

06/12/2020

Beyond Individualized Recourse: Interpretable and Interactive Summaries of Actionable Recourses

Kai Rawal, Himabindu Lakkaraju

3:31

02/02/2021

On-line Learning of Planning Domains from Sensor Data in PAL: Scaling up to Large State Spaces

Leonardo Lamanna, Alfonso Emilio Gerevini, Alessandro Saetti, Luciano Serafini, Paolo Traverso

16:05

12/07/2020

Flexible and Efficient Long-Range Planning Through Curious Exploration

Aidan Curtis, Minjian Xin, Dilip Arumugam, Kevin Feigelis, Daniel Yamins

Keywords: Reinforcement Learning - General

16:25

26/10/2020

Exploring Context-Free Languages via Planning: The Case for Automating Machine Learning

Michael Katz, Parikshit Ram, Shirin Sohrabi, Octavian Udrea

Keywords: Context-Free Grammar, HTN Planning, Classical Planning, AutoML

9:25

18/07/2021

Active Feature Acquisition with Generative Surrogate Models

Yang Li, Junier Oliva

Keywords: Deep Learning, Generative Models, Applications, Computational Biology and Bioinformatics, Reinforcement Learning and Planning, Deep RL

5:44