Planning from Pixels using Inverse Dynamics Models

03/05/2021

Keiran Paster, Sheila McIlraith, Jimmy Ba

Keywords: model based reinforcement learning, deep learning, goal-conditioned reinforcement learning, deep reinforcement learning, multi-task learning

Abstract: Learning dynamics models in high-dimensional observation spaces can be challenging for model-based RL agents. We propose a novel way to learn models in a latent space by learning to predict sequences of future actions conditioned on task completion. These models track task-relevant environment dynamics over a distribution of tasks, while simultaneously serving as an effective heuristic for planning with sparse rewards. We evaluate our method on challenging visual goal completion tasks and show a substantial increase in performance compared to prior model-free approaches.

The talk and the corresponding paper were published at the ICLR 2021 virtual conference.

Similar Papers

26/04/2020

Dream to Control: Learning Behaviors by Latent Imagination

Danijar Hafner, Timothy Lillicrap, Jimmy Ba, Mohammad Norouzi

Keywords: world model, latent dynamics, imagination, planning by backprop, policy optimization, planning, reinforcement learning, control, representations, latent variable model, visual control, value function

5:01

03/05/2021

Latent Skill Planning for Exploration and Transfer

Kevin Xie, Homanga Bharadhwaj, Danijar Hafner, Animesh Garg, Florian Shkurti

Keywords: Partial Amortization, Model Predictive Control, Planning, Mutual Information, Skill Discovery, World Models, Model-Based Reinforcement Learning

5:10

06/12/2020

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte, Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

3:20

03/05/2021

Learning Task Decomposition with Ordered Memory Policy Network

Yuchen Lu, Yikang Shen, Siyuan Zhou, Aaron Courville, Joshua B Tenenbaum, Chuang Gan

Keywords: Task Segmentation, Network Inductive Bias, Hierarchical Imitation Learning

4:57

02/02/2021

Progressive Multi-task Learning with Controlled Information Flow for Joint Entity and Relation Extraction

Kai Sun, Richong Zhang, Samuel Mensah, Yongyi Mao, Xudong Liu

13:45

18/07/2021

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Luisa Zintgraf, Leo Feng, Cong Lu, Max Igl, Kristian Hartikainen, Katja Hofmann, Shimon Whiteson

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

5:52

19/08/2021

Deep Reinforcement Learning with Hierarchical Structures

Siyuan Li

Keywords: Machine Learning, Reinforcement Learning

13:34

03/05/2021

Learning Associative Inference Using Fast Weight Memory

Imanol Schlag, Tsendsuren Munkhdalai, Jürgen Schmidhuber

Keywords: fast weights, memory-augmented neural networks, tensor product

4:29

12/07/2020

Optimizing Data Usage via Differentiable Rewards

Xinyi Wang, Hieu Pham, Paul Michel, Antonios Anastasopoulos, Jaime Carbonell, Graham Neubig

Keywords: Deep Learning - Algorithms

12:53

06/12/2021

Hierarchical Reinforcement Learning with Timed Subgoals

Nico Gürtler, Dieter Büchler, Georg Martius

Keywords: reinforcement learning and planning

8:17

06/12/2021

Discovery of Options via Meta-Learned Subgoals

Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Keywords: deep learning, reinforcement learning and planning

4:13

12/07/2020

Growing Action Spaces

Gregory Farquhar, Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve

Keywords: Reinforcement Learning - Deep RL

16:31

06/12/2021

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

Hung Le, Thommen Karimpanal George, Majid Abdolshah, Truyen Tran, Svetha Venkatesh

Keywords: reinforcement learning and planning

15:06

06/12/2020

PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals

Henry Charlesworth, Giovanni Montana

3:20

06/12/2020

Self-Paced Deep Reinforcement Learning

Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

3:00

26/04/2020

Frequency-based Search-control in Dyna

Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand

Keywords: Model-based reinforcement learning, search-control, Dyna, frequency of a signal

4:32

12/07/2020

Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning

Zhaohan Guo, Bernardo Avila Pires, Mohammad Gheshlaghi Azar, Bilal Piot, Florent Altché, Jean-Bastien Grill, Remi Munos

Keywords: Reinforcement Learning - Deep RL

12:47

03/05/2021

Adaptive Procedural Task Generation for Hard-Exploration Problems

Kuan Fang, Yuke Zhu, Silvio Savarese, Li Fei-Fei

Keywords: reinforcement learning, task generation, procedural generation, curriculum learning

5:06

18/07/2021

Inverse Decision Modeling: Learning Interpretable Representations of Behavior

Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Keywords: Reinforcement Learning and Planning, Others

16:11

06/12/2020

Learning to Incentivize Other Learning Agents

Jiachen Yang, Ang Li, Mehrdad Farajtabar, Peter Sunehag, Edward Hughes, Hongyuan Zha

3:20

06/12/2020

Automatic Curriculum Learning through Value Disagreement

Yunzhi Zhang, Pieter Abbeel, Lerrel Pinto

3:17

06/12/2021

Learning Knowledge Graph-based World Models of Textual Environments

Prithviraj Ammanabrolu, Mark Riedl

Keywords: reinforcement learning and planning, transformers, graph learning, language

15:32

18/07/2021

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Shariq Iqbal, Christian Schroeder, Bei Peng, Wendelin Boehmer, Shimon Whiteson, Fei Sha

Keywords: Optimization, Convex Optimization, Reinforcement Learning and Planning, Multi-Agent RL, Algorithms, Large Scale Learning; Probabilistic Methods, Distributed Inference

20:08

13/04/2021

A theory of multiple-source adaptation with limited target labeled data

Yishay Mansour, Mehryar Mohri, Jae Ro, Ananda Theertha Suresh, Ke Wu

2:39

22/11/2021

Single-Modal Entropy based Active Learning for Visual Question Answering

Dong-Jin Kim, Jae Won Cho, Jinsoo Choi, Yunjae Jung, In So Kweon

Keywords: Visual Question Answering, Vision and Language, Active Learning

2:42

03/05/2021

Learning perturbation sets for robust machine learning

Eric Wong, Zico Kolter

Keywords: conditional variational autoencoder, adversarial examples, perturbation sets, robust machine learning

5:06

02/02/2021

Reinforcement Learning of Sequential Price Mechanisms

Gianluca Brero, Alon Eden, Matthias Gerstgrasser, David Parkes, Duncan Rheingans-Yoo

18:11

12/07/2020

Provably Efficient Model-based Policy Adaptation

Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao

Keywords: Reinforcement Learning - Deep RL

15:49

06/12/2020

Value-driven Hindsight Modelling

Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess

3:20

06/12/2020

Continuous Meta-Learning without Tasks

James Harrison, Apoorva Sharma, Chelsea Finn, Marco Pavone

3:09

02/02/2021

Dynamic Automaton-Guided Reward Shaping for Monte Carlo Tree Search

Alvaro Velasquez, Brett Bissey, Lior Barak, Andre Beckus, Ismail Alkhouri, Daniel Melcer, George Atia

18:52

06/12/2020

EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning

Jiachen Li, Fan Yang, Masayoshi Tomizuka, Chiho Choi

3:21

06/12/2021

Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning

Christopher Hoang, Sungryull Sohn, Jongwook Choi, Wilka Carvalho, Honglak Lee

Keywords: reinforcement learning and planning, graph learning

14:57

19/08/2021

State-Based Recurrent SPMNs for Decision-Theoretic Planning under Partial Observability

Layton Hayes, Prashant Doshi, Swaraj Pawar, Hari Teja Tatavarti

Keywords: Machine Learning, Learning Graphical Models, Model-Based Reasoning, Planning under Uncertainty

12:39

16/11/2020

Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments

Jun Yamada, Youngwoon Lee, Gautam Salhotra, Karl Pertsch, Max Pflueger, Gaurav Sukhatme, Joseph Lim, Peter Englert

4:59

26/04/2020

Environmental drivers of systematicity and generalization in a situated agent

Felix Hill, Andrew Lampinen, Rosalia Schneider, Stephen Clark, Matthew Botvinick, James L. McClelland, Adam Santoro

Keywords: systematicity, systematic, generalization, combinatorial, agent, policy, language, compositionality

5:44

03/05/2021

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh, Huihan Liu, Gaoyue Zhou, Albert Yu, Nicholas Rhinehart, Sergey Levine

Keywords: reinforcement learning, imitation learning

14:21

06/12/2020

Deep Reinforcement and InfoMax Learning

Bogdan Mazoure, Remi Tachet des Combes, Thang Doan, Philip Bachman, R Devon Hjelm

3:15

26/04/2020

Automated Relational Meta-learning

Huaxiu Yao, Xian Wu, Zhiqiang Tao, Yaliang Li, Bolin Ding, Ruirui Li, Zhenhui Li

Keywords: meta-learning, task heterogeneity, meta-knowledge graph

5:13

06/12/2020

Learning Multi-Agent Communication through Structured Attentive Reasoning

Murtaza Rangwala, Ryan K Williams

3:21