Data-efficient Hindsight Off-policy Option Learning

18/07/2021

Data-efficient Hindsight Off-policy Option Learning

Markus Wulfmeier, Dushyant Rao, Roland Hafner, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Michael Neunert, Dhruva Tirumala Bukkapatnam, Noah Siegel, Nicolas Heess, Martin Riedmiller

Keywords: Reinforcement Learning and Planning, Deep RL

Abstract Paper Similar Papers

Abstract: We introduce Hindsight Off-policy Options (HO2), a data-efficient option learning algorithm. Given any trajectory, HO2 infers likely option choices and backpropagates through the dynamic programming inference procedure to robustly train all policy components off-policy and end-to-end. The approach outperforms existing option learning methods on common benchmarks. To better understand the option framework and disentangle benefits from both temporal and action abstraction, we evaluate ablations with flat policies and mixture policies with comparable optimization. The results highlight the importance of both types of abstraction as well as off-policy training and trust-region constraints, particularly in challenging, simulated 3D robot manipulation tasks from raw pixel inputs. Finally, we intuitively adapt the inference step to investigate the effect of increased temporal abstraction on training with pre-trained options and from scratch.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

Tuning-free Plug-and-Play Proximal Algorithm for Inverse Imaging Problems

Kaixuan Wei, Angelica I Aviles-Rivero, Jingwei Liang and
Ying Fu, Carola-Bibiane Schönlieb, Hua Huang

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

11:48

14/06/2020

Autolabeling 3D Objects With Differentiable Rendering of SDF Shape Priors

Sergey Zakharov, Wadim Kehl, Arjun Bhargava, Adrien Gaidon

Keywords Paper

autolabeling, differentiable rendering, pose and shape optimization, curriculum learning, object detection, autonomous driving, 3d shape modeling

0

0

0

0

4:59

16/11/2020

Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation

Bryan Chen, Alexander Sax, Francis Lewis and
Iro Armeni, Silvio Savarese, Amir Zamir, Jitendra Malik, Lerrel Pinto

Keywords Paper

0

0

0

0

5:06

02/02/2021

Generalized Adversarially Learned Inference

Yatin Dandi, Homanga Bharadhwaj, Abhishek Kumar, Piyush Rai

Keywords Paper

0

0

0

0

16:22

06/12/2021

Searching Parameterized AP Loss for Object Detection

Tao Chenxin, Zizhang Li, Xizhou Zhu and
Gao Huang, Yong Liu, jifeng dai

Keywords Paper

machine learning, vision

0

0

0

0

6:13

12/07/2020

Generalization to New Actions in Reinforcement Learning

Ayush Jain, Andrew Szot, Joseph Lim

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:01

06/12/2020

Pixel-Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation

Guoliang Kang, Yunchao Wei, Yi Yang and
Yueting Zhuang, Alexander Hauptmann

Keywords Paper

0

0

0

0

3:16

07/09/2020

Unsupervised Domain Adaptation for Spatio-Temporal Action Localization

Nakul Agarwal, Yi-Ting Chen, Behzad Dariush, Ming-Hsuan Yang

Keywords Paper

Spatio-Temporal Action Localization, Unsupervised Domain Adaptation, Adversarial Learning, Video Analysis, Deep Learning

0

0

0

0

9:28

14/09/2020

A Generic and Model-Agnostic Exemplar Synthetization Framework for Explainable AI

Antonio Barbalau, Adrian Cosma, Radu Tudor Ionescu, Marius Popescu

Keywords Paper

explainable ai, black-box, generative modelling, evolutionary algorithm, prototype synthetization, exemplar generation

0

0

0

0

10:08

22/11/2021

On Automatic Data Augmentation for 3D Point Cloud Classification

Wanyue Zhang, Xun Xu, Fayao Liu and
Le Zhang, Chuan Sheng Foo

Keywords Paper

point cloud, automatic data augmentation

0

0

0

0

2:59

03/05/2021

Generating Adversarial Computer Programs using Optimized Obfuscations

Shashank Srikant, Sijia Liu, Tamara Mitrovska and
Shiyu Chang, Quanfu Fan, Gaoyuan Zhang, Una-May O'Reilly

Keywords Paper

Models for code, Differentiable program generator, Combinatorial optimization, Program obfuscation, Adversarial computer programs, Machine Learning (ML) for Programming Languages (PL)/Software Engineering (SE)

0

0

0

0

6:27

26/04/2020

A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms

Yoshua Bengio, Tristan Deleu, Nasim Rahaman and
Nan Rosemary Ke, Sebastien Lachapelle, Olexa Bilaniuk, Anirudh Goyal, Christopher Pal

Keywords Paper

meta-learning, transfer learning, structure learning, modularity, causality

0

0

0

0

5:25

19/08/2021

DACBench: A Benchmark Library for Dynamic Algorithm Configuration

Theresa Eimer, André Biedenkapp, Maximilian Reimer and
Steven Adriansen, Frank Hutter, Marius Lindauer

Keywords Paper

Heuristic Search and Game Playing, Evaluation and Analysis, Heuristic Search and Machine Learning, Meta-Reasoning and Meta-Heuristics

0

0

0

0

13:51

12/07/2020

ControlVAE: Controllable Variational Autoencoder

Huajie Shao, Shuochao Yao, Dachun Sun and
Aston Zhang, Shengzhong Liu, Dongxin Liu, Jun Wang, Tarek Abdelzaher

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

14:22

18/07/2021

DORO: Distributional and Outlier Robust Optimization

Runtian Zhai, Chen Dan, Zico Kolter, Pradeep Ravikumar

Keywords Paper

Probabilistic Methods, Robust statistics

0

0

0

1

5:06

12/07/2020

Sub-Goal Trees -- a Framework for Goal-Based Reinforcement Learning

Tom Jurgenson, Or Avner, Edward Groshev, Aviv Tamar

Keywords Paper

Reinforcement Learning - General

0

0

0

0

15:04

14/06/2020

SQE: a Self Quality Evaluation Metric for Parameters Optimization in Multi-Object Tracking

Yanru Huang, Feiyu Zhu, Zheni Zeng and
Xi Qiu, Yuan Shen, Jianan Wu

Keywords Paper

multi-object tracking, self quality evaluation, gaussian mixture model, parameters self-optimization

0

0

0

0

1:00

03/05/2021

HalentNet: Multimodal Trajectory Forecasting with Hallucinative Intents

Deyao Zhu, Mohamed Zahran, Li Erran Li, Mohamed Elhoseiny

Keywords Paper

0

0

0

0

5:18

02/02/2021

Model Uncertainty Guides Visual Object Tracking

Lijun Zhou, Antoine Ledent, Qintao Hu and
Ting Liu, Jianlin Zhang, Marius Kloft

Keywords Paper

0

0

0

0

18:06

06/12/2021

Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning

Siyuan Zhang, Nan Jiang

Keywords Paper

reinforcement learning and planning

0

0

0

0

13:23

03/05/2021

CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

Ossama Ahmed, Frederik Träuble, Anirudh Goyal and
Alexander Neitz, Manuel Wuthrich, Yoshua Bengio, Bernhard Schoelkopf, Stefan Bauer

Keywords Paper

reinforcement learning, transfer learning, robotics, domain adaptation, generalization, causality, sim2real transfer

0

0

0

0

5:03

18/07/2021

Decoupling Representation Learning from Reinforcement Learning

Adam Stooke, Kimin Lee, Pieter Abbeel, Michael Laskin

Keywords Paper

Optimization, Submodular Optimization, Algorithms, Bandit Algorithms; Algorithms, Online Learning, Deep Learning, Embedding and Representation learning

0

0

0

0

5:15

14/06/2020

Density-Based Clustering for 3D Object Detection in Point Clouds

Syeda Mariam Ahmed, Chee Meng Chew

Keywords Paper

3d object detection, edge-aware pointnet, instance segmentation, unsupervised clustering, cascaded modules, semantic segmentation, amodal bounding box detection

0

0

0

0

0:51

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer and
Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords Paper

Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

0

0

0

0

5:05

16/11/2020

Self-Supervised 3D Keypoint Learning for Ego-Motion Estimation

Jiexiong Tang, Rareș Ambruș, Vitor Guizilini and
Sudeep Pillai, Hanme Kim, Patric Jensfelt, Adrien Gaidon

Keywords Paper

0

0

0

0

5:05

14/09/2020

Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning

Arjun Manoharan, Rahul Ramesh, Balaraman Ravindran

Keywords Paper

hierarchical reinforcement learning, policy distillation

0

0

0

0

13:49

02/02/2021

Embodied Visual Active Learning for Semantic Segmentation

David Nilsson, Aleksis Pirinen, Erik Gärtner, Cristian Sminchisescu

Keywords Paper

0

0

0

0

18:49

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng , Ben Letham, Hongzi Mao, Eytan Bakshy

Keywords Paper

0

0

0

0

3:29

16/11/2020

Probably Approximately Correct Vision-Based Planning using Motion Primitives

Sushant Veer, Anirudha Majumdar

Keywords Paper

0

0

0

0

5:01

02/02/2021

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate and
Tom Melham, Daniel Kroening

Keywords Paper

0

0

0

0

15:45

06/12/2021

Robust Generalization despite Distribution Shift via Minimum Discriminating Information

Tobias Sutter, Andreas Krause, Daniel Kuhn

Keywords Paper

optimization, machine learning

0

0

0

0

15:05

14/06/2020

DIST: Rendering Deep Implicit Signed Distance Function With Differentiable Sphere Tracing

Shaohui Liu, Yinda Zhang, Songyou Peng and
Boxin Shi, Marc Pollefeys, Zhaopeng Cui

Keywords Paper

differentiable rendering, 3d reconstruction, implicit representations, multi-view reconstruction, depth completion, 3d deep learning

0

0

0

0

1:01

18/07/2021

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation

Scott Fujimoto, David Meger, Doina Precup

Keywords Paper

Reinforcement Learning and Planning, Deep RL

1

0

0

0

3:50

16/11/2020

Learning Dexterous Manipulation from Suboptimal Experts

Rae Jeong, Jost Tobias Springenberg, Jackie Kay and
Dan Zheng, Alexandre Galashov, Nicolas Heess, Francesco Nori

Keywords Paper

0

0

0

0

5:03

06/12/2020

Posterior Re-calibration for Imbalanced Datasets

Junjiao Tian, Yen-Cheng Liu, Nathaniel Glaser and
Yen-Chang Hsu, Zsolt Kira

Keywords Paper

Algorithms -> Few-Shot Learning, Applications -> Computer Vision

0

0

0

0

3:23

06/12/2021

Grounding inductive biases in natural images: invariance stems from variations in data

Diane Bouchacourt, Mark Ibrahim, Ari Morcos

Keywords Paper

machine learning, transformers

0

0

0

0

14:19

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

0

0

0

0

4:15

03/05/2021

OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning

Anurag Ajay, Aviral Kumar, Pulkit Agrawal and
Sergey Levine, Ofir Nachum

Keywords Paper

Unsupervised Learning, Offline Reinforcement Learning, Primitive Discovery

0

0

0

0

5:08

30/11/2020

Synthetic-to-real domain adaptation for lane detection

Noa Garnett, Roy Uziel, Netalee Efrat, Dan Levi

Keywords Paper

0

0

0

0

9:15

06/12/2021

Post-Training Quantization for Vision Transformer

Zhenhua Liu, Yunhe Wang, Kai Han and
Wei Zhang, Siwei Ma, Wen Gao

Keywords Paper

deep learning, transformers, vision

0

0

0

0

5:52