Learning Subgoal Representations with Slow Dynamics

03/05/2021

Learning Subgoal Representations with Slow Dynamics

Siyuan Li, Lulu Zheng, Jianhao Wang, Chongjie Zhang

Keywords: Exploration, Hierarchical Reinforcement Learning, Representation Learning

Abstract Paper Similar Papers

Abstract: In goal-conditioned Hierarchical Reinforcement Learning (HRL), a high-level policy periodically sets subgoals for a low-level policy, and the low-level policy is trained to reach those subgoals. A proper subgoal representation function, which abstracts a state space to a latent subgoal space, is crucial for effective goal-conditioned HRL, since different low-level behaviors are induced by reaching subgoals in the compressed representation space. Observing that the high-level agent operates at an abstract temporal scale, we propose a slowness objective to effectively learn the subgoal representation (i.e., the high-level action space). We provide a theoretical grounding for the slowness objective. That is, selecting slow features as the subgoal space can achieve efficient hierarchical exploration. As a result of better exploration ability, our approach significantly outperforms state-of-the-art HRL and exploration methods on a number of benchmark continuous-control tasks. Thanks to the generality of the proposed subgoal representation learning method, empirical results also demonstrate that the learned representation and corresponding low-level policies can be transferred between distinct tasks.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Learning to Forget for Meta-Learning

Sungyong Baik, Seokil Hong, Kyoung Mu Lee

Keywords Paper

meta learning, few-shot learning, reinforcement learning

0

0

0

0

1:01

06/12/2021

Learning One Representation to Optimize All Rewards

Ahmed Touati, Yann Ollivier

Keywords Paper

deep learning, reinforcement learning and planning, representation learning

0

0

0

0

14:52

19/08/2021

Conditional Self-Supervised Learning for Few-Shot Classification

Yuexuan An, Hui Xue, Xingyu Zhao, Lu Zhang

Keywords Paper

Machine Learning, Classification, Transfer, Adaptation, Multi-task Learning, Unsupervised Learning

0

0

0

0

9:06

06/12/2021

Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning

Junsu Kim, Younggyo Seo, Jinwoo Shin

Keywords Paper

reinforcement learning and planning, graph learning

0

0

0

0

13:42

16/11/2020

Safe Policy Learning for Continuous Control

Yinlam Chow, Ofir Nachum, Aleksandra Faust and
Edgar Dueñez-Guzman, Mohammad Ghavamzadeh

Keywords Paper

0

0

0

0

5:20

12/07/2020

Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization

Sicheng Zhu, Xiao Zhang, David Evans

Keywords Paper

Adversarial Examples

0

0

0

0

10:03

06/12/2020

Off-Policy Imitation Learning from Observations

Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

Keywords Paper

0

0

0

1

3:24

06/12/2021

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning

Kai Wang, Sanket Shah, Haipeng Chen and
Andrew Perrault, Finale Doshi-Velez, Milind Tambe

Keywords Paper

deep learning, optimization, reinforcement learning and planning

0

0

0

0

14:52

06/12/2020

Model-based Adversarial Meta-Reinforcement Learning

Zichuan Lin, Garrett Thomas, Guangwen Yang, Tengyu Ma

Keywords Paper

0

0

0

0

3:31

06/12/2021

Counterfactual Maximum Likelihood Estimation for Training Deep Networks

Xinyi Wang, Wenhu Chen, Michael Saxon, William Yang Wang

Keywords Paper

deep learning, domain adaptation, causality, language

0

0

0

0

14:07

18/07/2021

A Distribution-dependent Analysis of Meta Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvari

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:06

05/01/2021

Zero-Shot Recognition via Optimal Transport

Wenlin Wang, Hongteng Xu, Guoyin Wang and
Wenqi Wang, Lawrence Carin

Keywords Paper

0

0

0

2

3:35

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords Paper

reinforcement learning and planning, robustness, representation learning

0

0

0

0

12:24

03/05/2021

UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers

Siyi Hu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang

Keywords Paper

Transfer Learning, Multi-agent Reinforcement Learning

0

0

0

0

2:46

12/07/2020

Automatic Shortcut Removal for Self-Supervised Representation Learning

Matthias Minderer, Olivier Bachem, Neil Houlsby, Michael Tschannen

Keywords Paper

Representation Learning

0

0

0

0

13:28

18/07/2021

Temporal Predictive Coding For Model-Based Planning In Latent Space

Tung Nguyen, Rui Shu, Tuan Pham and
Hung Bui, Stefano Ermon

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

5:19

02/02/2021

Deep Metric Learning with Self-Supervised Ranking

Zheren Fu, Yan Li, Zhendong Mao and
Quan Wang, Yongdong Zhang

Keywords Paper

0

0

0

0

12:36

06/12/2020

Generalization Bound of Gradient Descent for Non-Convex Metric Learning

MINGZHI DONG, Xiaochen Yang, Rui Zhu and
Yujiang Wang, Jing-Hao Xue

Keywords Paper

0

0

0

0

3:18

13/04/2021

Bayesian active learning by soft mean objective cost of uncertainty

Guang Zhao, Edward Dougherty, Byung-Jun Yoon and
Francis J. Alexander, Xiaoning Qian

Keywords Paper

0

0

0

0

3:02

19/04/2021

Exploring supervised and unsupervised rewards in machine translation

Julia Ive, Zixu Wang, Marina Fomicheva, Lucia Specia

Keywords Paper

0

0

0

0

10:52

06/12/2021

COMBO: Conservative Offline Model-Based Policy Optimization

Tianhe Yu, Aviral Kumar, Rafael Rafailov and
Aravind Rajeswaran, Sergey Levine, Chelsea Finn

Keywords Paper

deep learning, optimization, reinforcement learning and planning

0

0

0

0

12:35

06/12/2020

MOReL: Model-Based Offline Reinforcement Learning

Rahul Kidambi, Aravind Rajeswaran, Praneeth Netrapalli, Thorsten Joachims

Keywords Paper

1

0

0

0

3:23

18/07/2021

Decision-Making Under Selective Labels: Optimal Finite-Domain Policies and Beyond

Dennis Wei

Keywords Paper

Applications, Computer Vision, Deep Learning, Adversarial Networks; Deep Learning, Generative Models, Social Aspects of Machine Learning

0

0

0

0

5:13

13/04/2021

Learning with gradient descent and weakly convex losses

Dominic Richards, Mike Rabbat

Keywords Paper

0

0

0

0

3:20

18/07/2021

Lower Bounds on Cross-Entropy Loss in the Presence of Test-time Adversaries

Arjun Nitin Bhagoji, Daniel Cullina, Vikash Sehwag, Prateek Mittal

Keywords Paper

Algorithms, Adversarial Examples

0

0

0

0

5:10

12/07/2020

Goal-Aware Prediction: Learning to Model What Matters

Suraj Nair, Silvio Savarese, Chelsea Finn

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

11:16

06/12/2020

Greedy inference with structure-exploiting lazy maps

Michael Brennan, Daniele Bigoni, Olivier Zahm and
Alessio Spantini, Youssef Marzouk

Keywords Paper

0

0

0

0

3:22

06/12/2020

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

Alekh Agarwal, Sham Kakade, Akshay Krishnamurthy, Wen Sun

Keywords Paper

0

0

0

0

3:34

06/12/2020

Auxiliary Task Reweighting for Minimum-data Learning

Baifeng Shi, Judy Hoffman, Kate Saenko and
Trevor Darrell, Huijuan Xu

Keywords Paper

0

0

0

0

3:28

01/07/2020

Adaptive Dialog Policy Learning with Hindsight and User Modeling

Yan Cao, Keting Lu, Xiaoping Chen, Shiqi Zhang

Keywords Paper

0

0

0

0

11:22

07/09/2020

Towards a Hypothesis on Visual Transformation based Self-Supervision

Dipan Pal, Sreena Nallamothu, Marios Savvides

Keywords Paper

self supervision, rotation transformation, rot net, visual transformation self supervision

0

0

0

0

7:31

14/09/2020

A Decision-Theoretic Approach for Model Interpretability in Bayesian Framework

Homayun Afrabandpey, Tomi Peltola, Juho Piironen and
Aki Vehtari, Samuel Kaski

Keywords Paper

0

0

0

0

15:22

26/04/2020

Ranking Policy Gradient

Kaixiang Lin, Jiayu Zhou

Keywords Paper

Sample-efficient reinforcement learning, off-policy learning.

0

0

0

0

5:43

18/07/2021

Is Pessimism Provably Efficient for Offline RL?

Ying Jin, Zhuoran Yang, Zhaoran Wang

Keywords Paper

Reinforcement Learning and Planning, Others

0

0

0

0

5:17

13/04/2021

Spectral tensor train parameterization of deep learning layers

Anton Obukhov, Maxim Rakhuba, Alexander Liniger and
Zhiwu Huang, Stamatios Georgoulis, Dengxin Dai, Luc Van Gool

Keywords Paper

0

0

0

0

3:09

13/04/2021

Non-stationary off-policy optimization

Joey Hong, Branislav Kveton, Manzil Zaheer and
Yinlam Chow, Amr Ahmed

Keywords Paper

0

0

0

0

2:57

06/12/2020

Probably Approximately Correct Constrained Learning

Luiz Chamon, Alejandro Ribeiro

Keywords Paper

0

0

0

0

3:19

13/04/2021

Regularized policies are reward robust

Hisham Husain, Kamil Ciosek, Ryota Tomioka

Keywords Paper

0

0

0

0

2:21

06/12/2021

Taxonomizing local versus global structure in neural network loss landscapes

Yaoqing Yang, Liam Hodgkinson, Ryan Theisen and
Joe Zou, Joseph Gonzalez, Kannan Ramchandran, Michael W Mahoney

Keywords Paper

deep learning, machine learning

0

0

0

0

13:56

03/05/2021

Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning

Dong Bok Lee, Dongchan Min, Seanie Lee, Sung Ju Hwang

Keywords Paper

Unsupervised Learning, Variational Autoencoders, Unsupervised Meta-learning, Meta-Learning

0

0

0

0

13:31