Dynamic Bottleneck for Robust Self-Supervised Exploration

06/12/2021

Chenjia Bai, Lingxiao Wang, Lei Han, Animesh Garg, Jianye Hao, Peng Liu, Zhaoran Wang

Keywords: reinforcement learning and planning

Abstract: Exploration methods based on pseudo-counts of transitions or curiosity about dynamics have achieved promising results in reinforcement learning with sparse rewards. However, such methods are usually sensitive to dynamics-irrelevant information in the environment, e.g., white noise. To handle such information, we propose a Dynamic Bottleneck (DB) model, which attains a dynamics-relevant representation based on the information-bottleneck principle. Based on the DB model, we further propose the DB-bonus, which encourages the agent to explore state-action pairs with high information gain. We establish theoretical connections between the proposed DB-bonus, the upper confidence bound (UCB) in the linear case, and the visitation count in the tabular case. We evaluate the proposed method on the Atari suite with dynamics-irrelevant noise. Our experiments show that exploration with the DB-bonus outperforms several state-of-the-art exploration methods in noisy environments.
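The abstract relates the DB-bonus to the UCB bonus in the linear case and to visitation counts in the tabular case. The sketch below does not implement the DB-bonus itself; it only illustrates the standard, well-known link between those two reference points. It assumes the usual linear UCB bonus beta * sqrt(phi^T Lambda^{-1} phi) with Lambda = lam * I + sum of phi phi^T, and shows that with one-hot (tabular) features this reduces to beta / sqrt(N + lam), where N is the visitation count. All function names, the feature dimension, and the visit sequence are hypothetical and chosen for illustration.

```python
import math

def make_inverse(dim, lam=1.0):
    """Inverse of the initial regularized matrix Lambda = lam * I."""
    return [[(1.0 / lam if i == j else 0.0) for j in range(dim)] for i in range(dim)]

def matvec(m, v):
    """Matrix-vector product for nested lists."""
    return [sum(mij * vj for mij, vj in zip(row, v)) for row in m]

def update_inverse(inv, phi):
    """Sherman-Morrison update of Lambda^{-1} after Lambda <- Lambda + phi phi^T."""
    u = matvec(inv, phi)  # Lambda^{-1} phi (Lambda is symmetric)
    denom = 1.0 + sum(p * ui for p, ui in zip(phi, u))
    n = len(phi)
    return [[inv[i][j] - u[i] * u[j] / denom for j in range(n)] for i in range(n)]

def ucb_bonus(inv, phi, beta=1.0):
    """Linear UCB bonus: beta * sqrt(phi^T Lambda^{-1} phi)."""
    u = matvec(inv, phi)
    return beta * math.sqrt(sum(p * ui for p, ui in zip(phi, u)))

def one_hot(index, dim):
    """Tabular feature map: one indicator per state-action pair."""
    return [1.0 if i == index else 0.0 for i in range(dim)]

# Hypothetical tabular problem with 3 state-action pairs:
# pair 0 is visited four times, pair 1 once, pair 2 never.
dim, lam = 3, 1.0
visits = [0, 0, 0, 0, 1]
inv = make_inverse(dim, lam)
for idx in visits:
    inv = update_inverse(inv, one_hot(idx, dim))

counts = [visits.count(i) for i in range(dim)]
bonuses = [ucb_bonus(inv, one_hot(i, dim)) for i in range(dim)]
count_bonuses = [1.0 / math.sqrt(n + lam) for n in counts]

for i in range(dim):
    print(i, counts[i], round(bonuses[i], 4), round(count_bonuses[i], 4))
```

With one-hot features, Lambda stays diagonal, so the two bonus columns agree and the bonus shrinks as a pair is visited more often. With general features the same `ucb_bonus` call applies, but the count-based shortcut no longer does.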

The talk and the corresponding paper were published at the NeurIPS 2021 virtual conference.

Similar Papers

18/07/2021

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation

Scott Fujimoto, David Meger, Doina Precup

Keywords: Reinforcement Learning and Planning, Deep RL

3:50

06/12/2020

Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards

Yijie Guo, Jongwook Choi, Marcin Moczulski, Shengyu Feng, Samy Bengio, Mohammad Norouzi, Honglak Lee

3:30

19/08/2021

Hindsight Trust Region Policy Optimization

Hanbo Zhang, Site Bai, Xuguang Lan, David Hsu, Nanning Zheng

Keywords: Machine Learning, Deep Reinforcement Learning, Reinforcement Learning

13:14

06/12/2020

Latent World Models For Intrinsically Motivated Exploration

Aleksandr Ermolov, Nicu Sebe

2:47

02/02/2021

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate, Tom Melham, Daniel Kroening

15:45

06/12/2021

Open-set Label Noise Can Improve Robustness Against Inherent Label Noise

Hongxin Wei, Lue Tao, Renchunzi Xie, Bo An

Keywords: deep learning, robustness

2:46

26/04/2020

On Bonus Based Exploration Methods In The Arcade Learning Environment

Adrien Ali Taiga, William Fedus, Marlos C. Machado, Aaron Courville, Marc G. Bellemare

Keywords: exploration, arcade learning environment, bonus-based methods

4:50

19/08/2021

Don’t Do What Doesn’t Matter: Intrinsic Motivation with Action Usefulness

Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin

Keywords: Machine Learning, Reinforcement Learning, Deep Reinforcement Learning

14:48

19/08/2021

Partial Multi-Label Optimal Margin Distribution Machine

Nan Cao, Teng Zhang, Hai Jin

Keywords: Machine Learning, Classification, Multi-instance; Multi-label; Multi-view learning, Weakly Supervised Learning

11:43

06/12/2020

Novelty Search in Representational Space for Sample Efficient Exploration

David Tao, Vincent Francois-Lavet, Joelle Pineau

3:04

26/04/2020

Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations

Xiao Ma, Peter Karkus, David Hsu, Wee Sun Lee, Nan Ye

Keywords: Reinforcement Learning, Partial Observability, Differentiable Particle Filtering

5:08

18/07/2021

Ensemble Bootstrapping for Q-Learning

Oren Peer, Chen Tessler, Nadav Merlis, Ron Meir

Keywords: Reinforcement Learning and Planning

5:17

06/12/2020

See, Hear, Explore: Curiosity via Audio-Visual Association

Victoria Dean, Shubham Tulsiani, Abhinav Gupta

3:23

18/07/2021

APS: Active Pretraining with Successor Features

Hao Liu, Pieter Abbeel

Keywords: Reinforcement Learning and Planning, Deep RL

14:29

06/12/2021

Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration

Lulu Zheng, Jiarui Chen, Jianhao Wang, Jiamin He, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao, Chongjie Zhang

Keywords: reinforcement learning and planning

12:25

18/07/2021

Emphatic Algorithms for Deep Reinforcement Learning

Ray Jiang, Tom Zahavy, Zhongwen Xu, Adam White, Matteo Hessel, Charles Blundell, Hado van Hasselt

Keywords: Reinforcement Learning and Planning, Deep RL

5:21

12/07/2020

Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences

Daniel Brown, Scott Niekum, Russell Coleman, Ravi Srinivasan

Keywords: Reinforcement Learning - Deep RL

15:11

26/08/2020

A Nonparametric Off-Policy Policy Gradient

Samuele Tosatto, Joao Carvalho, Hany Abdulsamad, Jan Peters

12:19

18/07/2021

Cooperative Exploration for Multi-Agent Deep Reinforcement Learning

Iou-Jen Liu, Unnat Jain, Raymond Yeh, Alex Schwing

Keywords: Reinforcement Learning and Planning, Deep RL

20:35

06/12/2020

Trust the Model When It Is Confident: Masked Model-based Actor-Critic

Feiyang Pan, Jia He, Dandan Tu, Qing He

2:57

06/12/2021

Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks

Rong Zhu, Mattia Rigotti

Keywords: theory, deep learning, reinforcement learning and planning, bandits

8:45

06/12/2020

Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning

Sebastian Curi, Felix Berkenkamp, Andreas Krause

3:23

02/02/2021

Distributional Reinforcement Learning via Moment Matching

Thanh Nguyen-Tang, Sunil Gupta, Svetha Venkatesh

20:01

18/07/2021

Confidence Scores Make Instance-dependent Label-noise Learning Possible

Antonin Berthon, Bo Han, Gang Niu, Tongliang Liu, Masashi Sugiyama

Keywords: Algorithms, Semi-Supervised Learning

20:38

18/07/2021

Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks

Sungryull Sohn, Sungtae Lee, Jongwook Choi, Harm van Seijen, Mehdi Fatemi, Honglak Lee

Keywords: Reinforcement Learning and Planning, Deep RL

5:19

18/07/2021

Wasserstein Distributional Normalization For Robust Distributional Certification of Noisy Labeled Data

Sung Woo Park, Junseok Kwon

Keywords: Deep Learning, Generative Models, Algorithms, Representation Learning; Optimization, Submodular Optimization, Probabilistic Methods, Robust statistics

5:20

06/12/2020

Complex Dynamics in Simple Neural Networks: Understanding Gradient Flow in Phase Retrieval

Stefano Sarao Mannelli, Giulio Biroli, Chiara Cammarota, Florent Krzakala, Pierfrancesco Urbani, Lenka Zdeborová

Keywords: Theory -> Learning Theory

3:17

14/06/2020

Attack to Explain Deep Representation

Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

Keywords: interpreting deep learning, adversarial attack, explanation attack, explainable ai, image generation

1:01

02/02/2021

Uncertainty-Aware Multi-View Representation Learning

Yu Geng, Zongbo Han, Changqing Zhang, Qinghua Hu

14:19

06/12/2021

COMBO: Conservative Offline Model-Based Policy Optimization

Tianhe Yu, Aviral Kumar, Rafael Rafailov, Aravind Rajeswaran, Sergey Levine, Chelsea Finn

Keywords: deep learning, optimization, reinforcement learning and planning

12:35

18/07/2021

Provably Correct Optimization and Exploration with Non-linear Policies

Fei Feng, Wotao Yin, Alekh Agarwal, Lin Yang

Keywords: Deep Learning, Adversarial Networks, Applications, Fairness, Accountability, and Transparency, Theory, RL, Decisions and Control Theory

5:03

18/07/2021

Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels

Zhaowei Zhu, Yiwen Song, Yang Liu

Keywords: Deep Learning

5:24

12/07/2020

Implicit Generative Modeling for Efficient Exploration

Neale Ratzlaff, Qinxun Bai, Fuxin Li, Wei Xu

Keywords: Reinforcement Learning - Deep RL

14:01

03/05/2021

Data-Efficient Reinforcement Learning with Self-Predictive Representations

Max Schwarzer, Ankesh Anand, Rishab Goel, R Devon Hjelm, Aaron Courville, Philip Bachman

Keywords: Representation Learning, Self-Supervised Learning, Reinforcement Learning, Sample Efficiency

10:04

03/05/2021

Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments

Daochen Zha, Wenye Ma, Lei Yuan, Xia Hu, Ji Liu

Keywords: Exploration, Reinforcement Learning, Self-Imitation, Generalization of Reinforcement Learning

5:10

06/12/2021

Provable Representation Learning for Imitation with Contrastive Fourier Features

Ofir Nachum, Mengjiao Yang

Keywords: reinforcement learning and planning, contrastive learning, representation learning

15:06

18/07/2021

Principled Exploration via Optimistic Bootstrapping and Backward Induction

Chenjia Bai, Lingxiao Wang, Lei Han, Jianye Hao, Animesh Garg, Peng Liu, Zhaoran Wang

Keywords: Reinforcement Learning and Planning

5:18

06/12/2021

Which Mutual-Information Representation Learning Objectives are Sufficient for Control?

Kate Rakelly, Abhishek Gupta, Carlos Florensa, Sergey Levine

Keywords: reinforcement learning and planning, representation learning

10:44

06/12/2020

Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization

Sreejith Balakrishnan, Quoc Phong Nguyen, Bryan Kian Hsiang Low, Harold Soh

3:22

14/06/2020

Multi-Scale Interactive Network for Salient Object Detection

Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Keywords: saliency detection, salient object detection, feature interaction strategy, scale-insensitive loss, multi-scale features, multi-level features, fully convolutional network, deep learning

1:01