Decoupling Value and Policy for Generalization in Reinforcement Learning

18/07/2021

Decoupling Value and Policy for Generalization in Reinforcement Learning

Roberta Raileanu, Rob Fergus

Keywords: Theory, Learning Theory, Theory, Large Deviations and Asymptotic Analysis, Reinforcement Learning and Planning, Deep RL

Abstract Paper Similar Papers

Abstract: Standard deep reinforcement learning algorithms use a shared representation for the policy and value function, especially when training directly from images. However, we argue that more information is needed to accurately estimate the value function than to learn the optimal policy. Consequently, the use of a shared representation for the policy and value function can lead to overfitting. To alleviate this problem, we propose two approaches which are combined to create IDAAC: Invariant Decoupled Advantage Actor-Critic. First, IDAAC decouples the optimization of the policy and value function, using separate networks to model them. Second, it introduces an auxiliary loss which encourages the representation to be invariant to task-irrelevant properties of the environment. IDAAC shows good generalization to unseen environments, achieving a new state-of-the-art on the Procgen benchmark and outperforming popular methods on DeepMind Control tasks with distractors. Our implementation is available at https://github.com/rraileanu/idaac.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords Paper

reinforcement learning and planning, robustness, representation learning

0

0

0

0

12:24

18/07/2021

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases

Stéphane d'Ascoli, Hugo Touvron, Matthew Leavitt and
Ari Morcos, Giulio Biroli, Levent Sagun

Keywords Paper

Deep Learning, Architectures

0

0

0

0

5:16

06/12/2021

Counterfactual Maximum Likelihood Estimation for Training Deep Networks

Xinyi Wang, Wenhu Chen, Michael Saxon, William Yang Wang

Keywords Paper

deep learning, domain adaptation, causality, language

0

0

0

0

14:07

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

Keywords Paper

0

0

0

0

19:54

22/11/2021

Structured Latent Embeddings for Recognizing Unseen Classes in Unseen Domains

Shivam Chandhok, Sanath Narayan, Hisham Cholakkal and
Rao Muhammad Anwer, Vineeth N Balasubramanian, Fahad Shahbaz Khan, Ling Shao

Keywords Paper

Zero-Shot, Domain Generalization, multimodal-alignment, domain-invariant, conceptual partition, semantics

0

0

0

0

2:48

14/06/2020

Selective Transfer With Reinforced Transfer Network for Partial Domain Adaptation

Zhihong Chen, Chao Chen, Zhaowei Cheng and
Boyuan Jiang, Ke Fang, Xinyu Jin

Keywords Paper

partial domain adaptation, selective transfer, pixel-level information, reconstruct error, reinforcement learning

1

1

0

0

1:01

26/04/2020

Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification

Yixiao Ge, Dapeng Chen, Hongsheng Li

Keywords Paper

Label Refinery, Unsupervised Domain Adaptation, Person Re-identification

0

0

0

0

5:03

03/05/2021

Negative Data Augmentation

Abhishek Sinha, Kumar Ayush, Jiaming Song and
Burak Uzkent, Hongxia Jin, Stefano Ermon

Keywords Paper

self-supervised learning, anomaly detection, generative models, data augmentation

0

1

0

0

4:59

19/08/2021

Enhance Image as You Like with Unpaired Learning

Xiaopeng Sun, Muxingzi Li, Tianyu He, Lubin Fan

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Applications of Unsupervised Learning

0

0

0

0

11:20

02/02/2021

Visualization of Supervised and Self-Supervised Neural Networks via Attribution Guided Factorization

Shir Gur, Ameen Ali, Lior Wolf

Keywords Paper

0

0

0

0

14:14

06/12/2021

Improving Contrastive Learning on Imbalanced Data via Open-World Sampling

Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangyang Wang

Keywords Paper

contrastive learning

0

0

0

0

10:12

06/12/2020

AdvFlow: Inconspicuous Black-box Adversarial Attacks using Normalizing Flows

Hadi Mohaghegh Dolatabadi, Sarah Erfani, Christopher Leckie

Keywords Paper

0

0

0

0

3:59

06/12/2021

Object-aware Contrastive Learning for Debiased Scene Representation

Sangwoo Mo, Hyunwoo Kang, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin

Keywords Paper

self-supervised learning, contrastive learning, representation learning

0

0

0

0

10:25

06/12/2021

Towards Context-Agnostic Learning Using Synthetic Data

Charles Jin, Martin Rinard

Keywords Paper

machine learning, vision

0

0

0

0

14:20

26/04/2020

Nesterov Accelerated Gradient and Scale Invariance for Adversarial Attacks

Jiadong Lin, Chuanbiao Song, Kun He and
Liwei Wang, John E. Hopcroft

Keywords Paper

adversarial examples, adversarial attack, transferability, Nesterov accelerated gradient, scale invariance

0

0

0

0

3:59

14/06/2020

Domain-Aware Visual Bias Eliminating for Generalized Zero-Shot Learning

Shaobo Min, Hantao Yao, Hongtao Xie and
Chaoqun Wang, Zheng-Jun Zha, Yongdong Zhang

Keywords Paper

generalized zero-shot learning, domain detection, recognition, segmentation, margin loss, bilinear pooling, nas, transfer learning, domain adaption, computer vision.

0

0

0

0

0:58

06/12/2021

Rebooting ACGAN: Auxiliary Classifier GANs with Stable Training

Minguk Kang, Woohyeon Shim, Minsu Cho, Jaesik Park

Keywords Paper

generative model

0

0

0

0

9:03

02/02/2021

Tailoring Embedding Function to Heterogeneous Few-Shot Tasks by Global and Local Feature Adaptors

Su Lu, Han-Jia Ye, De-Chuan Zhan

Keywords Paper

0

0

0

0

14:21

18/07/2021

Towards Domain-Agnostic Contrastive Learning

Vikas Verma, Thang Luong, Kenji Kawaguchi and
Hieu Pham, Quoc Le

Keywords Paper

Deep Learning

0

0

0

0

4:54

02/02/2021

Generalizable Representation Learning for Mixture Domain Face Anti-Spoofing

Zhihong Chen, Taiping Yao, Kekai Sheng and
Shouhong Ding, Ying Tai, Jilin Li, Feiyue Huang, Xinyu Jin

Keywords Paper

0

0

0

0

14:08

06/12/2021

Designing Counterfactual Generators using Deep Model Inversion

Jayaraman Thiagarajan, Vivek Sivaraman Narayanaswamy, Deepta Rajan and
Jia Liang, Akshay Chaudhari, Andreas Spanias

Keywords Paper

optimization, representation learning, interpretability

0

0

0

0

13:28

14/06/2020

Hierarchically Robust Representation Learning

Qi Qian, Juhua Hu, Hao Li

Keywords Paper

representation learning, hierarchical robustness

0

0

0

0

1:01

03/05/2021

OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning

Anurag Ajay, Aviral Kumar, Pulkit Agrawal and
Sergey Levine, Ofir Nachum

Keywords Paper

Unsupervised Learning, Offline Reinforcement Learning, Primitive Discovery

0

0

0

0

5:08

06/12/2020

Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation

Han Zhao, Jianfeng Chi, Yuan Tian, Geoffrey Gordon

Keywords Paper

0

0

0

0

3:17

14/06/2020

Attack to Explain Deep Representation

Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

Keywords Paper

interpreting deep learning, adversarial attack, explanation attack, explainable ai, image generation

0

0

0

0

1:01

18/07/2021

Understanding Invariance via Feedforward Inversion of Discriminatively Trained Classifiers

Piotr Teterwak, Chiyuan Zhang, Dilip Krishnan, Mike Mozer

Keywords Paper

Deep Learning

0

0

0

0

4:52

14/06/2020

BiDet: An Efficient Binarized Object Detector

Ziwei Wang, Ziyi Wu, Jiwen Lu, Jie Zhou

Keywords Paper

binary neural networks, object detection, information bottleneck, sparse object priors, false positive elimination

0

0

0

0

1:00

30/11/2020

Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation

Siqi Yang, Lin Wu, Arnold Wiliem, Brian C. Lovell

Keywords Paper

0

0

0

0

6:32

06/12/2021

Intriguing Properties of Contrastive Losses

Ting Chen, Calvin Luo, Lala Li

Keywords Paper

self-supervised learning, vision, contrastive learning

0

0

0

0

13:36

14/06/2020

Erasing Integrated Learning: A Simple Yet Effective Approach for Weakly Supervised Object Localization

Jinjie Mai, Meng Yang, Wenfeng Luo

Keywords Paper

weakly supervised, object localization, adversarial erasing

0

0

0

0

5:00

16/11/2020

Attention-Privileged Reinforcement Learning

Sasha Salter, Dushyant Rao, Markus Wulfmeier and
Raia Hadsell, Ingmar Posner

Keywords Paper

0

0

0

0

4:54

12/07/2020

Implicit Class-Conditioned Domain Alignment for Unsupervised Domain Adaptation

Xiang Jiang, Qicheng Lao, Stan Matwin, Mohammad Havaei

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

14:47

22/11/2021

Feature Fusion Vision Transformer for Fine-Grained Visual Categorization

Jun Wang, Xiaohan Yu, Yongsheng Gao

Keywords Paper

Fine-grained visual categorization, Vision transformer, Self-attention, Feature Fusion

0

0

0

0

3:02

14/06/2020

Self-Supervised Domain-Aware Generative Network for Generalized Zero-Shot Learning

Jiamin Wu, Tianzhu Zhang, Zheng-Jun Zha and
Jiebo Luo, Yongdong Zhang, Feng Wu

Keywords Paper

zero-shot learning, self-supervised learning, generative adversarial network, generative model, multi-label learning

0

0

0

0

1:01

22/11/2021

Unsupervised Domain Adaptation of Black-Box Source Models

Haojian Zhang, Yabin Zhang, Kui Jia, Lei Zhang

Keywords Paper

domain adaptation, black box, unsupervised, noisy label, iterative

0

0

0

0

2:57

02/02/2021

Deep Low-Contrast Image Enhancement using Structure Tensor Representation

Hyungjoo Jung, Hyunsung Jang, Namkoo Ha, Kwanghoon Sohn

Keywords Paper

0

0

0

0

16:31

06/12/2021

Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning

Jongjin Park, Younggyo Seo, Chang Liu and
Li Zhao, Tao Qin, Jinwoo Shin, Tie-Yan Liu

Keywords Paper

reinforcement learning and planning, causality

0

0

0

0

12:12

25/07/2020

Disentangled graph collaborative filtering

Xiang Wang, Hongye Jin, An Zhang and
Xiangnan He, Tong Xu, Tat-Seng Chua

Keywords Paper

explainable recommendation, disentangled representation learning, collaborative filtering, graph neural networks

0

0

0

0

15:17

03/05/2021

Property Controllable Variational Autoencoder via Invertible Mutual Dependence

Xiaojie Guo, Yuanqi Du, Liang Zhao

Keywords Paper

deep generative models, disentangled representation learning, interpretable latent representation

0

0

0

0

4:45

03/05/2021

Counterfactual Generative Networks

Axel Sauer, Andreas Geiger

Keywords Paper

Generative Models, Data Augmentation, Image Classification, Counterfactuals, Robustness, Causality

0

0

0

0

5:25