Fighting Copycat Agents in Behavioral Cloning from Observation Histories

06/12/2020

Fighting Copycat Agents in Behavioral Cloning from Observation Histories

Chuan Wen, Jierui Lin, Trevor Darrell, Dinesh Jayaraman, Yang Gao

Keywords: , Reinforcement Learning and Planning -> Exploration

Abstract Paper Similar Papers

Abstract: Imitation learning trains policies to map from input observations to the actions that an expert would choose. In this setting, distribution shift frequently exacerbates the effect of misattributing expert actions to nuisance correlates among the observed variables. We observe that a common instance of this causal confusion occurs in partially observed settings when expert actions are strongly correlated over time: the imitator learns to cheat by predicting the expert's previous action, rather than the next action. To combat this "copycat problem", we propose an adversarial approach to learn a feature representation that removes excess information about the previous expert action nuisance correlate, while retaining the information necessary to predict the next action. In our experiments, our approach improves performance significantly across a variety of partially observed imitation learning tasks.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Invariant Causal Imitation Learning for Generalizable Policies

Ioana Bica, Daniel Jarrett, Mihaela van der Schaar

Keywords Paper

reinforcement learning and planning, representation learning

0

0

0

0

14:49

06/12/2020

An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

Siddharth Desai, Ishan Durugkar, Haresh Karnan and
Garrett Warnell, Josiah Hanna, Peter Stone

Keywords Paper

0

0

0

0

3:22

26/04/2020

Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling

Yuping Luo, Huazhe Xu, Tengyu Ma

Keywords Paper

imitation learning, model-based imitation learning, model-based RL, behavior cloning, covariate shift

0

0

0

0

4:38

26/04/2020

Imitation Learning via Off-Policy Distribution Matching

Ilya Kostrikov, Ofir Nachum, Jonathan Tompson

Keywords Paper

reinforcement learning, deep learning, imitation learning, adversarial learning

0

0

0

0

5:31

18/07/2021

Robust Asymmetric Learning in POMDPs

Andrew Warrington, Jonathan Lavington, Adam Scibior and
Mark Schmidt, Frank Wood

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

16:53

06/12/2020

Learning from Failure: De-biasing Classifier from Biased Classifier

Junhyun Nam, Hyuntak Cha, Sungsoo Ahn and
Jaeho Lee, Jinwoo Shin

Keywords Paper

0

0

0

0

3:21

03/05/2021

Learning What To Do by Simulating the Past

David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

Keywords Paper

reinforcement learning, imitation learning, reward learning

0

0

0

0

5:09

18/07/2021

Cross-domain Imitation from Observations

Dripta S. Raychaudhuri, Sujoy Paul, Jeroen Vanbaar, Amit Roy-Chowdhury

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

20:49

03/05/2021

Correcting experience replay for multi-agent communication

Sanjeevan Ahilan, Peter Dayan

Keywords Paper

multi-agent reinforcement learning, communication, experience replay, relabelling

1

0

0

0

10:31

06/12/2020

Causal Imitation Learning With Unobserved Confounders

Junzhe Zhang, Daniel Kumor, Elias Bareinboim

Keywords Paper

0

0

0

0

3:18

07/09/2020

Imitating Unknown Policies via Exploration

Nathan Gavenski, Juarez Monteiro, Roger Granada and
Felipe Meneguzzi, Rodrigo Barros

Keywords Paper

imitation learning, learning from demonstration, behavioral cloning

0

0

0

0

7:42

18/07/2021

Imitation by Predicting Observations

Andrew Jaegle, Yury Sulsky, Arun Ahuja and
Jake Bruce, Rob Fergus, Greg Wayne

Keywords Paper

Reinforcement Learning and Planning, Others

0

0

0

0

5:15

06/12/2020

Off-Policy Imitation Learning from Observations

Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

Keywords Paper

0

0

0

1

3:24

06/12/2020

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

Paul Barde, Julien Roy, Wonseok Jeon and
Joelle Pineau, Chris Pal, Derek Nowrouzezahrai

Keywords Paper

0

0

0

0

3:08

26/04/2020

State Alignment-based Imitation Learning

Fangchen Liu, Zhan Ling, Tongzhou Mu, Hao Su

Keywords Paper

Imitation learning, Reinforcement Learning

0

0

0

0

4:56

18/07/2021

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Shariq Iqbal, Christian Schroeder, Bei Peng and
Wendelin Boehmer, Shimon Whiteson, Fei Sha

Keywords Paper

Optimization, Convex Optimization, Reinforcement Learning and Planning, Multi-Agent RL, Algorithms, Large Scale Learning; Probabilistic Methods, Distributed Inference

0

0

0

0

20:08

06/12/2020

Error Bounds of Imitating Policies and Environments

Tian Xu, Ziniu Li, Yang Yu

Keywords Paper

0

0

0

0

3:07

16/11/2020

Tolerance-Guided Policy Learning for Adaptable and Transferrable Delicate Industrial Insertion

Boshen Niu, Chenxi Wang, Changliu Liu

Keywords Paper

0

0

0

0

5:36

06/12/2020

Inverse Reinforcement Learning from a Gradient-based Learner

Giorgia Ramponi, Gianluca Drappo, Marcello Restelli

Keywords Paper

0

0

0

0

2:42

13/04/2021

On data efficiency of meta-learning

Maruan Al-Shedivat, Liam Li, Eric Xing, Ameet Talwalkar

Keywords Paper

0

0

0

0

3:24

03/05/2021

Domain-Robust Visual Imitation Learning with Mutual Information Constraints

Edoardo Cetin, Oya Celiktutan

Keywords Paper

Domain Adaption, Third-Person Imitation, Observational Imitation, Reinforcement Learning, Machine Learning, Mutual Information, Imitation Learning

0

0

0

0

4:51

25/07/2020

Influence function for unbiased recommendation

Jiangxing Yu, Hong Zhu, Chih-Yao Chang and
Xinhua Feng, Bowen Yuan, Xiuqiang He, Zhenhua Dong

Keywords Paper

recommender system, influence function, counterfactual learning

0

0

0

0

9:43

06/12/2020

From Predictions to Decisions: Using Lookahead Regularization

Nir Rosenfeld, Sophie Hilgard, Sai Ravindranath, David Parkes

Keywords Paper

0

0

0

0

3:10

06/12/2021

Conservative Data Sharing for Multi-Task Offline Reinforcement Learning

Tianhe Yu, Aviral Kumar, Yevgen Chebotar and
Karol Hausman, Sergey Levine, Chelsea Finn

Keywords Paper

reinforcement learning and planning

0

0

0

0

14:27

02/02/2021

Curse or Redemption? How Data Heterogeneity Affects the Robustness of Federated Learning

Syed Zawad, Ahsan Ali, Pin-Yu Chen and
Ali Anwar, Yi Zhou, Nathalie Baracaldo, Yuan Tian, Feng Yan

Keywords Paper

0

0

0

0

19:26

13/04/2021

Comparing the value of labeled and unlabeled data in method-of-moments latent variable estimation

Mayee Chen, Benjamin Cohen-Wang, Stephen Mussmann and
Frederic Sala, Christopher Re

Keywords Paper

0

0

0

0

3:04

06/12/2021

Bridging the Imitation Gap by Adaptive Insubordination

Luca Weihs, Unnat Jain, Iou-Jen Liu and
Jordi Salvador, Svetlana Lazebnik, Aniruddha Kembhavi, Alex Schwing

Keywords Paper

reinforcement learning and planning

0

0

0

0

6:51

26/04/2020

Disagreement-Regularized Imitation Learning

Kiante Brantley, Wen Sun, Mikael Henaff

Keywords Paper

imitation learning, reinforcement learning, uncertainty

0

0

0

0

4:53

18/07/2021

A Distribution-dependent Analysis of Meta Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvari

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:06

06/12/2021

Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning

Jongjin Park, Younggyo Seo, Chang Liu and
Li Zhao, Tao Qin, Jinwoo Shin, Tie-Yan Liu

Keywords Paper

reinforcement learning and planning, causality

0

0

0

0

12:12

26/04/2020

Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Piyush Gupta, Nikaash Puri, Sukriti Verma and
Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

Keywords Paper

Deep Reinforcement Learning, Saliency maps, Chess, Atari games, Interpretable AI

0

0

0

0

4:59

03/05/2021

Parameter-Based Value Functions

Francesco Faccio, Louis Kirsch, Jürgen Schmidhuber

Keywords Paper

Off-Policy Reinforcement Learning, Reinforcement Learning

0

0

0

0

2:45

25/07/2020

Fair classification with counterfactual learning

Maryam Tavakol

Keywords Paper

fairness-aware learning, classification, counterfactual reasoning

0

0

0

0

10:29

06/12/2020

f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning

Xin Zhang, Yanhua Li, Ziming Zhang, Zhi-Li Zhang

Keywords Paper

0

0

0

0

3:22

02/02/2021

Group Fairness by Probabilistic Modeling with Latent Fair Decisions

YooJung Choi, Meihua Dang, Guy Van den Broeck

Keywords Paper

0

0

0

0

19:30

12/07/2020

Weakly-Supervised Disentanglement Without Compromises

Francesco Locatello, Ben Poole, Gunnar Raetsch and
Bernhard Schölkopf, Olivier Bachem, Michael Tschannen

Keywords Paper

Representation Learning

0

0

0

0

14:47

12/07/2020

Ready Policy One: World Building Through Active Learning

Philip Ball, Jack Parker-Holder, Aldo Pacchiano and
Krzysztof Choromanski, Stephen Roberts

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:31

18/07/2021

Adversarial Policy Learning in Two-player Competitive Games

Wenbo Guo, Xian Wu, Sui Huang, Xinyu Xing

Keywords Paper

Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

5:14

18/07/2021

Deciding What to Learn: A Rate-Distortion Approach

Dilip Arumugam, Benjamin Van Roy

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

5:12

14/06/2020

NestedVAE: Isolating Common Factors via Weak Supervision

Matthew J. Vowels, Necati Cihan Camgöz, Richard Bowden

Keywords Paper

fairness, bias, representation learning, invariance, vae, variational, weakly supervised, information bottleneck

0

0

0

0

1:00