Error Bounds of Imitating Policies and Environments

06/12/2020

Tian Xu, Ziniu Li, Yang Yu

Abstract: Imitation learning trains a policy by mimicking expert demonstrations. Various imitation methods have been proposed and empirically evaluated, but their theoretical understanding needs further study. In this paper, we first analyze the value gap between the expert policy and the policies imitated by two methods: behavioral cloning and generative adversarial imitation. The results support that generative adversarial imitation can reduce compounding errors compared to behavioral cloning, and thus has better sample complexity. Noting that imitation learning can also be used to learn the environment model by treating the environment transition model as a dual agent, we further analyze the performance of imitating environments, building on the bounds for imitating policies. The results show that environment models can be imitated more effectively by generative adversarial imitation than by behavioral cloning, suggesting a novel application of adversarial imitation to model-based reinforcement learning. We hope these results inspire future advances in imitation learning and model-based reinforcement learning.

The talk and the corresponding paper were published at the NeurIPS 2020 virtual conference.

Similar Papers

06/12/2021

Invariant Causal Imitation Learning for Generalizable Policies

Ioana Bica, Daniel Jarrett, Mihaela van der Schaar

Keywords: reinforcement learning and planning, representation learning

14:49

06/12/2020

An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

Siddharth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah Hanna, Peter Stone

3:22

18/07/2021

Cross-domain Imitation from Observations

Dripta S. Raychaudhuri, Sujoy Paul, Jeroen Vanbaar, Amit Roy-Chowdhury

Keywords: Reinforcement Learning and Planning, Deep RL

20:49

26/04/2020

Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling

Yuping Luo, Huazhe Xu, Tengyu Ma

Keywords: imitation learning, model-based imitation learning, model-based RL, behavior cloning, covariate shift

4:38

06/12/2021

Outcome-Driven Reinforcement Learning via Variational Inference

Tim G. J. Rudner, Vitchyr Pong, Rowan McAllister, Yarin Gal, Sergey Levine

Keywords: reinforcement learning and planning, generative model

12:21

06/12/2020

The Value Equivalence Principle for Model-Based Reinforcement Learning

Christopher Grimm, Andre Barreto, Satinder Singh, David Silver

3:19

26/08/2020

Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning

Tianyu Li, Bogdan Mazoure, Doina Precup, Guillaume Rabusseau

13:49

06/12/2021

Model-Based Reinforcement Learning via Imagination with Derived Memory

Yao Mu, Yuzheng Zhuang, Bin Wang, Guangxiang Zhu, Wulong Liu, Jianyu Chen, Ping Luo, Shengbo Li, Chongjie Zhang, Jianye Hao

Keywords: reinforcement learning and planning, robustness

9:31

06/12/2020

Causal Imitation Learning With Unobserved Confounders

Junzhe Zhang, Daniel Kumor, Elias Bareinboim

3:18

06/12/2020

Inverse Reinforcement Learning from a Gradient-based Learner

Giorgia Ramponi, Gianluca Drappo, Marcello Restelli

2:42

06/12/2021

An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning

Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Hangyu Mao, Dong Li, Wulong Liu, Yingfeng Chen, Yujing Hu, Changjie Fan, Chengwei Zhang

Keywords: reinforcement learning and planning, transfer learning

15:21

19/08/2021

Robot Manipulation Learning Using Generative Adversarial Imitation Learning

Mohamed Khalil Jabri

Keywords: Machine Learning, Reinforcement Learning, Adversarial Machine Learning, Unsupervised Learning, Learning in Robotics

8:12

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords: reinforcement learning and planning, robustness, representation learning

12:24

06/12/2020

Cooperative Heterogeneous Deep Reinforcement Learning

Han Zheng, Pengfei Wei, Jing Jiang, Guodong Long, Qinghua Lu, Chengqi Zhang

3:08

26/04/2020

State Alignment-based Imitation Learning

Fangchen Liu, Zhan Ling, Tongzhou Mu, Hao Su

Keywords: Imitation learning, Reinforcement Learning

4:56

16/11/2020

Tolerance-Guided Policy Learning for Adaptable and Transferrable Delicate Industrial Insertion

Boshen Niu, Chenxi Wang, Changliu Liu

5:36

06/12/2020

Fighting Copycat Agents in Behavioral Cloning from Observation Histories

Chuan Wen, Jierui Lin, Trevor Darrell, Dinesh Jayaraman, Yang Gao

Keywords: Reinforcement Learning and Planning -> Exploration

3:21

06/12/2021

Generating High-Quality Explanations for Navigation in Partially-Revealed Environments

Gregory Stein

Keywords: deep learning

13:11

06/12/2020

Model-based Policy Optimization with Unsupervised Model Adaptation

Jian Shen, Han Zhao, Weinan Zhang, Yong Yu

3:09

18/07/2021

Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning

Sumedh Sontakke, Arash Mehrjou, Laurent Itti, Bernhard Schölkopf

Keywords: Applications, Robotics

5:18

06/12/2021

Bridging the Imitation Gap by Adaptive Insubordination

Luca Weihs, Unnat Jain, Iou-Jen Liu, Jordi Salvador, Svetlana Lazebnik, Aniruddha Kembhavi, Alex Schwing

Keywords: reinforcement learning and planning

6:51

18/07/2021

Imitation by Predicting Observations

Andrew Jaegle, Yury Sulsky, Arun Ahuja, Jake Bruce, Rob Fergus, Greg Wayne

Keywords: Reinforcement Learning and Planning, Others

5:15

19/08/2021

Variational Model-based Policy Optimization

Yinlam Chow, Brandon Cui, Moonkyung Ryu, Mohammad Ghavamzadeh

Keywords: Machine Learning, Reinforcement Learning

15:31

06/12/2021

Local policy search with Bayesian optimization

Sarah Müller, Alexander von Rohr, Sebastian Trimpe

Keywords: theory, optimization, reinforcement learning and planning, active learning

11:42

03/05/2021

Primal Wasserstein Imitation Learning

Robert Dadashi, Léonard Hussenot, Matthieu Geist, Olivier Pietquin

Keywords: Wasserstein distance, Optimal Transport, Inverse Reinforcement Learning, Reinforcement Learning, Imitation Learning

4:57

18/07/2021

Interactive Learning from Activity Description

Khanh Nguyen, Dipendra Misra, Robert Schapire, Miro Dudik, Patrick Shafto

Keywords: Reinforcement Learning and Planning

4:57

19/08/2021

Novelty Detection via Contrastive Learning with Negative Data Augmentation

Chengwei Chen, Yuan Xie, Shaohui Lin, Ruizhi Qiao, Jian Zhou, Xin Tan, Yi Zhang, Lizhuang Ma

Keywords: Computer Vision, 2D and 3D Computer Vision, Anomaly/Outlier Detection

14:37

06/12/2021

Adversarial Intrinsic Motivation for Reinforcement Learning

Ishan Durugkar, Mauricio Tec, Scott Niekum, Peter Stone

Keywords: reinforcement learning and planning, generative model

13:11

06/12/2020

MOReL: Model-Based Offline Reinforcement Learning

Rahul Kidambi, Aravind Rajeswaran, Praneeth Netrapalli, Thorsten Joachims

3:23

06/12/2021

Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration

Lulu Zheng, Jiarui Chen, Jianhao Wang, Jiamin He, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao, Chongjie Zhang

Keywords: reinforcement learning and planning

12:25

19/08/2021

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts

Weinan Zhang, Xihuai Wang, Jian Shen, Ming Zhou

Keywords: Machine Learning, Deep Reinforcement Learning, Multi-agent Learning

13:10

18/07/2021

Representation Matters: Offline Pretraining for Sequential Decision Making

Mengjiao Yang, Ofir Nachum

Keywords: Reinforcement Learning and Planning

5:06

06/12/2021

Discovery of Options via Meta-Learned Subgoals

Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Keywords: deep learning, reinforcement learning and planning

4:13

02/02/2021

An Adaptive Hybrid Framework for Cross-domain Aspect-based Sentiment Analysis

Yan Zhou, Fuqing Zhu, Pu Song, Jizhong Han, Tao Guo, Songlin Hu

17:23

07/09/2020

Imitating Unknown Policies via Exploration

Nathan Gavenski, Juarez Monteiro, Roger Granada, Felipe Meneguzzi, Rodrigo Barros

Keywords: imitation learning, learning from demonstration, behavioral cloning

7:42

26/04/2020

Multi-Agent Interactions Modeling with Correlated Policies

Minghuan Liu, Ming Zhou, Weinan Zhang, Yuzheng Zhuang, Jun Wang, Wulong Liu, Yong Yu

Keywords: Multi-agent reinforcement learning, Imitation learning

4:33

26/04/2020

Watch, Try, Learn: Meta-Learning from Demonstrations and Rewards

Allan Zhou, Eric Jang, Daniel Kappler, Alex Herzog, Mohi Khansari, Paul Wohlhart, Yunfei Bai, Mrinal Kalakrishnan, Sergey Levine, Chelsea Finn

Keywords: meta-learning, reinforcement learning, imitation learning

4:34

03/05/2021

Learning and Evaluating Representations for Deep One-Class Classification

Kihyuk Sohn, Chun-Liang Li, Jinsung Yoon, Minho Jin, Tomas Pfister

Keywords: self-supervised learning, deep one-class classification

5:13

06/12/2021

Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies

Tim Seyde, Igor Gilitschenski, Wilko Schwarting, Bartolomeo Stellato, Martin Riedmiller, Markus Wulfmeier, Daniela Rus

Keywords: reinforcement learning and planning

6:48

06/12/2020

Offline Imitation Learning with a Misspecified Simulator

Shengyi Jiang, Jingcheng Pang, Yang Yu

Keywords: Reinforcement Learning and Planning -> Exploration, Algorithms -> Bandit Algorithms

3:29