An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

06/12/2020

An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

Siddharth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah Hanna, Peter Stone

Keywords:

Abstract Paper Similar Papers

Abstract: We examine the problem of transferring a policy learned in a source environment to a target environment with different dynamics, particularly in the case where it is critical to reduce the amount of interaction with the target environment during learning. This problem is particularly important in sim-to-real transfer because simulators inevitably model real-world dynamics imperfectly. In this paper, we show that one existing solution to this transfer problem-- grounded action transformation --is closely related to the problem of imitation from observation (IfO): learning behaviors that mimic the observations of behavior demonstrations. After establishing this relationship, we hypothesize that recent state-of-the-art approaches from the IfO literature can be effectively repurposed for grounded transfer learning. To validate our hypothesis we derive a new algorithm -- generative adversarial reinforced action transformation (GARAT) -- based on adversarial imitation from observation techniques. We run experiments in several domains with mismatched dynamics, and find that agents trained with GARAT achieve higher returns in the target environment compared to existing black-box transfer methods.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Inverse Reinforcement Learning from a Gradient-based Learner

Giorgia Ramponi, Gianluca Drappo, Marcello Restelli

Keywords Paper

0

0

0

0

2:42

06/12/2020

Model-based Adversarial Meta-Reinforcement Learning

Zichuan Lin, Garrett Thomas, Guangwen Yang, Tengyu Ma

Keywords Paper

0

0

0

0

3:31

18/07/2021

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

Dong Ki Kim, Miao Liu, Matthew Riemer and
Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan How

Keywords Paper

Reinforcement Learning and Planning, Multi-Agent RL, Algorithms, Representation Learning, Algorithms, Relational Learning

0

0

0

0

5:20

03/05/2021

Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning

Dong Bok Lee, Dongchan Min, Seanie Lee, Sung Ju Hwang

Keywords Paper

Unsupervised Learning, Variational Autoencoders, Unsupervised Meta-learning, Meta-Learning

0

0

0

0

13:31

06/12/2021

Provable Representation Learning for Imitation with Contrastive Fourier Features

Ofir Nachum, Mengjiao Yang

Keywords Paper

reinforcement learning and planning, contrastive learning, representation learning

0

0

0

0

15:06

06/12/2020

Model-based Policy Optimization with Unsupervised Model Adaptation

Jian Shen, Han Zhao, Weinan Zhang, Yong Yu

Keywords Paper

0

0

0

0

3:09

18/07/2021

Representation Matters: Offline Pretraining for Sequential Decision Making

Mengjiao Yang, Ofir Nachum

Keywords Paper

Reinforcement Learning and Planning

1

0

0

0

5:06

18/07/2021

A Distribution-dependent Analysis of Meta Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvari

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:06

06/12/2020

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte and
Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

Keywords Paper

0

0

0

0

3:20

19/08/2021

Novelty Detection via Contrastive Learning with Negative Data Augmentation

Chengwei Chen, Yuan Xie, Shaohui Lin and
Ruizhi Qiao, Jian Zhou, Xin Tan, Yi Zhang, Lizhuang Ma

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Anomaly/Outlier Detection

0

0

0

0

14:37

03/05/2021

Control-Aware Representations for Model-based Reinforcement Learning

Brandon Cui, Yinlam Chow, Mohammad Ghavamzadeh

Keywords Paper

0

0

0

0

4:57

03/05/2021

Learning perturbation sets for robust machine learning

Eric Wong, Zico Kolter

Keywords Paper

conditional variational autoencoder, adversarial examples, perturbation sets, robust machine learning

0

1

0

0

5:06

18/07/2021

DriftSurf: Stable-State / Reactive-State Learning under Concept Drift

Ashraf Tahmasbi, Ellango Jothimurugesan, Srikanta Tirthapura, Phil Gibbons

Keywords Paper

Algorithms, Online Learning Algorithms

0

0

0

0

5:07

02/02/2021

Adversarial Partial Multi-Label Learning with Label Disambiguation

Yan Yan, Yuhong Guo

Keywords Paper

0

0

0

0

14:38

06/12/2020

The Value Equivalence Principle for Model-Based Reinforcement Learning

Christopher Grimm, Andre Barreto, Satinder Singh, David Silver

Keywords Paper

0

0

0

0

3:19

06/12/2021

Learning to Synthesize Programs as Interpretable and Generalizable Policies

Dweep Trivedi, Jesse Zhang, Shao-Hua Sun, Joseph Lim

Keywords Paper

deep learning, reinforcement learning and planning

0

0

0

0

10:30

03/05/2021

Correcting experience replay for multi-agent communication

Sanjeevan Ahilan, Peter Dayan

Keywords Paper

multi-agent reinforcement learning, communication, experience replay, relabelling

1

0

0

0

10:31

06/12/2021

Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies

Tim Seyde, Igor Gilitschenski, Wilko Schwarting and
Bartolomeo Stellato, Martin Riedmiller, Markus Wulfmeier, Daniela Rus

Keywords Paper

reinforcement learning and planning

0

0

0

0

6:48

03/05/2021

Iterative Empirical Game Solving via Single Policy Best Response

Max Smith, Thomas Anthony, Michael Wellman

Keywords Paper

Reinforcement Learning, Multiagent Learning, Empirical Game Theory

0

0

0

0

8:49

26/04/2020

Multi-Agent Interactions Modeling with Correlated Policies

Minghuan Liu, Ming Zhou, Weinan Zhang and
Yuzheng Zhuang, Jun Wang, Wulong Liu, Yong Yu

Keywords Paper

Multi-agent reinforcement learning, Imitation learning

0

0

0

0

4:33

07/09/2020

Unsupervised Domain Adaptation for Spatio-Temporal Action Localization

Nakul Agarwal, Yi-Ting Chen, Behzad Dariush, Ming-Hsuan Yang

Keywords Paper

Spatio-Temporal Action Localization, Unsupervised Domain Adaptation, Adversarial Learning, Video Analysis, Deep Learning

0

0

0

0

9:28

03/05/2021

Meta-Learning of Structured Task Distributions in Humans and Machines

Sreejan Kumar, Ishita Dasgupta, Jonathan Cohen and
Nathaniel Daw, Thomas L Griffiths

Keywords Paper

reinforcement learning, compositionality, human cognition, meta-learning

0

0

0

0

5:18

06/12/2021

Counterfactual Maximum Likelihood Estimation for Training Deep Networks

Xinyi Wang, Wenhu Chen, Michael Saxon, William Yang Wang

Keywords Paper

deep learning, domain adaptation, causality, language

0

0

0

0

14:07

03/05/2021

Self-supervised Learning from a Multi-view Perspective

Yao-Hung Hubert Tsai, Yue Wu, Ruslan Salakhutdinov, LP Morency

Keywords Paper

Self-supervised Learning, Unsupervised Learning, Multi-view Representation Learning

0

0

0

0

5:36

06/12/2021

Time-series Generation by Contrastive Imitation

Daniel Jarrett, Ioana Bica, Mihaela van der Schaar

Keywords Paper

generative model

0

0

0

0

8:47

06/12/2020

Off-Policy Imitation Learning from Observations

Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

Keywords Paper

0

0

0

1

3:24

18/07/2021

On Recovering from Modeling Errors Using Testing Bayesian Networks

Haiying Huang, Adnan Darwiche

Keywords Paper

Probabilistic Methods, Graphical Models

0

0

0

0

5:09

18/07/2021

Function Contrastive Learning of Transferable Meta-Representations

Waleed Gondal, Shruti Joshi, Nasim Rahaman and
Stefan Bauer, Manuel Wuthrich, Bernhard Schölkopf

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:46

19/08/2021

MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks

Menghui Zhu, Minghuan Liu, Jian Shen and
Zhicheng Zhang, Sheng Chen, Weinan Zhang, Deheng Ye, Yong Yu, Qiang Fu, Wei Yang

Keywords Paper

Machine Learning, Deep Reinforcement Learning, Reinforcement Learning

0

0

0

0

11:28

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords Paper

reinforcement learning and planning, robustness, representation learning

0

0

0

0

12:24

06/12/2020

Error Bounds of Imitating Policies and Environments

Tian Xu, Ziniu Li, Yang Yu

Keywords Paper

0

0

0

0

3:07

06/12/2020

Fighting Copycat Agents in Behavioral Cloning from Observation Histories

Chuan Wen, Jierui Lin, Trevor Darrell and
Dinesh Jayaraman, Yang Gao

Keywords Paper

, Reinforcement Learning and Planning -> Exploration

0

0

0

0

3:21

03/05/2021

On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning

Ren Wang, Kaidi Xu, Sijia Liu and
Pin-Yu Chen, Lily Weng, Chuang Gan, Meng Wang

Keywords Paper

0

0

0

0

5:12

02/02/2021

Improving Sample Efficiency in Model-Free Reinforcement Learning from Images

Denis Yarats, Amy Zhang, Ilya Kostrikov and
Brandon Amos, Joelle Pineau, Rob Fergus

Keywords Paper

0

0

0

0

12:19

18/07/2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning

Jongwook Choi, Archit Sharma, Honglak Lee and
Sergey Levine, Shixiang Gu

Keywords Paper

Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

0

0

0

0

5:16

06/12/2021

Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning

Kibeom Kim, Min Whoo Lee, Yoonsung Kim and
JeHwan Ryu, Minsu Lee, Byoung-Tak Zhang

Keywords Paper

reinforcement learning and planning

0

0

0

0

11:15

06/12/2021

Adversarial Intrinsic Motivation for Reinforcement Learning

Ishan Durugkar, Mauricio Tec, Scott Niekum, Peter Stone

Keywords Paper

reinforcement learning and planning, generative model

0

0

0

0

13:11

02/02/2021

An Adaptive Hybrid Framework for Cross-domain Aspect-based Sentiment Analysis

Yan Zhou, Fuqing Zhu, Pu Song and
Jizhong Han, Tao Guo, Songlin Hu

Keywords Paper

0

0

0

0

17:23

26/04/2020

Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling

Yuping Luo, Huazhe Xu, Tengyu Ma

Keywords Paper

imitation learning, model-based imitation learning, model-based RL, behavior cloning, covariate shift

0

0

0

0

4:38

16/11/2020

Generative adversarial training of product of policies for robust and adaptive movement primitives

Emmanuel Pignat, Hakan Girgin, Sylvain Calinon

Keywords Paper

0

0

0

0

4:26