Augmenting Policy Learning with Routines Discovered from a Single Demonstration

02/02/2021

Augmenting Policy Learning with Routines Discovered from a Single Demonstration

Zelin Zhao, Chuang Gan, Jiajun Wu, Xiaoxiao Guo, Joshua B. Tenenbaum

Keywords:

Abstract Paper Similar Papers

Abstract: Humans can abstract prior knowledge from very little data and use it to boost skill learning. In this paper, we propose routine-augmented policy learning (RAPL), which discovers routines composed of primitive actions from a single demonstration and uses discovered routines to augment policy learning. To discover routines from the demonstration, we first abstract routine candidates by identifying grammar over the demonstrated action trajectory. Then, the best routines measured by length and frequency are selected to form a routine library. We propose to learn policy simultaneously at primitive-level and routine-level with discovered routines, leveraging the temporal structure of routines. Our approach enables imitating expert behavior at multiple temporal scales for imitation learning and promotes reinforcement learning exploration. Extensive experiments on Atari games demonstrate that RAPL improves the state-of-the-art imitation learning method SQIL and reinforcement learning method A2C. Further, we show that discovered routines can generalize to unseen levels and difficulties on the CoinRun benchmark.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38947875

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Discovering Reinforcement Learning Algorithms

Junhyuk Oh, Matteo Hessel, Wojciech Czarnecki and
Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver

Keywords Paper

0

0

0

0

3:21

03/05/2021

Mastering Atari with Discrete World Models

Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba

Keywords Paper

reinforcement learning, actor critic, model-based reinforcement learning, world models, Atari, planning

1

0

0

0

5:52

02/02/2021

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate and
Tom Melham, Daniel Kroening

Keywords Paper

0

0

0

0

15:45

06/12/2021

Learning Diverse Policies in MOBA Games via Macro-Goals

Yiming Gao, Bei Shi, Xueying Du and
Liang Wang, Guangwei Chen, Zhenjie Lian, Fuhao Qiu, GUOAN HAN, Weixuan Wang, Deheng Ye, Qiang Fu, Wei Yang, Lanxiao Huang

Keywords Paper

reinforcement learning and planning

0

0

0

0

6:49

12/07/2020

ConQUR: Mitigating Delusional Bias in Deep Q-Learning

DiJia Su, Jayden Ooi, Tyler Lu and
Dale Schuurmans, Craig Boutilier

Keywords Paper

Reinforcement Learning - General

0

0

0

0

15:04

12/07/2020

Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences

Daniel Brown, Scott Niekum, Russell Coleman, Ravi Srinivasan

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:11

06/12/2021

Behavior From the Void: Unsupervised Active Pre-Training

Hao Liu, Pieter Abbeel

Keywords Paper

reinforcement learning and planning

0

0

0

0

10:34

26/04/2020

Model Based Reinforcement Learning for Atari

Łukasz Kaiser, Mohammad Babaeizadeh, Piotr Miłos and
Błażej Osiński, Roy H Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Afroz Mohiuddin, Ryan Sepassi, George Tucker, Henryk Michalewski

Keywords Paper

reinforcement learning, model based rl, video prediction model, atari

0

0

0

0

5:02

16/11/2020

Chaining Behaviors from Data with Model-Free Reinforcement Learning

Avi Singh, Albert Yu, Jonathan Yang and
Jesse Zhang, Aviral Kumar, Sergey Levine

Keywords Paper

0

0

0

0

5:01

18/07/2021

Guided Exploration with Proximal Policy Optimization using a Single Demonstration

Gabriele Libardi, Gianni De Fabritiis, Sebastian Dittert

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:12

03/05/2021

Learning to Reach Goals via Iterated Supervised Learning

Dibya Ghosh, Abhishek Gupta, Ashwin D Reddy and
Justin Fu, Coline M Devin, Ben Eysenbach, Sergey Levine

Keywords Paper

goal reaching, reinforcement learning, goal-conditioned RL, behavior cloning

0

0

0

0

15:19

16/11/2020

Few-Shot Complex Knowledge Base Question Answering via Meta Reinforcement Learning

Yuncheng Hua, Yuan-Fang Li, Gholamreza Haffari and
Guilin Qi, Tongtong Wu

Keywords Paper

program induction, meta-training, cqa, neural approach

0

0

0

0

12:41

06/12/2021

Provable Representation Learning for Imitation with Contrastive Fourier Features

Ofir Nachum, Mengjiao Yang

Keywords Paper

reinforcement learning and planning, contrastive learning, representation learning

0

0

0

0

15:06

06/12/2020

Latent World Models For Intrinsically Motivated Exploration

Aleksandr Ermolov, Nicu Sebe

Keywords Paper

0

0

0

0

2:47

26/04/2020

Fast Task Inference with Variational Intrinsic Successor Features

Steven Hansen, Will Dabney, Andre Barreto and
David Warde-Farley, Tom Van de Wiele, Volodymyr Mnih

Keywords Paper

Reinforcement Learning, Variational Intrinsic Control, Successor Features

0

0

0

0

14:47

03/05/2021

Concept Learners for Few-Shot Learning

Kaidi Cao, Maria Brbic, Jure Leskovec

Keywords Paper

few-shot learning, meta learning

0

0

0

0

4:55

26/04/2020

Disagreement-Regularized Imitation Learning

Kiante Brantley, Wen Sun, Mikael Henaff

Keywords Paper

imitation learning, reinforcement learning, uncertainty

0

0

0

0

4:53

03/05/2021

Learning Generalizable Visual Representations via Interactive Gameplay

Luca Weihs, Ani Kembhavi, Kiana Ehsani and
Sarah M Pratt, Winson Han, Alvaro Herrasti, Eric Kolve, Dustin Schwenk, Roozbeh Mottaghi, Ali Farhadi

Keywords Paper

computer vision, deep reinforcement learning, representation learning

0

0

0

0

14:17

06/12/2020

Language-Conditioned Imitation Learning for Robot Manipulation Tasks

Simon Stepputtis, Joseph Campbell, Mariano Phielipp and
Stefan Lee, Chitta Baral, Heni Ben Amor

Keywords Paper

0

0

0

0

3:09

06/12/2020

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning

Weili Nie, Zhiding Yu, Lei Mao and
Ankit Patel, Yuke Zhu, Anima Anandkumar

Keywords Paper

0

0

0

0

3:23

19/08/2021

Cross-Domain Few-Shot Classification via Adversarial Task Augmentation

Haoqing Wang, Zhi-Hong Deng

Keywords Paper

Computer Vision, Recognition, Adversarial Machine Learning, Deep Learning

0

0

0

0

10:39

06/12/2021

Neural Auto-Curricula in Two-Player Zero-Sum Games

Xidong Feng, Oliver Slumbers, Ziyu Wan and
Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang

Keywords Paper

deep learning, optimization, reinforcement learning and planning, meta learning

0

0

0

0

14:46

03/05/2021

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh, Huihan Liu, Gaoyue Zhou and
Albert Yu, Nicholas Rhinehart, Sergey Levine

Keywords Paper

reinforcement learning, imitation learning

0

0

0

0

14:21

16/11/2020

ALICE: Active Learning with Contrastive Natural Language Explanations

Weixin Liang, James Zou, Zhou Yu

Keywords Paper

active learning, data learning, learning, visual tasks

0

0

0

0

10:26

03/05/2021

Evolving Reinforcement Learning Algorithms

John Co-Reyes, Yingjie Miao, Daiyi Peng and
Esteban Real, Quoc V Le, Sergey Levine, Honglak Lee, Aleksandra Faust

Keywords Paper

reinforcement learning, genetic programming, meta-learning, evolutionary algorithms

0

0

0

0

13:59

29/06/2020

What is the vocabulary of flaky tests?

Gustavo Pinto, Breno Miranda, Supun Dissanayake and
Marcelo Amorim, Christoph Treude, Antonia Bertolino

Keywords Paper

Regression testing, Text classification, Test flakiness

0

0

0

0

13:04

03/05/2021

Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning

Valerie Chen, Abhinav Gupta, Kenny Marino

Keywords Paper

0

0

0

0

5:04

06/12/2021

Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality

Songyuan Zhang, ZHANGJIE CAO, Dorsa Sadigh, Yanan Sui

Keywords Paper

reinforcement learning and planning

0

0

0

0

13:50

26/04/2020

Never Give Up: Learning Directed Exploration Strategies

Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi and
Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martin Arjovsky, Alexander Pritzel, Andrew Bolt, Charles Blundell

Keywords Paper

deep reinforcement learning, exploration, intrinsic motivation

0

0

0

0

5:30

16/11/2020

Few-shot Object Grounding and Mapping for Natural Language Robot Instruction Following

Valts Blukis, Ross Knepper, Yoav Artzi

Keywords Paper

0

0

0

0

5:06

18/07/2021

Emphatic Algorithms for Deep Reinforcement Learning

Ray Jiang, Tom Zahavy, Zhongwen Xu and
Adam White, Matteo Hessel, Charles Blundell, Hado van Hasselt

Keywords Paper

Reinforcement Learning and Planning, Deep RL

1

0

0

0

5:21

16/11/2020

Learning Object Manipulation Skills via Approximate State Estimation from Real Videos

Vladimír Petrík, Makarand Tapaswi, Ivan Laptev, Josef Sivic

Keywords Paper

0

0

0

0

5:07

06/12/2021

Supervising the Transfer of Reasoning Patterns in VQA

Corentin Kervadec, Christian Wolf, Grigory Antipov and
Moez Baccouche, Madiha Nadri

Keywords Paper

theory, deep learning, vision

0

0

0

0

12:54

18/07/2021

Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment

Philip Ball, Cong Lu, Jack Parker-Holder, Stephen Roberts

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

5:35

12/07/2020

Agent57: Outperforming the Atari Human Benchmark

Adrià Puigdomenech Badia, Bilal Piot, Steven Kapturowski and
Pablo Sprechmann, Oleksandr Vitvitskyi, Zhaohan Guo, Charles Blundell

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

10:01

06/12/2020

Munchausen Reinforcement Learning

Nino Vieillard, Olivier Pietquin, Matthieu Geist

Keywords Paper

0

0

0

0

3:19

06/12/2021

Play to Grade: Testing Coding Games as Classifying Markov Decision Process

Allen Nie, Emma Brunskill, Chris Piech

Keywords Paper

reinforcement learning and planning

0

0

0

0

14:27

06/12/2021

Visual Adversarial Imitation Learning using Variational Models

Rafael Rafailov, Tianhe Yu, Aravind Rajeswaran, Chelsea Finn

Keywords Paper

theory, reinforcement learning and planning, adversarial robustness and security, representation learning

0

0

0

0

7:25

16/11/2020

Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning

Ryan Julian, Benjamin Swanson, Gaurav Sukhatme and
Sergey Levine, Chelsea Finn, Karol Hausman

Keywords Paper

0

0

0

0

5:47

03/05/2021

Model-Based Visual Planning with Self-Supervised Functional Distances

Stephen Tian, Suraj Nair, Frederik Ebert and
Sudeep Dasari, Ben Eysenbach, Chelsea Finn, Sergey Levine

Keywords Paper

reinforcement learning, distance learning, model learning, robotics, planning

0

0

0

0

9:11