The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning

06/12/2020

Harm Van Seijen, Hadi Nekoei, Evan Racah, Sarath Chandar

Abstract: Deep model-based Reinforcement Learning (RL) has the potential to substantially improve the sample efficiency of deep RL. While various challenges have long held it back, a number of papers have recently reported success with deep model-based methods. This is a great development, but the lack of a consistent metric to evaluate such methods makes it difficult to compare approaches. For example, the common single-task sample-efficiency metric conflates improvements due to model-based learning with other aspects, such as representation learning, making it difficult to assess true progress on model-based RL. To address this, we introduce an experimental setup to evaluate the model-based behavior of RL methods, inspired by work from neuroscience on detecting model-based behavior in humans and animals. Our metric based on this setup, the Local Change Adaptation (LoCA) regret, measures how quickly an RL method adapts to a local change in the environment. Our metric can identify model-based behavior even if the method uses a poor representation, and it provides insight into how close a method's behavior is to optimal model-based behavior. We use our setup to evaluate the model-based behavior of MuZero on a variation of the classic Mountain Car task.

The talk and the respective paper are published at the NeurIPS 2020 virtual conference.

Similar Papers

02/02/2021

Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation

Junhong Shen, Lin F. Yang

19:12

06/12/2021

Predify: Augmenting deep neural networks with brain-inspired predictive coding dynamics

Bhavin Choksi, Milad Mozafari, Callum Biggs O'May, B. ADOR, Andrea Alamia, Rufin VanRullen

Keywords: deep learning, machine learning, robustness, adversarial robustness and security, neuroscience, vision

11:21

16/11/2020

Domain Knowledge Empowered Structured Neural Net for End-to-End Event Temporal Relation Extraction

Rujun Han, Yichao Zhou, Nanyun Peng

Keywords: extracting relations, information extraction, natural understanding, maximum inference

12:03

06/12/2020

Dynamic allocation of limited memory resources in reinforcement learning

Nisheet Patel, Luigi Acerbi, Alexandre Pouget

3:19

19/08/2021

Reward-Constrained Behavior Cloning

Zhaorong Wang, Meng Wang, Jingqi Zhang, Yingfeng Chen, Chongjie Zhang

Keywords: Machine Learning, Deep Reinforcement Learning, Reinforcement Learning, Constraint Optimization

14:43

12/07/2020

Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings

Jesse Zhang, Brian Cheung, Chelsea Finn, Sergey Levine, Dinesh Jayaraman

Keywords: Reinforcement Learning - Deep RL

14:54

22/11/2021

Incremental Learning for Animal Pose Estimation using RBF k-DPP

Gaurav Kumar Nayak, Het Shah, Anirban Chakraborty

Keywords: animal pose estimation, incremental learning, Determinantal Point Processes, k-DPP, RBF k-DPP, image warping, exemplar memory

2:54

06/12/2020

Factorized Neural Processes for Neural Processes: K-Shot Prediction of Neural Responses

Ronald (James) Cotton, Fabian Sinz, Andreas Tolias

3:18

06/12/2021

Learning Domain Invariant Representations in Goal-conditioned Block MDPs

Beining Han, Chongyi Zheng, Harris Chan, Keiran Paster, Michael Zhang, Jimmy Ba

Keywords: reinforcement learning and planning, domain adaptation, representation learning

9:31

12/07/2020

Adversarial Filters of Dataset Biases

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula, Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords: Deep Learning - Algorithms

15:25

06/12/2020

Trust the Model When It Is Confident: Masked Model-based Actor-Critic

Feiyang Pan, Jia He, Dandan Tu, Qing He

2:57

06/12/2021

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

Nicklas Hansen, Hao Su, Xiaolong Wang

Keywords: reinforcement learning and planning, transformers

8:43

03/05/2021

Sample-Efficient Automated Deep Reinforcement Learning

Jörg Franke, Gregor Koehler, André Biedenkapp, Frank Hutter

Keywords: Neuroevolution, Hyperparameter Optimization, Deep Reinforcement Learning, AutoRL

4:36

06/12/2021

Visual Search Asymmetry: Deep Nets and Humans Share Similar Inherent Biases

Shashi Kant Gupta, Mengmi Zhang, CHIA-CHIEN WU, Jeremy Wolfe, Gabriel Kreiman

Keywords: deep learning, neuroscience

12:06

06/12/2021

Associative Memories via Predictive Coding

Tommaso Salvatori, Yuhang Song, Yujian Hong, Lei Sha, Simon Frieder, Zhenghua Xu, Rafal Bogacz, Thomas Lukasiewicz

Keywords: deep learning, robustness, generative model

12:22

30/11/2020

Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation

Siqi Yang, Lin Wu, Arnold Wiliem, Brian C. Lovell

6:32

16/11/2020

Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data

Shachar Rosenman, Alon Jacovi, Yoav Goldberg

Keywords: data process, re collection, sota models, tacred

5:55

12/07/2020

Enhancing Simple Models by Exploiting What They Already Know

Amit Dhurandhar, Karthikeyan Shanmugam, Ronny Luss

Keywords: Supervised Learning

13:57

06/12/2020

Self-Adaptive Training: beyond Empirical Risk Minimization

Lang Huang, Chao Zhang, Hongyang Zhang

Keywords: Deep Learning -> Generative Models, Algorithms -> Semi-Supervised Learning

3:23

06/12/2020

A Theoretical Framework for Target Propagation

Alexander Meulemans, Francesco Carzaniga, Johan Suykens, João Sacramento, Benjamin F. Grewe

3:20

02/02/2021

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate, Tom Melham, Daniel Kroening

15:45

16/11/2020

Learning to Communicate and Correct Pose Errors

Nicholas Vadivelu, Mengye Ren, James Tu, Jingkang Wang, Raquel Urtasun

5:02

06/12/2021

Fitting summary statistics of neural data with a differentiable spiking network simulator

Guillaume Bellec, Shuqi Wang, Alireza Modirshanechi, Johanni Brea, Wulfram Gerstner

Keywords: optimization, neuroscience

13:07

14/06/2020

Modeling Biological Immunity to Adversarial Examples

Edward Kim, Jocelyn Rego, Yijing Watkins, Garrett T. Kenyon

Keywords: adversarial examples, sparse coding, retina, cortex, neuron, biology, robust, feedback

1:01

26/04/2020

Variational Recurrent Models for Solving Partially Observable Control Tasks

Dongqi Han, Kenji Doya, Jun Tani

Keywords: Reinforcement Learning, Deep Learning, Variational Inference, Recurrent Neural Network, Partially Observable, Robotic Control, Continuous Control

4:59

16/11/2020

Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference

Xiaoan Ding, Tianyu Liu, Baobao Chang, Zhifang Sui, Kevin Gimpel

Keywords: natural inference, nli tasks, discriminative fine-tuning, discriminative classifiers

11:37

06/12/2021

Learning to Generate Visual Questions with Noisy Supervision

Shen Kai, Lingfei Wu, Siliang Tang, Yueting Zhuang, zhen he, Zhuoye Ding, Yun Xiao, Bo Long

Keywords: generative model

14:54

26/04/2020

On Generalization Error Bounds of Noisy Gradient Methods for Non-Convex Learning

Jian Li, Xuanyuan Luo, Mingda Qiao

Keywords: learning theory, generalization, nonconvex learning, stochastic gradient descent, Langevin dynamics

4:50

02/02/2021

Improving Sample Efficiency in Model-Free Reinforcement Learning from Images

Denis Yarats, Amy Zhang, Ilya Kostrikov, Brandon Amos, Joelle Pineau, Rob Fergus

12:19

06/12/2020

Deep Graph Pose: a semi-supervised deep graphical model for improved animal pose tracking

Anqi Wu, Kelly Buchanan, Matthew Whiteway, Michael Schartner, Guido Meijer, Jean-Paul Noel, Erica Rodriguez, Claire Everett, Amy Norovich, Evan Schaffer, Neeli Mishra, C. Daniel Salzman, Dora Angelaki, Andrés Bendesky, The International Brain Laboratory, John Cunningham, Liam Paninski

3:24

06/12/2020

Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations

Huan Zhang, Hongge Chen, Chaowei Xiao, Bo Li, Mingyan Liu, Duane Boning, Cho-Jui Hsieh

3:18

02/02/2021

Evolutionary Approach for AutoAugment Using the Thermodynamical Genetic Algorithm

Akira Terauchi, Naoki Mori

17:42

06/12/2021

Scalable Diverse Model Selection for Accessible Transfer Learning

Daniel Bolya, Rohit Mittapalli, Judy Hoffman

Keywords: deep learning, vision, transfer learning

7:04

06/12/2020

A meta-learning approach to (re)discover plasticity rules that carve a desired function into a neural network

Basile Confavreux, Friedemann Zenke, Everton Agnes, Timothy Lillicrap, Tim Vogels

3:25

02/02/2021

Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

Kaixin Ma, Filip Ilievski, Jonathan Francis, Yonatan Bisk, Eric Nyberg, Alessandro Oltramari

18:24

02/02/2021

Successor Feature Sets: Generalizing Successor Representations Across Policies

Kianté Brantley, Soroush Mehri, Geoff J. Gordon

17:43

06/12/2021

Learning interaction rules from multi-animal trajectories via augmented behavioral models

Keisuke Fujii, Naoya Takeishi, Kazushi Tsutsui, Emyo Fujioka, Nozomi Nishiumi, Ryoya Tanaka, Mika Fukushiro, Kaoru Ide, Hiroyoshi Kohno, Ken Yoda, Susumu Takahashi, Shizuko Hiryu, Yoshinobu Kawahara

Keywords: theory, deep learning, causality, interpretability

12:28

26/04/2020

Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control

Tsui-Wei Weng, Krishnamurthy (Dj) Dvijotham, Jonathan Uesato, Kai Xiao, Sven Gowal, Robert Stanforth, Pushmeet Kohli

Keywords: deep learning, reinforcement learning, robustness, adversarial examples

6:00

07/09/2020

Transferring Pretrained Networks to Small Data via Category Decorrelation

Ying Jin, Zhangjie Cao, Mingsheng Long, Jianmin Wang

Keywords: Category Decorrelation, Under Transfer

8:39

06/12/2020

Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards

Yijie Guo, Jongwook Choi, Marcin Moczulski, Shengyu Feng, Samy Bengio, Mohammad Norouzi, Honglak Lee

3:30