The Value of Information When Deciding What to Learn

06/12/2021

Dilip Arumugam, Benjamin Van Roy

Keywords: theory, reinforcement learning and planning, bandits

Abstract: All sequential decision-making agents explore so as to acquire knowledge about a particular target. It is often the responsibility of the agent designer to construct this target which, in rich and complex environments, constitutes an onerous burden; without full knowledge of the environment itself, a designer may forge a sub-optimal learning target that poorly balances the amount of information an agent must acquire to identify the target against the target's associated performance shortfall. While recent work has developed a connection between learning targets and rate-distortion theory to address this challenge and empower agents that decide what to learn in an automated fashion, the proposed algorithm does not optimally tackle the equally important challenge of efficient information acquisition. In this work, building upon the seminal design principle of information-directed sampling (Russo & Van Roy, 2014), we address this shortcoming directly to couple optimal information acquisition with the optimal design of learning targets. Along the way, we offer new insights into learning targets from the literature on rate-distortion theory before turning to empirical results that confirm the value of information when deciding what to learn.
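
The trade-off the abstract describes, between the information needed to identify a target and the performance shortfall of that target, is the classical rate-distortion trade-off. The following is a minimal sketch of the standard Blahut-Arimoto iteration for a discrete source, offered only as an illustration of that trade-off. It is not the authors' algorithm; the function name `blahut_arimoto`, its parameters, and the toy binary source with Hamming distortion are all assumptions made for this example.

```python
import math

def blahut_arimoto(p_x, dist, beta, iters=200):
    """Return (rate_in_nats, expected_distortion) for a discrete source.

    p_x:  source probabilities, length n (illustrative stand-in for the
          agent's uncertainty over environments)
    dist: n x m matrix; dist[i][j] is the distortion (shortfall) incurred
          when source symbol i is represented by target j
    beta: Lagrange multiplier; larger values favour low distortion over
          low rate
    """
    n, m = len(p_x), len(dist[0])
    q = [1.0 / m] * m  # marginal over targets, initialised uniform
    for _ in range(iters):
        # Conditional p(j | i) proportional to q(j) * exp(-beta * d(i, j)).
        cond = []
        for i in range(n):
            w = [q[j] * math.exp(-beta * dist[i][j]) for j in range(m)]
            z = sum(w)
            cond.append([wj / z for wj in w])
        # Marginal update: q(j) = sum_i p(i) * p(j | i).
        q = [sum(p_x[i] * cond[i][j] for i in range(n)) for j in range(m)]
    rate = 0.0        # mutual information between source and target
    distortion = 0.0  # expected distortion under the joint distribution
    for i in range(n):
        for j in range(m):
            pij = cond[i][j]
            if pij > 0.0 and q[j] > 0.0:
                rate += p_x[i] * pij * math.log(pij / q[j])
            distortion += p_x[i] * pij * dist[i][j]
    return rate, distortion

# Toy example (assumed): uniform binary source with Hamming distortion.
hamming = [[0.0, 1.0], [1.0, 0.0]]
for beta in (0.0, 2.0, 50.0):
    r, d = blahut_arimoto([0.5, 0.5], hamming, beta)
    print(f"beta={beta}: rate={r:.4f} nats, distortion={d:.4f}")
```

Sweeping `beta` traces out the curve: at `beta = 0` the target carries no information about the source and the distortion is at its maximum, while a large `beta` drives the distortion towards zero at the cost of a rate approaching the source entropy.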

The talk and the corresponding paper were published at the NeurIPS 2021 virtual conference.

Similar Papers

18/07/2021

Deciding What to Learn: A Rate-Distortion Approach

Dilip Arumugam, Benjamin Van Roy

Keywords: Reinforcement Learning and Planning, Bandits

5:12

26/04/2020

Automated curriculum generation through setter-solver interactions

Sebastien Racaniere, Andrew Lampinen, Adam Santoro, David Reichert, Vlad Firoiu, Timothy Lillicrap

Keywords: Deep Reinforcement Learning, Automatic Curriculum

3:55

19/10/2020

ALEX: Active learning based enhancement of a classification model’s EXplainability

Ishani Mondal, Debasis Ganguly

Keywords: image classification, model interpretability, active learning

5:00

03/05/2021

Fast And Slow Learning Of Recurrent Independent Mechanisms

Kanika Madan, Nan Rosemary Ke, Anirudh Goyal, Bernhard Schoelkopf, Yoshua Bengio

Keywords: better generalization, modular representations, learning mechanisms

5:09

18/07/2021

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning

Kevin Li, Abhishek Gupta, Ashwin D Reddy, Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine

Keywords: Reinforcement Learning and Planning, Deep RL

5:16

06/12/2020

Continuous Meta-Learning without Tasks

James Harrison, Apoorva Sharma, Chelsea Finn, Marco Pavone

3:09

26/04/2020

Meta-Learning without Memorization

Mingzhang Yin, George Tucker, Mingyuan Zhou, Sergey Levine, Chelsea Finn

Keywords: meta-learning, memorization, regularization, overfitting, mutually-exclusive

5:09

12/07/2020

Robust Black Box Explanations Under Distribution Shift

Himabindu Lakkaraju, Nino Arsov, Osbert Bastani

Keywords: Accountability, Transparency and Interpretability

14:02

06/12/2021

Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning

Kibeom Kim, Min Whoo Lee, Yoonsung Kim, JeHwan Ryu, Minsu Lee, Byoung-Tak Zhang

Keywords: reinforcement learning and planning

11:15

26/04/2020

Composing Task-Agnostic Policies with Deep Reinforcement Learning

Ahmed H. Qureshi, Jacob J. Johnson, Yuzhe Qin, Taylor Henderson, Byron Boots, Michael C. Yip

Keywords: composition, transfer learning, deep reinforcement learning

4:57

14/09/2020

Active Learning for Hierarchical Multi-Label Classification

Felipe Kenji Nakano, Ricardo Cerri, Vens Celin

15:42

14/09/2020

A Taxonomy of Interactive Online Machine Learning Strategies

Agnes Tegen, Paul Davidsson, Jan A. Persson

Keywords: interactive machine learning, online learning, active learning

14:20

02/02/2021

Learning Generalized Relational Heuristic Networks for Model-Agnostic Planning

Rushang Karia, Siddharth Srivastava

16:56

07/09/2020

Unsupervised Domain Adaptation for Spatio-Temporal Action Localization

Nakul Agarwal, Yi-Ting Chen, Behzad Dariush, Ming-Hsuan Yang

Keywords: Spatio-Temporal Action Localization, Unsupervised Domain Adaptation, Adversarial Learning, Video Analysis, Deep Learning

9:28

06/12/2020

Probabilistic Active Meta-Learning

Jean Kaddour, Steindor Saemundsson, Marc Deisenroth

3:17

26/04/2020

Automated Relational Meta-learning

Huaxiu Yao, Xian Wu, Zhiqiang Tao, Yaliang Li, Bolin Ding, Ruirui Li, Zhenhui Li

Keywords: meta-learning, task heterogeneity, meta-knowledge graph

5:13

06/12/2020

Automatic Curriculum Learning through Value Disagreement

Yunzhi Zhang, Pieter Abbeel, Lerrel Pinto

3:17

06/12/2021

Discovery of Options via Meta-Learned Subgoals

Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Keywords: deep learning, reinforcement learning and planning

4:13

06/12/2021

Program Synthesis Guided Reinforcement Learning for Partially Observed Environments

Yichen Yang, Jeevana Priya Inala, Osbert Bastani, Yewen Pu, Armando Solar-Lezama, Martin Rinard

Keywords: reinforcement learning and planning, generative model

14:56

06/12/2020

Information-theoretic Task Selection for Meta-Reinforcement Learning

Ricardo Luna Gutierrez, Matteo Leonetti

2:57

18/07/2021

Self-Damaging Contrastive Learning

Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang

Keywords: Algorithms, Unsupervised Learning

5:10

18/07/2021

World Model as a Graph: Learning Latent Landmarks for Planning

Lunjun Zhang, Ge Yang, Bradly Stadie

Keywords: Applications, Computer Vision, Algorithms, Classification; Applications, Computational Social Science; Applications, Visual Scene Analysis and Interpret, Reinforcement Learning and Planning, Deep RL

12:48

02/02/2021

Progressive Multi-task Learning with Controlled Information Flow for Joint Entity and Relation Extraction

Kai Sun, Richong Zhang, Samuel Mensah, Yongyi Mao, Xudong Liu

13:45

12/07/2020

Provably Efficient Model-based Policy Adaptation

Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao

Keywords: Reinforcement Learning - Deep RL

15:49

03/05/2021

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh, Huihan Liu, Gaoyue Zhou, Albert Yu, Nicholas Rhinehart, Sergey Levine

Keywords: reinforcement learning, imitation learning

14:21

06/12/2021

Hierarchical Reinforcement Learning with Timed Subgoals

Nico Gürtler, Dieter Büchler, Georg Martius

Keywords: reinforcement learning and planning

8:17

03/05/2021

Adaptive Procedural Task Generation for Hard-Exploration Problems

Kuan Fang, Yuke Zhu, Silvio Savarese, Li Fei-Fei

Keywords: reinforcement learning, task generation, procedural generation, curriculum learning

5:06

06/12/2021

No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data

Mi Luo, Fei Chen, Dapeng Hu, Yifan Zhang, Jian Liang, Jiashi Feng

Keywords: optimization, machine learning, federated learning

3:27

06/12/2021

Automatic Unsupervised Outlier Model Selection

Yue Zhao, Ryan Rossi, Leman Akoglu

Keywords: machine learning, self-supervised learning, meta learning, clustering

15:08

06/12/2020

Early-Learning Regularization Prevents Memorization of Noisy Labels

Sheng Liu, Jonathan Niles-Weed, Narges Razavian, Carlos Fernandez-Granda

3:06

02/02/2021

Dialog Policy Learning for Joint Clarification and Active Learning Queries

Aishwarya Padmakumar, Raymond J. Mooney

18:46

02/02/2021

Deeplite Neutrino™: A BlackBox Framework for Constrained Deep Learning Model Optimization

Anush Sankaran, Olivier Mastropietro, Ehsan Saboori, Yasser Idris, Davis Sawyer, MohammadHossein AskariHemmat, Ghouthi Boukli Hacene

18:29

03/05/2021

Learning the Pareto Front with Hypernetworks

Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik

Keywords: multi-task learning, Multi-objective optimization

5:19

06/12/2021

Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning

Jannik Kossen, Neil Band, Clare Lyle, Aidan Gomez, Thomas Rainforth, Yarin Gal

Keywords: deep learning, transformers

9:54

18/07/2021

Goal-Conditioned Reinforcement Learning with Imagined Subgoals

Elliot Chane-Sane, Cordelia Schmid, Ivan Laptev

Keywords: Reinforcement Learning and Planning, Deep RL

4:57

06/12/2021

An Information-theoretic Approach to Distribution Shifts

Marco Federici, Ryota Tomioka, Patrick Forré

Keywords: theory, deep learning, machine learning, graph learning, domain adaptation, representation learning

9:50

06/12/2021

Meta-learning with an Adaptive Task Scheduler

Huaxiu Yao, Yu Wang, Ying Wei, Peilin Zhao, Mehrdad Mahdavi, Defu Lian, Chelsea Finn

Keywords: optimization, meta learning

15:12

13/04/2021

Learning to defend by learning to attack

Haoming Jiang, Zhehui Chen, Yuyang Shi, Bo Dai, Tuo Zhao

2:58

03/05/2021

Learning What To Do by Simulating the Past

David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

Keywords: reinforcement learning, imitation learning, reward learning

5:09

03/05/2021

C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian, Gabriel Loaiza-Ganem, Harry Braviner, Anthony Caterini, Jesse C Cresswell, Tong Li, Animesh Garg

Keywords: reinforcement learning, goal reaching, Q-learning

4:49