Deciding What to Learn: A Rate-Distortion Approach

18/07/2021

Dilip Arumugam, Benjamin Van Roy

Keywords: Reinforcement Learning and Planning, Bandits

Abstract: Agents that learn to select optimal actions represent a prominent focus of the sequential decision-making literature. In the face of a complex environment or constraints on time and resources, however, aiming to synthesize such an optimal policy can become infeasible. These scenarios give rise to an important trade-off between the information an agent must acquire to learn and the sub-optimality of the resulting policy. While an agent designer has a preference for how this trade-off is resolved, existing approaches further require that the designer translate these preferences into a fixed learning target for the agent. In this work, leveraging rate-distortion theory, we automate this process such that the designer need only express their preferences via a single hyperparameter and the agent is endowed with the ability to compute its own learning targets that best achieve the desired trade-off. We establish a general bound on expected discounted regret for an agent that decides what to learn in this manner along with computational experiments that illustrate the expressiveness of designer preferences and even show improvements over Thompson sampling in identifying an optimal policy.

The talk and the respective paper are published at the ICML 2021 virtual conference.

Similar Papers

06/12/2020

Online learning with dynamics: A minimax perspective

Kush Bhatia, Karthik Sridharan

3:09

06/12/2021

The Value of Information When Deciding What to Learn

Dilip Arumugam, Benjamin Van Roy

Keywords: theory, reinforcement learning and planning, bandits

9:27

03/05/2021

Fast And Slow Learning Of Recurrent Independent Mechanisms

Kanika Madan, Nan Rosemary Ke, Anirudh Goyal, Bernhard Schoelkopf, Yoshua Bengio

Keywords: better generalization, modular representations, learning mechanisms

5:09

26/08/2020

Multi-attribute Bayesian optimization with interactive preference learning

Raul Astudillo, Peter Frazier

14:06

18/07/2021

A Distribution-dependent Analysis of Meta Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvari

Keywords: Theory, Statistical Learning Theory

5:06

18/07/2021

Zeroth-Order Non-Convex Learning via Hierarchical Dual Averaging

Amélie Héliou, Matthieu Martin, Panayotis Mertikopoulos, Thibaud J Rahier

Keywords: Optimization, Non-Convex Optimization

5:25

03/05/2021

Learning to Make Decisions via Submodular Regularization

Ayya Alieva, Aiden Aceves, Jialin Song, Stephen Mayo, Yisong Yue, Yuxin Chen

5:53

06/12/2020

Learning Linear Programs from Optimal Decisions

Yingcong Tan, Daria Terekhov, Andrew Delong

Keywords: Applications -> Privacy, Anonymity, and Security

3:21

26/04/2020

Learning to Balance: Bayesian Meta-Learning for Imbalanced and Out-of-distribution Tasks

Hae Beom Lee, Hayeon Lee, Donghyun Na, Saehoon Kim, Minseop Park, Eunho Yang, Sung Ju Hwang

Keywords: meta-learning, few-shot learning, Bayesian neural network, variational inference, learning to learn, imbalanced and out-of-distribution tasks for few-shot learning

13:46

06/12/2021

Learning Equilibria in Matching Markets from Bandit Feedback

Meena Jagadeesan, Alexander Wei, Yixin Wang, Michael Jordan, Jacob Steinhardt

Keywords: bandits

15:04

16/11/2020

Tolerance-Guided Policy Learning for Adaptable and Transferrable Delicate Industrial Insertion

Boshen Niu, Chenxi Wang, Changliu Liu

5:36

03/05/2021

Learning the Pareto Front with Hypernetworks

Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik

Keywords: multi-task learning, Multi-objective optimization

5:19

03/05/2021

Learning What To Do by Simulating the Past

David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

Keywords: reinforcement learning, imitation learning, reward learning

5:09

06/12/2021

Learning-to-learn non-convex piecewise-Lipschitz functions

Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, Ameet S Talwalkar

Keywords: optimization, machine learning, robustness, meta learning, online learning

14:13

14/09/2020

Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning

Arjun Manoharan, Rahul Ramesh, Balaraman Ravindran

Keywords: hierarchical reinforcement learning, policy distillation

13:49

22/09/2020

Improving one-class recommendation with multi-tasking on various preference intensities

Chu-Jen Shao, Hao-Ming Fu, Pu-Jen Cheng

Keywords: implicit feedback, graph convolutional network, one-class recommendation, collaborative filtering

2:38

06/12/2020

Agnostic Learning with Multiple Objectives

Corinna Cortes, Mehryar Mohri, Javier Gonzalvo, Dmitry Storcheus

3:07

26/04/2020

Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives

Anirudh Goyal, Shagun Sodhani, Jonathan Binas, Xue Bin Peng, Sergey Levine, Yoshua Bengio

Keywords: Reinforcement Learning, Variational Information Bottleneck, Learning primitives

5:05

13/04/2021

Non-stationary off-policy optimization

Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed

2:57

18/07/2021

Off-Belief Learning

Hengyuan Hu, Adam Lerer, Brandon Cui, Luis Pineda, Noam Brown, Jakob Foerster

Keywords: Reinforcement Learning and Planning

5:10

14/09/2020

A Taxonomy of Interactive Online Machine Learning Strategies

Agnes Tegen, Paul Davidsson, Jan A. Persson

Keywords: interactive machine learning, online learning, active learning

14:20

06/12/2021

Compositional Reinforcement Learning from Logical Specifications

Kishor Jothimurugan, Suguman Bansal, Osbert Bastani, Rajeev Alur

Keywords: deep learning, reinforcement learning and planning, graph learning

14:50

06/12/2020

Model-based Adversarial Meta-Reinforcement Learning

Zichuan Lin, Garrett Thomas, Guangwen Yang, Tengyu Ma

3:31

12/07/2020

Global Decision-Making via Local Economic Transactions

Michael Chang, Sid Kaushik, S. Matthew Weinberg, Sergey Levine, Thomas Griffiths

Keywords: Reinforcement Learning - General

14:46

03/05/2021

Batch Reinforcement Learning Through Continuation Method

Yijie Guo, Shengyu Feng, Nicolas Le Roux, Ed H. Chi, Honglak Lee, Minmin Chen

Keywords: batch reinforcement learning, relaxed regularization, continuation method

5:34

19/08/2021

Contrastive Losses and Solution Caching for Predict-and-Optimize

Maxime Mulamba, Jayanta Mandi, Michelangelo Diligenti, Michele Lombardi, Victor Bucarey, Tias Guns

Keywords: Machine Learning, Neuro-Symbolic Methods, Structured Prediction, Constraint Optimization

12:10

06/12/2020

Learning Differentiable Programs with Admissible Neural Heuristics

Ameesh Shah, Eric Zhan, Jennifer Sun, Abhinav Verma, Yisong Yue, Swarat Chaudhuri

Keywords: Algorithms -> Missing Data; Algorithms -> Uncertainty Estimation; Probabilistic Methods -> Causal Inference; Probabilistic Meth, Probabilistic Methods -> Bayesian Nonparametrics

3:28

06/12/2020

Functional Regularization for Representation Learning: A Unified Theoretical Perspective

Siddhant Garg, Yingyu Liang

3:19

06/12/2020

Online Bayesian Persuasion

Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Nicola Gatti

3:00

13/04/2021

Linear models are robust optimal under strategic behavior

Wei Tang, Chien-Ju Ho, Yang Liu

3:32

03/08/2020

Semi-bandit Optimization in the Dispersed Setting

Travis Dick, Wesley Pegden, Maria-Florina Balcan

8:04

06/12/2020

End-to-End Learning and Intervention in Games

Jiayang Li, Jing Yu, Yu Nie, Zhaoran Wang

3:22

18/07/2021

Model Performance Scaling with Multiple Data Sources

Tatsunori Hashimoto

Keywords: Algorithms, Supervised Learning

4:50

06/12/2021

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning

Kai Wang, Sanket Shah, Haipeng Chen, Andrew Perrault, Finale Doshi-Velez, Milind Tambe

Keywords: deep learning, optimization, reinforcement learning and planning

14:52

03/05/2021

Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots

Keywords: reinforcement learning, model-predictive control

5:09

06/12/2021

Automated Dynamic Mechanism Design

Hanrui Zhang, Vincent Conitzer

14:35

12/07/2020

Goal-Aware Prediction: Learning to Model What Matters

Suraj Nair, Silvio Savarese, Chelsea Finn

Keywords: Reinforcement Learning - Deep RL

11:16

22/09/2020

FISSA: Fusing item similarity models with self-attention networks for sequential recommendation

Jing Lin, Weike Pan, Zhong Ming

Keywords: Item Similarity Models, Sequential Recommendation, Gating Networks, Self-Attention

2:06

04/07/2020

Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation

Xinting Huang, Jianzhong Qi, Yu Sun, Rui Zhang

Keywords: Semi-Supervised Learning, generalization function, Stochastic Estimation, Dialogue optimization

11:31

06/12/2020

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Chris Pal, Derek Nowrouzezahrai

3:08