Design of Experiments for Stochastic Contextual Linear Bandits

06/12/2021

Design of Experiments for Stochastic Contextual Linear Bandits

Andrea Zanette, Kefan Dong, Jonathan N Lee, Emma Brunskill

Keywords: reinforcement learning and planning, bandits

Abstract Paper Similar Papers

Abstract: In the stochastic linear contextual bandit setting there exist several minimax procedures for exploration with policies that are reactive to the data being acquired. In practice, there can be a significant engineering overhead to deploy these algorithms, especially when the dataset is collected in a distributed fashion or when a human in the loop is needed to implement a different policy. Exploring with a single non-reactive policy is beneficial in such cases. Assuming some batch contexts are available, we design a single stochastic policy to collect a good dataset from which a near-optimal policy can be extracted. We present a theoretical analysis as well as numerical experiments on both synthetic and real-world datasets.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng , Ben Letham, Hongzi Mao, Eytan Bakshy

Keywords Paper

0

0

0

0

3:29

06/12/2021

Control Variates for Slate Off-Policy Evaluation

Nikos Vlassis, Ashok Chandrashekar, Fernando Amat, Nathan Kallus

Keywords Paper

optimization, bandits

0

0

0

0

12:25

26/08/2020

Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration

Matteo Papini, Andrea Battistello, Marcello Restelli

Keywords Paper

0

0

0

0

12:47

12/07/2020

A distributional view on multi objective policy optimization

Abbas Abdolmaleki, Sandy Huang, Leonard Hasenclever and
Michael Neunert, Martina Zambelli, Murilo Martins, Francis Song, Nicolas Heess, Raia Hadsell, Martin Riedmiller

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:04

18/07/2021

Discovering symbolic policies with deep reinforcement learning

Mikel Landajuela Larma, Brenden Petersen, Sookyung Kim and
Claudio Santiago, Ruben Glatt, Nathan Mundhenk, Jacob Pettit, Daniel Faissol

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

4:55

02/02/2021

Progression Heuristics for Planning with Probabilistic LTL Constraints

Ian Mallett, Sylvie Thiebaux, Felipe Trevizan

Keywords Paper

0

0

0

0

18:23

06/12/2021

Automated Dynamic Mechanism Design

Hanrui Zhang, Vincent Conitzer

Keywords Paper

0

0

0

0

14:35

06/12/2020

Online Bayesian Goal Inference for Boundedly Rational Planning Agents

Tan Zhi-Xuan, Jordyn Mann, Tom Silver and
Josh Tenenbaum, Vikash Mansinghka

Keywords Paper

0

0

0

0

3:23

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer and
Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords Paper

Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

0

0

0

0

5:05

18/07/2021

Partially Observed Exchangeable Modeling

Yang Li, Junier Oliva

Keywords Paper

Deep Learning, Generative Models, Algorithms, Sparsity and Compressed Sensing, Algorithms, Large Scale Learning; Algorithms, Regression

0

0

0

0

5:47

06/12/2021

Using Random Effects to Account for High-Cardinality Categorical Features and Repeated Measures in Deep Neural Networks

Giora Simchoni, Saharon Rosset

Keywords Paper

deep learning, machine learning, vision

0

0

0

0

13:33

06/12/2021

Efficient Online Estimation of Causal Effects by Deciding What to Observe

Shantanu Gupta, Zachary Lipton, David Childers

Keywords Paper

reinforcement learning and planning, graph learning, causality

0

0

0

0

14:18

03/05/2021

Enforcing robust control guarantees within neural network policies

Priya Donti, Melrose Roderick, Mahyar Fazlyab, Zico Kolter

Keywords Paper

reinforcement learning, differentiable optimization, robust control

0

0

0

1

5:09

18/07/2021

Dataset Dynamics via Gradient Flows in Probability Space

David Alvarez-Melis, Nicolo Fusi

Keywords Paper

Algorithms, Optimal Transport

0

0

0

1

5:11

18/07/2021

Active Learning of Continuous-time Bayesian Networks through Interventions

Dominik Linzner, Heinz Koeppl

Keywords Paper

Probabilistic Methods, Graphical Models

0

0

0

0

5:07

26/04/2020

Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning

Ali Mousavi, Lihong Li, Qiang Liu, Denny Zhou

Keywords Paper

reinforcement learning, off-policy estimation, importance sampling, propensity score

0

0

0

0

5:25

06/12/2020

Counterfactual Predictions under Runtime Confounding

Amanda Coston, Edward Kennedy, Alexandra Chouldechova

Keywords Paper

0

0

0

0

3:14

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords Paper

reinforcement learning and planning, robustness, representation learning

0

0

0

0

12:24

18/07/2021

Beyond the Pareto Efficient Frontier: Constraint Active Search for Multiobjective Experimental Design

Gustavo Malkomes, Harvey Cheng, Eric Lee, Michael McCourt

Keywords Paper

Probabilistic Methods, Gaussian Processes and Bayesian non-parametrics

0

0

0

0

5:20

18/07/2021

Provably Correct Optimization and Exploration with Non-linear Policies

Fei Feng, Wotao Yin, Alekh Agarwal, Lin Yang

Keywords Paper

Deep Learning, Adversarial Networks, Applications, Fairness, Accountability, and Transparency, Theory, RL, Decisions and Control Theory

0

0

0

0

5:03

26/10/2020

A correctness result for synthesizing plans with loops in stochastic domains

Laszlo Treszkai, Vaishak Belle

Keywords Paper

Finite-state controllers, plans with loops, stochastic domains, soundness and completeness results

0

0

0

0

12:43

03/05/2021

Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs

Cheng Wang, Carolin Lawrence, Mathias Niepert

Keywords Paper

calibration, uncertainty estimation, RNN

0

0

0

0

4:25

06/12/2021

Improving Compositionality of Neural Networks by Decoding Representations to Inputs

Mike Wu, Noah Goodman, Stefano Ermon

Keywords Paper

deep learning, machine learning, adversarial robustness and security, generative model

0

0

0

0

12:36

06/12/2021

Stateful Strategic Regression

Keegan Harris, Hoda Heidari, Steven Wu

Keywords Paper

optimization

0

0

0

0

14:02

06/12/2020

Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method

Qi Zhou, Yufei Kuang, Zherui Qiu and
Houqiang Li, Jie Wang

Keywords Paper

0

0

0

0

3:10

06/12/2020

Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation

Han Zhao, Jianfeng Chi, Yuan Tian, Geoffrey Gordon

Keywords Paper

0

0

0

0

3:17

26/08/2020

Finite-Time Analysis of Decentralized Temporal-Difference Learning with Linear Function Approximation

Jun Sun, Gang Wang, Georgios B. Giannakis and
Qinmin Yang, Zaiyue Yang

Keywords Paper

0

0

0

0

17:07

06/12/2021

An Information-theoretic Approach to Distribution Shifts

Marco Federici, Ryota Tomioka, Patrick Forré

Keywords Paper

theory, deep learning, machine learning, graph learning, domain adaptation, representation learning

0

0

0

0

9:50

06/12/2020

Beyond Individualized Recourse: Interpretable and Interactive Summaries of Actionable Recourses

Kai Rawal, Himabindu Lakkaraju

Keywords Paper

0

0

0

0

3:31

18/07/2021

Mandoline: Model Evaluation under Distribution Shift

Mayee Chen, Karan Goel, Nimit Sohoni and
Fait Poms, Kayvon Fatahalian, Christopher Re

Keywords Paper

Algorithms, Others

0

0

0

1

5:49

06/12/2020

Normalizing Kalman Filters for Multivariate Time Series Analysis

Emmanuel de Bézenac, Syama Sundar Rangapuram, Konstantinos Benidis and
Michael Bohlke-Schneider, Richard Kurle, Lorenzo Stella, Hilaf Hasson, Patrick Gallinari, Tim Januschowski

Keywords Paper

0

0

0

0

3:19

06/12/2020

General Transportability of Soft Interventions: Completeness Results

Juan Correa, Elias Bareinboim

Keywords Paper

0

0

0

0

3:23

06/12/2020

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL

Saurabh Kumar, Aviral Kumar, Sergey Levine, Chelsea Finn

Keywords Paper

0

0

0

0

3:24

06/12/2020

Posterior Re-calibration for Imbalanced Datasets

Junjiao Tian, Yen-Cheng Liu, Nathaniel Glaser and
Yen-Chang Hsu, Zsolt Kira

Keywords Paper

Algorithms -> Few-Shot Learning, Applications -> Computer Vision

0

0

0

0

3:23

03/05/2021

Learning perturbation sets for robust machine learning

Eric Wong, Zico Kolter

Keywords Paper

conditional variational autoencoder, adversarial examples, perturbation sets, robust machine learning

0

1

0

0

5:06

26/08/2020

Validation of Approximate Likelihood and Emulator Models for Computationally Intensive Simulations

Niccolo Dalmasso, Ann Lee, Rafael Izbicki and
Taylor Pospisil, Ilmun Kim, Chieh-An Lin

Keywords Paper

0

0

0

0

14:06

06/12/2021

Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies

Tim Seyde, Igor Gilitschenski, Wilko Schwarting and
Bartolomeo Stellato, Martin Riedmiller, Markus Wulfmeier, Daniela Rus

Keywords Paper

reinforcement learning and planning

0

0

0

0

6:48

06/12/2021

Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning

Hiroki Furuta, Tadashi Kozuno, Tatsuya Matsushima and
Yutaka Matsuo, Shixiang (Shane) Gu

Keywords Paper

reinforcement learning and planning

0

0

0

0

10:00

02/02/2021

Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs

Aria HasanzadeZonuzy, Archana Bura, Dileep Kalathil, Srinivas Shakkottai

Keywords Paper

0

0

0

0

17:18

03/05/2021

Benchmarks for Deep Off-Policy Evaluation

Justin Fu, Mohammad Norouzi, Ofir Nachum and
George Tucker, ziyu wang, Alexander Novikov, Sherry Yang, Michael Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Paine

Keywords Paper

reinforcement learning, benchmarks, off-policy evaluation

0

0

0

0

10:05