Learning Robust Decision Policies from Observational Data

06/12/2020

Learning Robust Decision Policies from Observational Data

Muhammad Osama, Dave Zachariah, Peter Stoica

Keywords:

Abstract Paper Similar Papers

Abstract: We address the problem of learning a decision policy from observational data of past decisions in contexts with features and associated outcomes. The past policy maybe unknown and in safety-critical applications, such as medical decision support, it is of interest to learn robust policies that reduce the risk of outcomes with high costs. In this paper, we develop a method for learning policies that reduce tails of the cost distribution at a specified level and, moreover, provide a statistically valid bound on the cost of each decision. These properties are valid under finite samples -- even in scenarios with uneven or no overlap between features for different decisions in the observed data -- by building on recent results in conformal prediction. The performance and statistical properties of the proposed method are illustrated using both real and synthetic data.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

13/04/2021

Evaluating model robustness and stability to dataset shift

Adarsh Subbaswamy, Roy Adams, Suchi Saria

Keywords Paper

0

0

0

0

2:44

12/07/2020

Estimation of Bounds on Potential Outcomes For Decision Making

Maggie Makar, Fredrik Johansson, John Guttag, David Sontag

Keywords Paper

Causality

0

0

0

0

13:12

13/04/2021

Learning prediction intervals for regression: Generalization and calibration

Haoxian Chen, Ziyi Huang, Henry Lam and
Huajie Qian, Haofeng Zhang

Keywords Paper

0

0

0

0

3:26

12/07/2020

Uniform Convergence of Rank-weighted Learning

Liu Leqi, Justin Khim, Adarsh Prasad, Pradeep Ravikumar

Keywords Paper

Learning Theory

0

0

0

0

13:21

12/07/2020

Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions

Omer Gottesman, Joseph Futoma, Yao Liu and
Sonali Parbhoo, Leo Celi, Emma Brunskill, Finale Doshi-Velez

Keywords Paper

Reinforcement Learning - General

0

0

0

0

11:15

12/07/2020

Performative Prediction

Juan Perdomo, Tijana Zrnic, Celestine Mendler-Dünner, University of California Moritz Hardt

Keywords Paper

Learning Theory

0

0

0

0

11:22

06/12/2021

Asymptotically Exact Error Characterization of Offline Policy Evaluation with Misspecified Linear Models

Kohei Miyaguchi

Keywords Paper

reinforcement learning and planning

0

0

0

0

9:06

06/12/2021

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

Alicia Curth, Changhee Lee, Mihaela van der Schaar

Keywords Paper

deep learning, machine learning, domain adaptation, causality

0

0

0

0

13:43

06/12/2021

Two steps to risk sensitivity

Christopher Gagne, Peter Dayan

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

8:22

14/09/2020

Counterfactual Propagation for Semi-Supervised Individual Treatment Effect Estimation

Shonosuke Harada, Hisashi Kashima

Keywords Paper

causal inference, treatment effect estimation, semi-supervised learning

0

0

0

0

11:23

14/09/2020

A General Machine Learning Framework for Survival Analysis

Andreas Bender, David Rügamer, Fabian Scheipl, Bernd Bischl

Keywords Paper

survival analysis, gradient boosting, neural networks, competing risks, multi-state models

0

0

0

0

13:37

06/12/2020

The Advantage of Conditional Meta-Learning for Biased Regularization and Fine Tuning

Giulia Denevi, Massimiliano Pontil, Carlo Ciliberto

Keywords Paper

0

0

0

0

3:24

18/07/2021

A Regret Minimization Approach to Iterative Learning Control

Naman Agarwal, Elad Hazan, Anirudha Majumdar, Karan Singh

Keywords Paper

Reinforcement Learning and Planning, Planning and Control

0

0

0

0

5:13

06/12/2021

Conformal Bayesian Computation

Edwin Fong, Chris C Holmes

Keywords Paper

machine learning

0

0

0

0

14:54

03/05/2021

Set Prediction without Imposing Structure as Conditional Density Estimation

David W Zhang, Gertjan J Burghouts, Cees G Snoek

Keywords Paper

energy based models, set prediction

0

0

0

0

5:02

03/08/2020

Estimation Rates for Sparse Linear Cyclic Causal Models

Jan-Christian Huetter, Philippe Rigollet

Keywords Paper

0

0

0

0

7:59

06/12/2020

Efficient Learning of Discrete Graphical Models

Marc Vuffray, Sidhant Misra, Andrey Lokhov

Keywords Paper

0

0

0

0

2:18

12/07/2020

Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation

Yaqi Duan, Zeyu Jia, Mengdi Wang

Keywords Paper

Learning Theory

0

0

0

0

14:10

06/12/2021

Calibrating Predictions to Decisions: A Novel Approach to Multi-Class Calibration

Shengjia Zhao, Michael Kim, Roshni Sahoo and
Tengyu Ma, Stefano Ermon

Keywords Paper

theory, deep learning, machine learning

0

0

0

0

10:37

03/08/2020

Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles

Tárik S. Salem, Helge Langseth, Heri Ramampiaro

Keywords Paper

0

0

0

0

7:45

18/07/2021

Sequential Domain Adaptation by Synthesizing Distributionally Robust Experts

Bahar Taskesen, Man Chung Yue, Jose Blanchet and
Daniel Kuhn, Viet Anh Nguyen

Keywords Paper

Optimization, Convex Optimization, Theory, Regularization

0

0

0

0

17:53

03/05/2021

Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots

Keywords Paper

reinforcement learning, model-predictive control

0

0

0

0

5:09

06/12/2020

Fair regression with Wasserstein barycenters

Evgenii Chzhen, Christophe Denis, Mohamed Hebiri and
Luca Oneto, Massimiliano Pontil

Keywords Paper

0

0

0

0

3:12

26/08/2020

Regularized Autoencoders via Relaxed Injective Probability Flow

Abhishek Kumar, Ben Poole, Kevin Murphy

Keywords Paper

0

0

0

0

14:03

06/12/2020

Belief-Dependent Macro-Action Discovery in POMDPs using the Value of Information

Genevieve Flaspohler, Nicholas Roy, John Fisher III

Keywords Paper

0

0

0

0

3:23

18/07/2021

Active Learning of Continuous-time Bayesian Networks through Interventions

Dominik Linzner, Heinz Koeppl

Keywords Paper

Probabilistic Methods, Graphical Models

0

0

0

0

5:07

06/12/2020

Learning Bounds for Risk-sensitive Learning

Jaeho Lee, Sejun Park, Jinwoo Shin

Keywords Paper

0

0

0

0

3:02

26/08/2020

A Framework for Sample Efficient Interval Estimation with Control Variates

Shengjia Zhao, Christopher Yeh, Stefano Ermon

Keywords Paper

0

0

0

0

12:01

26/04/2020

Input Complexity and Out-of-distribution Detection with Likelihood-based Generative Models

Joan Serrà, David Álvarez, Vicenç Gómez and
Olga Slizovskaia, José F. Núñez, Jordi Luque

Keywords Paper

OOD, generative models, likelihood

0

0

0

0

5:26

03/05/2021

Calibration tests beyond classification

David Widmann, Fredrik Lindsten, Dave Zachariah

Keywords Paper

uncertainty quantification, maximum mean discrepancy, integral probability metric, framework, calibration

0

0

0

0

6:05

06/12/2020

A Class of Algorithms for General Instrumental Variable Models

Niki Kilbertus, Matt Kusner, Ricardo Silva

Keywords Paper

0

0

0

0

3:13

06/12/2021

SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

Jongheon Jeong, Sejun Park, Minkyu Kim and
Heung-Chang Lee, Do-Guk Kim, Jinwoo Shin

Keywords Paper

deep learning, machine learning, robustness, adversarial robustness and security

0

0

0

0

12:23

23/07/2020

Variational learning of individual survival distributions

Zidi Xiu, Chenyang Tao, Ricardo Henao

Keywords Paper

Applied computing, Life and medical sciences, Health informatics, Computing methodologies, Modeling and simulation, Model development and analysis, Modeling methodologies

0

0

0

0

7:44

13/04/2021

Adversarially robust estimate and risk analysis in linear regression

Yue Xing, Ruizhi Zhang, Guang Cheng

Keywords Paper

0

0

0

0

3:03

06/12/2020

Adversarial Robustness of Supervised Sparse Coding

Jeremias Sulam, Ramchandran Muthukumar, Raman Arora

Keywords Paper

0

0

0

0

3:08

09/07/2020

Model-Based Reinforcement Learning with a Generative Model is Minimax Optimal

Alekh Agarwal, Sham Kakade, Lin Yang

Keywords Paper

Reinforcement learning, Sampling algorithms

0

0

0

0

15:13

18/07/2021

Outside the Echo Chamber: Optimizing the Performative Risk

John Miller, Juan Perdomo, Tijana Zrnic

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:05

06/12/2020

Distributionally Robust Federated Averaging

Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

Keywords Paper

0

0

0

0

3:11

22/11/2021

FAR: A General Framework for Attributional Robustness

Adam Ivankay, Ivan Girardi, Chiara Marchiori, Pascal Frossard

Keywords Paper

robustness, attribution robustness, adversarial attacks, explainability, attribution maps

0

0

0

0

3:00

03/05/2021

Multivariate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows

Kashif Rasul, Abdul-Saboor Sheikh, Ingmar Schuster and
Urs Bergmann, Roland Vollgraf

Keywords Paper

probabilistic multivariate forecasting, normalizing flows, attention, time series

0

0

0

0

9:59