26/08/2020

POPCORN: Partially Observed Prediction Constrained Reinforcement Learning

Joseph Futoma, Michael Hughes, Finale Doshi-Velez

Published at AISTATS 2020 (virtual conference).
Abstract: Many medical decision-making tasks can be framed as partially observed Markov decision processes (POMDPs). However, prevailing two-stage approaches that first learn a POMDP and then solve it often fail because the model that best fits the data may not be well suited for planning. We introduce a new optimization objective that (a) produces both high-performing policies and high-quality generative models, even when some observations are irrelevant for planning, and (b) does so in the batch, off-policy settings typical in healthcare, where only retrospective data is available. We demonstrate our approach on synthetic examples and a challenging medical decision-making problem.
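For intuition, the kind of objective the abstract describes can be sketched as a constrained maximum-likelihood problem. This is a hedged reading rather than the paper's exact formulation; the threshold \(\epsilon\), the multiplier \(\lambda\), and the off-policy value estimator \(\hat{V}\) are notational assumptions:

\[
\max_{\theta} \; \log p(o_{1:T} \mid a_{1:T}, \theta)
\quad \text{subject to} \quad \hat{V}(\pi_{\theta}) \geq \epsilon ,
\]

or, in scalarized form,

\[
\max_{\theta} \; \log p(o_{1:T} \mid a_{1:T}, \theta) \; + \; \lambda \, \hat{V}(\pi_{\theta}) ,
\]

where \(\theta\) parameterizes the generative POMDP model, \(\pi_{\theta}\) is the policy obtained by planning in that model, and \(\hat{V}\) is an off-policy estimate of its value computed from the retrospective batch data. Setting \(\lambda = 0\) recovers the two-stage approach (fit first, plan second); a positive \(\lambda\) keeps the learned model from trading planning performance for fit to observations that are irrelevant for acting.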
