Hidden stratification causes clinically meaningful failures in machine learning for medical imaging

23/07/2020

Hidden stratification causes clinically meaningful failures in machine learning for medical imaging

Luke Oakden-Rayner, Jared Dunnmon, Gustavo Carneiro, Christopher Re

Keywords: Computing methodologies, Machine learning

Abstract: Machine learning models for medical image analysis often suffer from poor performance on important subsets of a population that are not identified during training or testing. For example, overall performance of a cancer detection model may be high, but the model may still consistently miss a rare but aggressive cancer subtype. We refer to this problem as hidden stratification, and observe that it results from incompletely describing the meaningful variation in a dataset. While hidden stratification can substantially reduce the clinical efficacy of machine learning models, its effects remain difficult to measure. In this work, we assess the utility of several possible techniques for measuring hidden stratification effects, and characterize these effects both via synthetic experiments on the CIFAR-100 benchmark dataset and on multiple real-world medical imaging datasets. Using these measurement techniques, we find evidence that hidden stratification can occur in unidentified imaging subsets with low prevalence, low label quality, subtle distinguishing features, or spurious correlates, and that it can result in relative performance differences of over 20% on clinically important subsets. Finally, we discuss the clinical implications of our findings, and suggest that evaluation of hidden stratification should be a critical component of any machine learning deployment in medical imaging.

Hidden stratification causes clinically meaningful failures in machine learning for medical imaging

Luke Oakden-Rayner, Jared Dunnmon, Gustavo Carneiro, Christopher Re

Comments

Similar Papers

An adversarial approach for the robust classification of pneumonia from chest radiographs

Joseph D. Janizek, Gabriel Erion, Alex J. DeGrave, Su-In Lee

Keywords Abstract Paper

Applied computing, Life and medical sciences, Computing methodologies, Machine learning, Machine learning approaches, Learning latent representations, Neural networks

Constrained Optimization to Train Neural Networks on Critical and Under-Represented Classes

Sara Sangalli, Ertunc Erdil, Andeas Hötker and Olivio Donati, Ender Konukoglu

Keywords Abstract Paper

deep learning, optimization, machine learning

Addressing The False Negative Problem of Deep Learning MRI Reconstruction Models by Adversarial Attacks and Robust Training

Kaiyang Cheng, Francesco Calivá, Rutwik Shah and Misung Han, Sharmila Majumdar, Valentina Pedoia

Keywords Abstract Paper

SOS: Selective Objective Switch for Rapid Immunofluorescence Whole Slide Image Classification

Sam Maksoud, Kun Zhao, Peter Hobson and Anthony Jennings, Brian C. Lovell

Keywords Abstract Paper

whole-slide imaging, image classification, neural networks, multi-scale networks, patch-based classification, gigapixel image analysis, digital pathology

Brain Metastasis Segmentation Network Trained with Robustness to Annotations with Multiple False Negatives

Darvin Yi, Endre Grøvik, Michael Iv and Elizabeth Tong, Greg Zaharchuk, Daniel Rubin

Keywords Abstract Paper

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data

Andrew Jesson, Panagiotis Tigas, Joost van Amersfoort and Andreas Kirsch, Uri Shalit, Yarin Gal

Keywords Abstract Paper

theory, deep learning, causality, active learning

Representation learning for improved interpretability and classification accuracy of clinical factors from EEG

Garrett Honke, Irina Higgins, Nina Thigpen and Vladimir Miskovic, Katie Link, Sunny Duan, Pramod Gupta, Julia Klawohn, Greg Hajcak

Keywords Abstract Paper

representation learning, beta-VAE, depression, electroencephalography, ERP, EEG, disentanglement

Integrating Expert ODEs into Neural ODEs: Pharmacology and Disease Progression

Zhaozhi Qian, William Zame, Lucas Fleuren and Paul Elbers, Mihaela van der Schaar

Keywords Abstract Paper

deep learning, machine learning

Transfer Learning via Optimal Transportation for Integrative Cancer Patient Stratification

Ziyu Liu, Wei Shao, Jie Zhang and Min Zhang, Kun Huang

Keywords Abstract Paper

Machine Learning, Transfer, Adaptation, Multi-task Learning, Applications of Unsupervised Learning, Bio/Medicine

Deep Learning Applied to Chest X-Rays: Exploiting and Preventing Shortcuts

Sarah Jabbour, David Fouhey, Ella Kazerooni and Michael W. Sjoding, Jenna Wiens

Keywords Abstract Paper

Stochastic Segmentation Networks: Modelling Spatially Correlated Aleatoric Uncertainty

Miguel Monteiro, Loic Le Folgoc, Daniel Coelho de Castro and Nick Pawlowski, Bernardo Marques, Konstantinos Kamnitsas, Mark van der Wilk, Ben Glocker

Keywords Abstract Paper

Reliable and Trustworthy Machine Learning for Health Using Dataset Shift Detection

Chunjong Park, Anas Awadalla, Tadayoshi Kohno, Shwetak Patel

Keywords Abstract Paper

deep learning, machine learning

RareBERT: Transformer Architecture for Rare Disease Patient Identification using Administrative Claims

PKS Prakash, Srinivas Chilukuri, Nikhil Ranade, Shankar Viswanathan

Keywords Abstract Paper

Adversarial Filters of Dataset Biases

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula and Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords Abstract Paper

Deep Learning - Algorithms

Variational Disentanglement for Rare Event Modeling

Zidi Xiu, Chenyang Tao, Michael Gao and Connor Davis, Benjamin A. Goldstein, Ricardo Henao

Keywords Abstract Paper

Learning to search efficiently for causally near-optimal treatments

Samuel Håkansson, Viktor Lindblom, Omer Gottesman, Fredrik Johansson

Keywords Abstract Paper

Algorithms -> Online Learning, Reinforcement Learning and Planning -> Reinforcement Learning

Evaluating model robustness and stability to dataset shift

Adarsh Subbaswamy, Roy Adams, Suchi Saria

Keywords Abstract Paper

Robust Recursive Partitioning for Heterogeneous Treatment Effects with Uncertainty Quantification

Hyun-Suk Lee, Yao Zhang, William Zame and Cong Shen, Jang-Won Lee, Mihaela van der Schaar

Keywords Abstract Paper

Human Uncertainty Inference via Deterministic Ensemble Neural Networks

Yujin Cha, Sang Wan Lee

Keywords Abstract Paper

Regret minimization for causal inference on large treatment space

Akira Tanimoto, Tomoya Sakai, Takashi Takenouchi, Hisashi Kashima

Keywords Abstract Paper

SagaNet: A Small Sample Gated Network for Pediatric Cancer Diagnosis

Yuhan Liu, Shiliang Sun

Keywords Abstract Paper

Applications, Computational Biology and Bioinformatics

Analyzing the role of model uncertainty for electronic health records

Michael W. Dusenberry, Dustin Tran, Edward Choi and Jonas Kemp, Jeremy Nixon, Ghassen Jerfel, Katherine Heller, Andrew M. Dai

Keywords Paper

Sara Sangalli, Ertunc Erdil, Andeas Hötker and
Olivio Donati, Ender Konukoglu

Keywords Paper

Kaiyang Cheng, Francesco Calivá, Rutwik Shah and
Misung Han, Sharmila Majumdar, Valentina Pedoia

Keywords Paper

Sam Maksoud, Kun Zhao, Peter Hobson and
Anthony Jennings, Brian C. Lovell

Keywords Paper

Darvin Yi, Endre Grøvik, Michael Iv and
Elizabeth Tong, Greg Zaharchuk, Daniel Rubin

Keywords Paper

Andrew Jesson, Panagiotis Tigas, Joost van Amersfoort and
Andreas Kirsch, Uri Shalit, Yarin Gal

Keywords Paper

Garrett Honke, Irina Higgins, Nina Thigpen and
Vladimir Miskovic, Katie Link, Sunny Duan, Pramod Gupta, Julia Klawohn, Greg Hajcak

Keywords Paper

Zhaozhi Qian, William Zame, Lucas Fleuren and
Paul Elbers, Mihaela van der Schaar

Keywords Paper

Ziyu Liu, Wei Shao, Jie Zhang and
Min Zhang, Kun Huang

Keywords Paper

Sarah Jabbour, David Fouhey, Ella Kazerooni and
Michael W. Sjoding, Jenna Wiens

Keywords Paper

Miguel Monteiro, Loic Le Folgoc, Daniel Coelho de Castro and
Nick Pawlowski, Bernardo Marques, Konstantinos Kamnitsas, Mark van der Wilk, Ben Glocker

Keywords Paper

Keywords Paper

Keywords Paper

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula and
Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords Paper

Zidi Xiu, Chenyang Tao, Michael Gao and
Connor Davis, Benjamin A. Goldstein, Ricardo Henao

Keywords Paper

Keywords Paper

Keywords Paper

Hyun-Suk Lee, Yao Zhang, William Zame and
Cong Shen, Jang-Won Lee, Mihaela van der Schaar

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Michael W. Dusenberry, Dustin Tran, Edward Choi and
Jonas Kemp, Jeremy Nixon, Ghassen Jerfel, Katherine Heller, Andrew M. Dai

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jianfeng He, Xuchao Zhang, Shuo Lei and
Zhiqian Chen, Fanglan Chen, Abdulaziz Alhamadani, Bei Xiao, ChangTien Lu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Hyeryun Park, Kyungmo Kim, Jooyoung Yoon and
Seongkeun Park, Jinwook Choi

Keywords Paper

Keywords Paper

Keywords Paper

Jun Wang, Shaoguo Wen, Jianghua Yu and
Kaixing Chen, Xin Zhou, Peng Gao, Guotong Xie, Changsheng Li

Keywords Paper

XiaoTian Yu, Zunlei Feng, Yuexuan Wang and
Thomas Kwok To Li, Xiuming Zhang, Mingli Song

Keywords Paper

Keywords Paper

Keywords Paper