Confound removal and normalization in practice: a neuroimaging based sex prediction case study

Abstract: Machine learning (ML) methods are increasingly being used to predict pathologies and biological traits using neuroimaging data. Here controlling for confounds is essential to get unbiased estimates of generalization performance and to identify the features driving predictions. However, a systematic evaluation of the advantages and disadvantages of available alternatives is lacking. This makes it difficult to compare results across studies and to build deployment quality models. Here, we evaluated two commonly used confound removal schemes–whole data confound regression (WDCR) and cross-validated confound regression (CVCR)–to understand their effectiveness and biases induced in generalization performance estimation. Additionally, we study the interaction of the confound removal schemes with Z-score normalization, a common practice in ML modelling. We applied eight combinations of confound removal schemes and normalization (pipelines) to decode sex from resting-state functional MRI (rfMRI) data while controlling for two confounds, brain size and age. We show that both schemes effectively remove linear univariate and multivariate confounding effects resulting in reduced model performance with CVCR providing better generalization estimates, i.e., closer to out-of-sample performance than WDCR. We found no effect of normalizing before or after confound removal. In the presence of dataset and confound shift, four tested confound removal procedures yielded mixed results, raising new questions. We conclude that CVCR is a better method to control for confounding effects in neuroimaging studies. We believe that our in-depth analyses shed light on choices associated with confound removal and hope that it generates more interest in this problem instrumental to numerous applications.

Confound removal and normalization in practice: a neuroimaging based sex prediction case study

Shammi More, Forschungszentrum Jülich, Jülich, Germany, Simon Eickhoff, Forschungszentrum Jülich, Jülich, Germany, Julian Caspers, Kaustubh Patil, Forschungszentrum Jülich, Jülich, Germany

Comments

Similar Papers

NestedVAE: Isolating Common Factors via Weak Supervision

Matthew J. Vowels, Necati Cihan Camgöz, Richard Bowden

Keywords Abstract Paper

fairness, bias, representation learning, invariance, vae, variational, weakly supervised, information bottleneck

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data

Andrew Jesson, Panagiotis Tigas, Joost van Amersfoort and Andreas Kirsch, Uri Shalit, Yarin Gal

Keywords Abstract Paper

theory, deep learning, causality, active learning

Diagnose Like A Pathologist: Weakly-Supervised Pathologist-Tree Network for Slide-Level Immunohistochemical Scoring

Zhen Chen, Jun Zhang, Shuanlong Che and Junzhou Huang, Xiao Han, Yixuan Yuan

Keywords Abstract Paper

Removing Inter-Experimental Variability from Functional Data in Systems Neuroscience

Dominic Gonschorek, Larissa Höfling, Klaudia P. Szatko and Katrin Franke, Timm Schubert, Benjamin Dunn, Philipp Berens, David Klindt, Thomas Euler

Keywords Abstract Paper

optimization, machine learning, neuroscience, domain adaptation

Constrained Optimization to Train Neural Networks on Critical and Under-Represented Classes

Sara Sangalli, Ertunc Erdil, Andeas Hötker and Olivio Donati, Ender Konukoglu

Keywords Abstract Paper

deep learning, optimization, machine learning

Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability

Suraj Srinivas, François Fleuret

Keywords Abstract Paper

Interpretability, saliency maps, score-matching

Adversarial Filters of Dataset Biases

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula and Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords Abstract Paper

Deep Learning - Algorithms

Explaining Convolutional Neural Networks through Attribution-Based Input Sampling and Block-Wise Feature Aggregation

Sam Sattarzadeh, Mahesh Sudhakar, Anthony Lem and Shervin Mehryar, Konstantinos N Plataniotis, Jongseong Jang, Hyunwoo Kim, Yeonjeong Jeong, Sangmin Lee, Kyunghoon Bae

Keywords Abstract Paper

An extensive investigation of machine learning techniques for sleep apnea screening

Jose F. Rodrigues, Jean-Louis Pepin, Lorraine Goeuriot, Sihem Amer-Yahia

Keywords Abstract Paper

machine learning, naive bayes, obstructive sleep apnea screening, decision trees

Atlas-aware ConvNet for accurate yet robust anatomical segmentation

Yuan Liang, Weinan Song, Jiawei Yang and Liang Qiu, Kun Wang, Lei He

Keywords Abstract Paper

Deep Direct Likelihood Knockoffs

Mukund Sudarshan, Wesley Tansey, Rajesh Ranganath

Keywords Abstract Paper

Disentangling Human Error from Ground Truth in Segmentation of Medical Images

Le Zhang, Ryu Tanno, Moucheng Xu and Chen Jin, Joseph Jacob, Olga Cicarrelli, Frederik Barkhof, Daniel Alexander

Keywords Abstract Paper

Semi-supervised Medical Image Segmentation through Dual-task Consistency

Xiangde Luo, Jieneng Chen, Tao Song, Guotai Wang

Keywords Abstract Paper

Towards a better understanding of label smoothing in neural machine translation

Yingbo Gao, Weiyue Wang, Christian Herold and Zijian Yang, Hermann Ney

Keywords Abstract Paper

Sparse encoding for more-interpretable feature-selecting representations in probabilistic matrix factorization

Joshua Chang, Patrick A Fletcher, Jungmin Han and Ted Chang, Shashaank Vattikuti, Bart Desmet, Ayah Zirikly, Carson Chow

Keywords Abstract Paper

bayesian, interpretability, generalized additive model, sparse coding, factor analysis, probabilistic matrix factorization, poisson matrix factorization

DKMA-ULD: Domain Knowledge augmented Multi-head Attention based Robust Universal Lesion Detection

Manu Sheoran, Meghal Dani, Monika Sharma, Lovekesh Vig

Keywords Abstract Paper

Universal lesion detection, Multi-intensity images, Custom anchors, DeepLesion, Self-attention, CT scans, HU windows

Representation Learning With Statistical Independence to Mitigate Bias

Ehsan Adeli, Qingyu Zhao, Adolf Pfefferbaum and Edith V. Sullivan, Li Fei-Fei, Juan Carlos Niebles, Kilian M. Pohl

Keywords Abstract Paper

Deep Learning Applied to Chest X-Rays: Exploiting and Preventing Shortcuts

Sarah Jabbour, David Fouhey, Ella Kazerooni and Michael W. Sjoding, Jenna Wiens

Keywords Abstract Paper

Experimental design for MRI by greedy policy search

Tim Bakker, Herke van Hoof, Max Welling

Keywords Abstract Paper

Latent-optimization based Disease-aware Image Editing for Medical Image Augmentation

Aakash saboo, Prashnna K Gyawali, Ankit Shukla and Manoj Sharma, Neeraj Jain, Linwei Wang

Keywords Abstract Paper

Latent optimization, StyleGAN, Image Editing, Chest X-ray, Image manipulation, constrained optimization, Disease progression, Disease quantification, Manifold, Latent space traversal

Temporal Positive-unlabeled Learning for Biomedical Hypothesis Generation via Risk Estimation

Uchenna Akujuobi, Jun Chen, Mohamed Elhoseiny and Michael Spranger, Xiangliang Zhang

Keywords Abstract Paper

Modeling Shared responses in Neuroimaging Studies through MultiView ICA

Hugo Richard, Luigi Gresele, Aapo Hyvarinen and Bertrand Thirion, Alexandre Gramfort, Pierre Ablin

Keywords Abstract Paper

Keywords Paper

Andrew Jesson, Panagiotis Tigas, Joost van Amersfoort and
Andreas Kirsch, Uri Shalit, Yarin Gal

Keywords Paper

Zhen Chen, Jun Zhang, Shuanlong Che and
Junzhou Huang, Xiao Han, Yixuan Yuan

Keywords Paper

Dominic Gonschorek, Larissa Höfling, Klaudia P. Szatko and
Katrin Franke, Timm Schubert, Benjamin Dunn, Philipp Berens, David Klindt, Thomas Euler

Keywords Paper

Sara Sangalli, Ertunc Erdil, Andeas Hötker and
Olivio Donati, Ender Konukoglu

Keywords Paper

Keywords Paper

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula and
Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords Paper

Sam Sattarzadeh, Mahesh Sudhakar, Anthony Lem and
Shervin Mehryar, Konstantinos N Plataniotis, Jongseong Jang, Hyunwoo Kim, Yeonjeong Jeong, Sangmin Lee, Kyunghoon Bae

Keywords Paper

Keywords Paper

Yuan Liang, Weinan Song, Jiawei Yang and
Liang Qiu, Kun Wang, Lei He

Keywords Paper

Keywords Paper

Le Zhang, Ryu Tanno, Moucheng Xu and
Chen Jin, Joseph Jacob, Olga Cicarrelli, Frederik Barkhof, Daniel Alexander

Keywords Paper

Keywords Paper

Yingbo Gao, Weiyue Wang, Christian Herold and
Zijian Yang, Hermann Ney

Keywords Paper

Joshua Chang, Patrick A Fletcher, Jungmin Han and
Ted Chang, Shashaank Vattikuti, Bart Desmet, Ayah Zirikly, Carson Chow

Keywords Paper

Keywords Paper

Ehsan Adeli, Qingyu Zhao, Adolf Pfefferbaum and
Edith V. Sullivan, Li Fei-Fei, Juan Carlos Niebles, Kilian M. Pohl

Keywords Paper

Sarah Jabbour, David Fouhey, Ella Kazerooni and
Michael W. Sjoding, Jenna Wiens

Keywords Paper

Keywords Paper

Aakash saboo, Prashnna K Gyawali, Ankit Shukla and
Manoj Sharma, Neeraj Jain, Linwei Wang

Keywords Paper

Uchenna Akujuobi, Jun Chen, Mohamed Elhoseiny and
Michael Spranger, Xiangliang Zhang

Keywords Paper

Hugo Richard, Luigi Gresele, Aapo Hyvarinen and
Bertrand Thirion, Alexandre Gramfort, Pierre Ablin

Keywords Paper

Keywords Paper

Tony Yousefnezhad, Alessandro Selvitella, Daoqiang Zhang and
Andrew Greenshaw, Russell Greiner

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Chaithanya Kumar Mummadi, Ranjitha Subramaniam, Robin Hutmacher and
Julien Vitay, Volker Fischer, Jan Hendrik Metzen

Keywords Paper

Yuzhe Yang, Kaiwen Zha, YINGCONG CHEN and
Hao Wang, Dina Katabi

Keywords Paper

Yifan Zhang, Bryan Hooi, Dapeng Hu and
Jian Liang, Jiashi Feng

Keywords Paper

Xuan Liao, Wenhao Li, Qisen Xu and
Xiangfeng Wang, Bo Jin, Xiaoyun Zhang, Yanfeng Wang, Ya Zhang

Keywords Paper

Huaxiu Yao, Ying Wei, Long-Kai Huang and
Ding Xue, Junzhou Huang, Zhenhui (Jessie) Li

Keywords Paper

Zhuchen Shao, Hao Bian, Yang Chen and
Yifeng Wang, Jian Zhang, Xiangyang Ji, yongbing zhang

Keywords Paper

Chun-Mei Feng, Zhanyuan Yang, Geng Chen and
Yong Xu, Ling Shao

Keywords Paper

Keywords Paper