Overparameterization hurts worst-group accuracy with spurious correlations

Abstract: Increasing model capacity well beyond the point of zero training error has been observed to improve average test accuracy. However, such overparameterized models have been recently shown to obtain low worst-group accuracy --- i.e., low accuracy on atypical groups of test examples --- when there are spurious correlations that hold for the majority of training examples. We show on two image datasets that in contrast to average accuracy, overparameterization hurts worst-group accuracy in the presence of spurious correlations. We replicate this surprising phenomenon in a synthetic example and identify properties of the data distribution that induce the detrimental effect of overparameterization on worst-group accuracy. Our analysis leads us to show that a counter-intuitive approach of subsampling the majority group yields high worst-group accuracy in the overparameterized regime, whereas upweighting the minority does not. Our results suggest that when it comes to achieving high worst-group accuracy, there is a tension between using overparameterized models vs. using all of the training data.
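The abstract contrasts two ways of handling group imbalance: subsampling the majority group and upweighting the minority. The sketch below illustrates both, along with the worst-group accuracy metric. It is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`worst_group_accuracy`, `subsample_indices`, `upweights`), the choice to subsample every group down to the size of the smallest group, and the toy data are all hypothetical.

```python
import random
from collections import Counter, defaultdict


def worst_group_accuracy(y_true, y_pred, groups):
    """Return the lowest per-group accuracy (groups are hashable labels)."""
    correct = defaultdict(int)
    total = Counter(groups)
    for yt, yp, g in zip(y_true, y_pred, groups):
        correct[g] += int(yt == yp)
    return min(correct[g] / total[g] for g in total)


def subsample_indices(groups, seed=0):
    """Keep an equal number of examples per group (assumed: the smallest group's size)."""
    by_group = defaultdict(list)
    for i, g in enumerate(groups):
        by_group[g].append(i)
    n_min = min(len(idx) for idx in by_group.values())
    rng = random.Random(seed)
    kept = []
    for idx in by_group.values():
        kept.extend(rng.sample(idx, n_min))  # discards surplus majority examples
    return sorted(kept)


def upweights(groups):
    """Per-example weights inversely proportional to group size (keeps all data)."""
    counts = Counter(groups)
    return [1.0 / counts[g] for g in groups]


# Toy data (hypothetical): 8 majority examples, 2 minority examples.
groups = ["majority"] * 8 + ["minority"] * 2
y_true = [1] * 10
y_pred = [1] * 8 + [1, 0]  # one minority example is misclassified

print(worst_group_accuracy(y_true, y_pred, groups))  # 0.5, while average accuracy is 0.9
print(Counter(groups[i] for i in subsample_indices(groups)))  # 2 examples per group
print(upweights(groups)[0], upweights(groups)[-1])  # 0.125 0.5
```

Subsampling changes which examples the model sees, whereas upweighting keeps every example and only rescales its contribution to the loss; the abstract reports that only the former yields high worst-group accuracy in the overparameterized regime.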

26/04/2020

03/05/2021

Shiori Sagawa, Aditi Raghunathan, Pang Wei Koh, Percy Liang

Similar Papers

Distributionally Robust Neural Networks

Shiori Sagawa*, Pang Wei Koh*, Tatsunori B. Hashimoto, Percy Liang

Keywords: distributionally robust optimization, deep learning, robustness, generalization, regularization

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Zhao Wang, Aron Culotta

Just Train Twice: Improving Group Robustness without Training Group Information

Evan Liu, Behzad Haghgoo, Annie Chen, Aditi Raghunathan, Pang Wei Koh, Shiori Sagawa, Percy Liang, Chelsea Finn

Adversarial Filters of Dataset Biases

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula, Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Evaluating model performance under worst-case subpopulations

Mike Li, Hongseok Namkoong, Shangzhou Xia

Keywords: robustness, fairness

Overparameterisation and worst-case generalisation: friend or foe?

Aditya Krishna Menon, Ankit Singh Rawat, Sanjiv Kumar

Keywords: worst-case generalisation, overparameterisation

Examining and Combating Spurious Features under Distribution Shift

Chunting Zhou, Xuezhe Ma, Paul Michel, Graham Neubig

Keywords: deep learning, embedding and representation learning

Combining Ensembles and Data Augmentation Can Harm Your Calibration

Yeming Wen, Ghassen Jerfel, Rafael Müller, Michael W Dusenberry, Jasper Snoek, Balaji Lakshminarayanan, Dustin Tran

Keywords: uncertainty estimates, ensembles, calibration

Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets

Max Ryabinin, Andrey Malinin, Mark Gales

True Few-Shot Learning with Language Models

Ethan Perez, Douwe Kiela, Kyunghyun Cho

Keywords: language, few-shot learning

Practical and Rigorous Uncertainty Bounds for Gaussian Process Regression

Christian Fiedler, Carsten W. Scherer, Sebastian Trimpe

Old Is Gold: Redefining the Adversarially Learned One-Class Classifier Training Paradigm

Muhammad Zaigham Zaheer, Jin-Ha Lee, Marcella Astrid, Seung-Ik Lee

Keywords: anomaly detection, adversarial learning, one-class classification, autoencoder, novelty detection, outlier detection, semi-supervised learning, UCSD pedestrian2, MNIST, Caltech-256

Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition

Yangming Li, Lemao Liu, Shuming Shi

Keywords: negative sampling, unlabeled entity problem, named entity recognition

Elastic weight consolidation for better bias inoculation

James Thorne, Andreas Vlachos

Understanding the Limits of Unsupervised Domain Adaptation via Data Poisoning

Akshay Mehra, Bhavya Kailkhura, Pin-Yu Chen, Jihun Hamm

Keywords: robustness, domain adaptation

Overinterpretation reveals image classification model pathologies

Brandon Carter, Siddhartha Jain, Jonas Mueller, David Gifford

Keywords: deep learning, machine learning, robustness, adversarial robustness and security, vision, interpretability

Precise Tradeoffs in Adversarial Training for Linear Regression

Adel Javanmard, Mahdi Soltanolkotabi, Hamed Hassani

Keywords: adversarial learning and robustness, high-dimensional statistics, regression

Predictive power of nearest neighbors algorithm under random perturbation

Yue Xing, Qifan Song, Guang Cheng

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

Mamshad Nayeem Rizve, Kevin Duarte, Yogesh S Rawat, Mubarak Shah

Keywords: deep learning, calibration, uncertainty, pseudo-labeling, semi-supervised learning

Task-Robust Model-Agnostic Meta-Learning

Liam Collins, Aryan Mokhtari, Sanjay Shakkottai

Understanding and Mitigating the Tradeoff between Robustness and Accuracy

Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John Duchi, Percy Liang

Input Complexity and Out-of-distribution Detection with Likelihood-based Generative Models

Joan Serrà, David Álvarez, Vicenç Gómez, Olga Slizovskaia, José F. Núñez, Jordi Luque
