Robustness to Spurious Correlations via Human Annotations

12/07/2020

Robustness to Spurious Correlations via Human Annotations

Megha Srivastava, Tatsunori Hashimoto, Percy Liang

Keywords: Trustworthy Machine Learning

Abstract Paper Similar Papers

Abstract: The reliability of machine learning systems critically assumes that the associations between features and labels remain similar between training and test distributions. However, unmeasured variables, such as confounders, break this assumption---useful correlations between features and labels at training time can become useless or even harmful at test time. For example, high obesity is generally predictive for heart disease, but this relation may not hold for smokers who generally have lower rates of obesity and higher rates of heart disease. We present a framework for making models robust to spurious correlations by leveraging humans' common sense knowledge of causality. Specifically, we use human annotation to augment each training example with a potential unmeasured variable (i.e. an underweight patient with heart disease may be a smoker), which reduces the problem to a covariate shift problem. We then introduce a new distributionally robust optimization objective over unmeasured variables (UV-DRO) to control the worst-case loss over possible test-time shifts. Empirically, we show 5--10% improvements on a digit recognition task confounded by rotation, and 1.5--5% gains on the task of predicting arrests from NYPD Police Stops confounded by location.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

Posterior Calibrated Training on Sentence Classification Tasks

Taehee Jung, Dongyeop Kang, Hua Cheng and
Lucas Mentch, Thomas Schaaf

Keywords Paper

Sentence Tasks, classifications, xSLUE, classification tasks

0

0

0

0

7:00

06/07/2020

Brain Metastasis Segmentation Network Trained with Robustness to Annotations with Multiple False Negatives

Darvin Yi, Endre Grøvik, Michael Iv and
Elizabeth Tong, Greg Zaharchuk, Daniel Rubin

Keywords Paper

0

0

0

0

5:00

26/04/2020

Distributionally Robust Neural Networks

Shiori Sagawa, Pang Wei Koh, Tatsunori B. Hashimoto, Percy Liang

Keywords Paper

distributionally robust optimization, deep learning, robustness, generalization, regularization

0

0

0

1

5:22

06/12/2020

Task-Robust Model-Agnostic Meta-Learning

Liam Collins, Aryan Mokhtari, Sanjay Shakkottai

Keywords Paper

0

0

0

0

3:17

18/07/2021

Examining and Combating Spurious Features under Distribution Shift

Chunting Zhou, Xuezhe Ma, Paul Michel, Graham Neubig

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

5:53

12/07/2020

Overparameterization hurts worst-group accuracy with spurious correlations

Shiori Sagawa, aditi raghunathan, Pang Wei Koh, Percy Liang

Keywords Paper

Trustworthy Machine Learning

0

0

0

0

15:09

02/02/2021

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Zhao Wang, Aron Culotta

Keywords Paper

0

0

0

0

17:39

12/07/2020

Adversarial Filters of Dataset Biases

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula and
Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

15:25

06/12/2020

Self-Adaptive Training: beyond Empirical Risk Minimization

Lang Huang, Chao Zhang, Hongyang Zhang

Keywords Paper

Deep Learning -> Generative Models, Algorithms -> Semi-Supervised Learning

0

0

0

0

3:23

12/07/2020

Understanding and Mitigating the Tradeoff between Robustness and Accuracy

Aditi Raghunathan, Sang Michael Xie, Fanny Yang and
John Duchi, Percy Liang

Keywords Paper

Adversarial Examples

0

0

0

0

14:35

06/12/2021

Constrained Optimization to Train Neural Networks on Critical and Under-Represented Classes

Sara Sangalli, Ertunc Erdil, Andeas Hötker and
Olivio Donati, Ender Konukoglu

Keywords Paper

deep learning, optimization, machine learning

0

0

0

0

14:04

18/07/2021

Fair Selective Classification Via Sufficiency

Joshua Lee, Yuheng Bu, Deepta Rajan and
Prasanna Sattigeri, Rameswar Panda, Subhro Das, Gregory Wornell

Keywords Paper

Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

0

0

0

0

18:20

19/10/2020

Fairness-aware learning with prejudice free representations

Ramanujam Madhavan, Mohit Wadhwa

Keywords Paper

fairness, prejudice, privacy, interpretability

0

0

0

0

7:04

03/05/2021

Mirostat: A Neural Text Decoding Algorithm That Directly Controls Perplexity

Sourya Basu, Govardana Sachithanandam Ramachandran, Nitish Shirish Keskar, Lav R Varshney

Keywords Paper

cross-entropy, incoherence, repetitions, sampling algorithms, Neural text decoding

0

0

0

0

5:07

12/07/2020

Invertible generative models for inverse problems: mitigating representation error and dataset bias

Muhammad Asim, Max Daniels, Oscar Leong and
Paul Hand, Ali Ahmed

Keywords Paper

Optimization - General

0

0

0

1

14:44

06/12/2021

Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets

Max Ryabinin, Andrey Malinin, Mark Gales

Keywords Paper

machine learning

0

0

0

0

12:36

03/05/2021

Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Hao Cheng, Zhaowei Zhu, Xingyu Li and
Yifei Gong, Xing Sun, Yang Liu

Keywords Paper

deep neural networks., instance-based label noise, Learning with noisy labels

0

0

0

0

5:18

06/12/2020

Sampling-Decomposable Generative Adversarial Recommender

Binbin Jin, Defu Lian, Zheng Liu and
Qi Liu, Jianhui Ma, Xing Xie, Enhong Chen

Keywords Paper

0

0

0

0

3:17

02/02/2021

Learning Precise Temporal Point Event Detection with Misaligned Labels

Julien Schroeter, Kirill Sidorov, David Marshall

Keywords Paper

0

0

0

0

21:24

18/07/2021

Just Train Twice: Improving Group Robustness without Training Group Information

Evan Liu, Behzad Haghgoo, Annie Chen and
Aditi Raghunathan, Pang Wei Koh, Shiori Sagawa, Percy Liang, Chelsea Finn

Keywords Paper

Deep Learning

0

0

0

0

20:58

06/12/2021

Evaluating model performance under worst-case subpopulations

Mike Li, Hongseok Namkoong, Shangzhou Xia

Keywords Paper

robustness, fairness

0

0

0

0

5:45

06/12/2021

Uncertainty Quantification and Deep Ensembles

Rahul Rahaman, alexandre thiery

Keywords Paper

deep learning, machine learning

0

0

0

0

14:40

06/12/2021

Improving black-box optimization in VAE latent space using decoder uncertainty

Pascal Notin, José Miguel Hernández-Lobato, Yarin Gal

Keywords Paper

optimization, robustness, generative model

0

0

0

0

11:11

14/09/2020

Estimating Precisions for Multiple Binary Classifiers Under Limited Samples

Rahul Tripathi, Srinivasan Jagannathan, Balaji Dhamodharaswamy

Keywords Paper

model precision, crowd-sourcing, sampling

0

0

0

0

16:08

03/05/2021

No Cost Likelihood Manipulation at Test Time for Making Better Mistakes in Deep Networks

Shyamgopal Karthik, Ameya Prabhu, Puneet Dokania, Vineet Gandhi

Keywords Paper

Conditional Risk Minimization, Hierarchy-Aware Classification, Post-Hoc Correction

0

0

0

0

4:53

03/05/2021

Selective Classification Can Magnify Disparities Across Groups

Erik Jones, Shiori Sagawa, Pang Wei Koh and
Ananya Kumar, Percy Liang

Keywords Paper

log-concavity, group disparities, selective classification, robustness

0

0

0

0

5:24

16/11/2020

Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data

Shachar Rosenman, Alon Jacovi, Yoav Goldberg

Keywords Paper

data process, re collection, sota models, tacred

0

0

0

0

5:55

18/07/2021

Wasserstein Distributional Normalization For Robust Distributional Certification of Noisy Labeled Data

Sung Woo Park, Junseok Kwon

Keywords Paper

Deep Learning, Generative Models, Algorithms, Representation Learning; Optimization, Submodular Optimization, Probabilistic Methods, Robust statistics

0

0

0

0

5:20

03/05/2021

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Kanil Patel, William H Beluch, Bin Yang and
Michael Pfeiffer, Dan Zhang

Keywords Paper

deep neural networks, histogram binning, post-hoc calibration, uncertainty calibration, mutual information

0

0

0

0

5:13

06/12/2021

Risk Bounds for Over-parameterized Maximum Margin Classification on Sub-Gaussian Mixtures

Yuan Cao, Quanquan Gu, Mikhail Belkin

Keywords Paper

deep learning, machine learning

0

0

0

0

13:47

06/12/2021

Towards Deeper Deep Reinforcement Learning with Spectral Normalization

Nils Bjorck, Carla Gomes, Kilian Weinberger

Keywords Paper

reinforcement learning and planning, vision, language

0

0

0

0

9:28

14/06/2020

HyperSTAR: Task-Aware Hyperparameters for Deep Networks

Gaurav Mittal, Chang Liu, Nikolaos Karianakis and
Victor Fragoso, Mei Chen, Yun Fu

Keywords Paper

auto ml, hyperparameter optimization, meta learning, task aware, hyperband, hyperparameters, warm start, image classication, resnet, shufflenet

0

0

0

0

4:58

06/12/2020

Consistency Regularization for Certified Robustness of Smoothed Classifiers

Jongheon Jeong, Jinwoo Shin

Keywords Paper

0

0

0

0

3:16

02/02/2021

Learning from Noisy Labels with Complementary Loss Functions

Deng-Bao Wang, Yong Wen, Lujia Pan, Min-Ling Zhang

Keywords Paper

0

0

0

0

14:00

04/08/2021

Robust learning under clean-label attack

Avrim Blum, Steve Hanneke, Jian Qian, Han Shao

Keywords Paper

0

0

0

0

12:30

26/04/2020

On Generalization Error Bounds of Noisy Gradient Methods for Non-Convex Learning

Jian Li, Xuanyuan Luo, Mingda Qiao

Keywords Paper

learning theory, generalization, nonconvex learning, stochastic gradient descent, Langevin dynamics

0

0

0

0

4:50

06/12/2021

Data Augmentation Can Improve Robustness

Sylvestre-Alvise Rebuffi, Sven Gowal, Dan Andrei Calian and
Florian Stimberg, Olivia Wiles, Timothy A Mann

Keywords Paper

robustness, adversarial robustness and security

0

0

0

0

8:06

25/07/2020

Sampler design for implicit feedback data by noisy-label robust learning

Wenhui Yu, Zheng Qin

Keywords Paper

collaborative filtering, bayesian point-wise optimization, noisy-label robust learning, negative sampling, item recommendation

0

0

0

0

12:25

06/12/2021

Boosted CVaR Classification

Runtian Zhai, Chen Dan, Arun Suggala and
J. Zico Kolter, Pradeep Ravikumar

Keywords Paper

machine learning, fairness

0

0

0

0

14:10

13/04/2021

Predictive power of nearest neighbors algorithm under random perturbation

Yue Xing, Qifan Song, Guang Cheng

Keywords Paper

0

0

0

0

3:02