Simpson's Bias in NLP Training

02/02/2021

Simpson's Bias in NLP Training

Fei Yuan, Longtu Zhang, Huang Bojun, Yaobo Liang

Keywords:

Abstract Paper Similar Papers

Abstract: In most machine learning tasks, we evaluate a model M on a given data population S by measuring a population-level metric F(S;M). Examples of such evaluation metric F include precision/recall for (binary) recognition, the F1 score for multi-class classification, and the BLEU metric for language generation. On the other hand, the model M is trained by optimizing a sample-level loss G(S_t; M) at each learning step t, where S_t is a subset of S (a.k.a. the mini-batch). Popular choices of G include cross-entropy loss, the Dice loss, and sentence-level BLEU scores. A fundamental assumption behind this paradigm is that the mean value of the sample-level loss G, if averaged over all possible samples, should effectively represent the population-level metric F of the task, such as, that E[ G(S_t; M) ] ~ F(S; M). In this paper, we systematically investigate the above assumption in several NLP tasks. We show, both theoretically and experimentally, that some popular designs of the sample-level loss G may be inconsistent with the true population-level metric F of the task, so that models trained to optimize the former can be substantially sub-optimal to the latter, a phenomenon we call it, Simpson's bias, due to its deep connections with the classic paradox known as Simpson's reversal paradox in statistics and social sciences.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38948775

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Certifying Robustness to Programmable Data Bias in Decision Trees

Anna Meyer, Aws Albarghouthi, Loris D'Antoni

Keywords Paper

robustness, fairness

0

0

0

0

13:03

13/04/2021

Good classifiers are abundant in the interpolating regime

Ryan Theisen, Jason Klusowski, Michael Mahoney

Keywords Paper

0

0

0

0

2:59

09/07/2020

Bessel Smoothing and Multi-Distribution Property Estimation

Yi Hao, Ping Li

Keywords Paper

Distribution learning/testing, High-dimensional statistics, Information theory

0

0

0

0

14:48

06/12/2021

Robust Generalization despite Distribution Shift via Minimum Discriminating Information

Tobias Sutter, Andreas Krause, Daniel Kuhn

Keywords Paper

optimization, machine learning

0

0

0

0

15:05

03/05/2021

Why resampling outperforms reweighting for correcting sampling bias with stochastic gradients

Jing An, Lexing Ying, Yuhua Zhu

Keywords Paper

stability, stochastic asymptotics, resampling, reweighting, biased sampling

0

0

0

0

5:31

06/12/2020

Learning by Minimizing the Sum of Ranked Range

Shu Hu, Yiming Ying, xin wang, Siwei Lyu

Keywords Paper

Algorithms -> Sparsity and Compressed Sensing, Theory -> Frequentist Statistics

0

0

0

0

3:12

06/12/2020

Exact expressions for double descent and implicit regularization via surrogate random design

Michal Derezinski, Feynman Liang, Michael W Mahoney

Keywords Paper

0

0

0

0

3:24

06/12/2021

Generalization of Model-Agnostic Meta-Learning Algorithms: Recurring and Unseen Tasks

Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

Keywords Paper

theory, optimization, meta learning

0

0

0

0

14:42

18/07/2021

Rate-Distortion Analysis of Minimum Excess Risk in Bayesian Learning

Hassan Hafez-Kolahi, Behrad Moniri, Shohreh Kasaei, Mahdieh Soleymani Baghshah

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

14:44

06/12/2020

Field-wise Learning for Multi-field Categorical Data

Zhibin Li, Jian Zhang, Yongshun Gong and
Yazhou Yao, Qiang Wu

Keywords Paper

0

0

0

0

3:29

06/12/2021

Learning Gaussian Mixtures with Generalized Linear Models: Precise Asymptotics in High-dimensions

Bruno Loureiro, Gabriele Sicuro, Cedric Gerbelot and
Alessandro Pacco, Florent Krzakala, Lenka Zdeborová

Keywords Paper

theory, machine learning

0

0

0

0

14:35

06/12/2021

Robustness between the worst and average case

Leslie Rice, Anna Bair, Huan Zhang, J. Zico Kolter

Keywords Paper

machine learning, robustness, adversarial robustness and security, generative model

0

0

0

0

10:46

06/12/2020

Minimax Estimation of Conditional Moment Models

Nishanth Dikkala, Greg Lewis, Lester Mackey, Vasilis Syrgkanis

Keywords Paper

0

0

0

0

3:04

09/07/2020

Estimating Principal Components under Adversarial Perturbations

Pranjal Awasthi, Xue Chen, Aravindan Vijayaraghavan

Keywords Paper

Unsupervised and semi-supervised learning, Adversarial learning and robustness

0

0

0

0

15:40

18/07/2021

Optimizing Black-box Metrics with Iterative Example Weighting

Gaurush Hiranandani, Jatin Mathur, Harikrishna Narasimhan and
Mahdi Milani Fard, Sanmi Koyejo

Keywords Paper

Algorithms, Supervised Learning

0

0

0

0

5:49

18/07/2021

Learning from Biased Data: A Semi-Parametric Approach

Patrice Bertail, Stephan Clémençon, Yannick Guyonvarch, Nathan NOIRY

Keywords Paper

Applications, Fairness, Accountability, and Transparency, Theory, Algorithms, Clustering; Applications, Hardware and Systems; Applications, Privacy, Anonymity, and Security

0

0

0

0

5:09

18/07/2021

Adaptive Sampling for Best Policy Identification in Markov Decision Processes

Aymen Al Marjani, Alexandre Proutiere

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

5:35

19/08/2021

Understanding the Effect of Bias in Deep Anomaly Detection

Ziyu Ye, Yuxin Chen, Haitao Zheng

Keywords Paper

Machine Learning, Deep Learning, Anomaly/Outlier Detection, Semi-Supervised Learning

0

0

0

1

14:23

20/07/2020

Large deviations for the perceptron model and consequences for active learning

Hugo Cui, Luca Saglietti, Lenka Zdeborova

Keywords Paper

0

0

0

0

14:19

06/12/2020

Learning to summarize with human feedback

Nisan Stiennon, Long Ouyang, Jeffrey Wu and
Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Keywords Paper

0

0

0

0

3:17

14/06/2020

Instance Credibility Inference for Few-Shot Learning

Yikai Wang, Chengming Xu, Chen Liu and
Li Zhang, Yanwei Fu

Keywords Paper

few-shot learning, incidental parameters, regularization path, semi-supervised learning, self-taught learning

0

0

0

0

1:01

23/08/2020

Targeted data-driven regularization for out-of-distribution generalization

Mohammad Mahdi Kamani, Sadegh Farhang, Mehrdad Mahdavi, James Z. Wang

Keywords Paper

data-driven regularization, out-of-distribution generalization, bilevel programming

0

0

0

0

6:36

04/07/2020

Investigating Word-Class Distributions in Word Vector Spaces

Ryohei Sasano, Anna Korhonen

Keywords Paper

modeling distribution, centroid-based model, discriminative models, Word-Class Distributions

0

0

0

0

11:53

02/02/2021

Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks

Haobo Jiang, Jin Xie, Jian Yang

Keywords Paper

0

0

0

0

13:27

26/04/2020

Frequency-based Search-control in Dyna

Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand

Keywords Paper

Model-based reinforcement learning, search-control, Dyna, frequency of a signal

0

0

0

0

4:32

13/04/2021

The sample complexity of meta sparse regression

Zhanyu Wang, Jean Honorio

Keywords Paper

0

0

0

0

2:57

12/07/2020

Sequential Transfer in Reinforcement Learning with a Generative Model

Andrea Tirinzoni, Riccardo Poiani, Marcello Restelli

Keywords Paper

Reinforcement Learning - General

0

0

0

0

10:54

06/12/2020

On the Theory of Transfer Learning: The Importance of Task Diversity

Nilesh Tripuraneni, Michael Jordan, Chi Jin

Keywords Paper

0

0

0

0

3:19

12/07/2020

Data Valuation using Reinforcement Learning

Jinsung Yoon, Sercan Arik, Tomas Pfister

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

14:35

23/08/2020

Adversarial infidelity learning for model interpretation

Jian Liang, Bing Bai, Yuren Cao and
Kun Bai, Fei Wang

Keywords Paper

infidelity, model interpretation, adversarial learning, black-box explanations

0

0

0

0

5:34

18/11/2020

A one-step approach to covariate shift adaptation

Tianyi Zhang, Ikko Yamane, Nan Lu, Masashi Sugiyama

Keywords Paper

0

0

0

0

12:27

06/12/2021

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

Keywords Paper

machine learning, online learning

0

0

0

0

14:00

26/08/2020

A Theoretical and Practical Framework for Regression and Classification from Truncated Samples

Andrew Ilyas, Emmanouil Zampetakis, Constantinos Daskalakis

Keywords Paper

0

0

0

0

15:28

18/07/2021

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient

Botao Hao, Yaqi Duan, Tor Lattimore and
Csaba Szepesvari, Mengdi Wang

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:20

06/12/2020

A random matrix analysis of random Fourier features: beyond the Gaussian kernel, a precise phase transition, and the corresponding double descent

Zhenyu Liao, Romain Couillet, Michael W Mahoney

Keywords Paper

0

0

0

0

3:26

14/06/2020

Adaptive Dilated Network With Self-Correction Supervision for Counting

Shuai Bai, Zhiqun He, Yu Qiao and
Hanzhe Hu, Wei Wu, Junjie Yan

Keywords Paper

crowd counting, self-correction, convolutional neural network

0

0

0

0

0:59

12/07/2020

The Sample Complexity of Best-$k$ Items Selection from Pairwise Comparisons

Wenbo Ren, Jia Liu, Ness Shroff

Keywords Paper

Supervised Learning

0

0

0

0

13:16

12/07/2020

Convex Calibrated Surrogates for the Multi-Label F-Measure

Mingyuan Zhang, Harish Guruprasad Ramaswamy, Shivani Agarwal

Keywords Paper

Supervised Learning

0

0

0

0

16:09

12/07/2020

Explaining Groups of Points in Low-Dimensional Representations

Gregory Plumb, Jonathan Terhorst, Sriram Sankararaman, Ameet Talwalkar

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

12:07

26/08/2020

Multi-level Gaussian Graphical Models Conditional on Covariates

Gi Bum Kim, Seyoung Kim

Keywords Paper

0

0

0

0

12:45