On Statistical Bias In Active Learning: How and When to Fix It

Abstract: Active learning is a powerful tool when labelling data is expensive, but it introduces a bias because the training data no longer follows the population distribution. We formalize this bias and investigate the situations in which it can be harmful and sometimes even helpful. We further introduce novel corrective weights to remove bias when doing so is beneficial. Through this, our work provides not only a useful mechanism that can improve the active learning approach, but also an explanation for the empirical successes of various existing approaches which ignore this bias. In particular, we show that this bias can be actively helpful when training overparameterized models, such as neural networks, with relatively modest dataset sizes.

06/12/2021

Sebastian Farquhar, Yarin Gal, Tom Rainforth

Similar Papers

No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data

Mi Luo, Fei Chen, Dapeng Hu, Yifan Zhang, Jian Liang, Jiashi Feng

Keywords: optimization, machine learning, federated learning

Fair Generative Modeling via Weak Supervision

Kristy Choi, Aditya Grover, Trisha Singh, Rui Shu, Stefano Ermon

Keywords: Deep Learning - Generative Models and Autoencoders

The Power of Comparisons for Actively Learning Linear Classifiers

Max Hopkins, Daniel Kane, Shachar Lovett

Towards Better Generalization of Adaptive Gradient Methods

Yingxue Zhou, Belhal Karimi, Jinxing Yu, Zhiqiang Xu, Ping Li

Learning from Failure: De-biasing Classifier from Biased Classifier

Junhyun Nam, Hyuntak Cha, Sungsoo Ahn, Jaeho Lee, Jinwoo Shin

Learning where to learn: Gradient sparsity in meta and continual learning

Johannes von Oswald, Dominic Zhao, Seijin Kobayashi, Simon Schug, Massimo Caccia, Nicolas Zucchet, João Sacramento

Keywords: deep learning, optimization, meta learning, continual learning, few shot learning

Self-Adaptive Training: beyond Empirical Risk Minimization

Lang Huang, Chao Zhang, Hongyang Zhang

Keywords: Deep Learning -> Generative Models, Algorithms -> Semi-Supervised Learning

Analyzing the effect of neural network architecture on training performance

Karthik Abinav Sankararaman, Soham De, Zheng Xu, W. Ronny Huang, Tom Goldstein

Keywords: Deep Learning - Theory

MOPO: Model-based Offline Policy Optimization

Tianhe (Kevin) Yu, Garrett Thomas, Lantao Yu, Stefano Ermon, James Zou, Sergey Levine, Chelsea Finn, Tengyu Ma

Network Pruning That Matters: A Case Study on Retraining Variants

Duong Le, Binh-Son Hua

Keywords: Network Pruning

When Do Curricula Work?

Xiaoxia (Shirley) Wu, Ethan Dyer, Behnam Neyshabur

Keywords: Empirical Investigation, Understanding Deep Learning, Curriculum Learning

Minimax Lower Bounds for Transfer Learning with Linear and One-hidden Layer Neural Networks

Mohammadreza Mousavi Kalan, Zalan Fabian, Salman Avestimehr, Mahdi Soltanolkotabi

On Episodes, Prototypical Networks, and Few-Shot Learning

Steinar Laenen, Luca Bertinetto

Keywords: machine learning, generative model, meta learning, few shot learning

A Theoretical Analysis of Fine-tuning with Linear Teachers

Gal Shachaf, Alon Brutzkus, Amir Globerson

Keywords: theory, deep learning, transfer learning

Curse or Redemption? How Data Heterogeneity Affects the Robustness of Federated Learning

Syed Zawad, Ahsan Ali, Pin-Yu Chen, Ali Anwar, Yi Zhou, Nathalie Baracaldo, Yuan Tian, Feng Yan

EvidentialMix: Learning With Combined Open-Set and Closed-Set Noisy Labels

Ragav Sachdeva, Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

Step-Ahead Error Feedback for Distributed Training with Compressed Gradient

An Xu, Zhouyuan Huo, Heng Huang

More Data Can Expand The Generalization Gap Between Adversarially Robust and Standard Models

Lin Chen, Yifei Min, Mingrui Zhang, Amin Karbasi

Keywords: Adversarial Examples

PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees

Jonas Rothfuss, Vincent Fortuin, Martin Josifoski, Andreas Krause

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

Adaptive Sampling for Minimax Fair Classification

Shubhanshu Shekhar, Greg Fields, Mohammad Ghavamzadeh, Tara Javidi

Keywords: deep learning, machine learning, fairness

Contrastive Learning Inverts the Data Generating Process

Roland S. Zimmermann, Yash Sharma, Steffen Schneider, Matthias Bethge, Wieland Brendel

Keywords: Theory, Deep learning Theory
