Representation Matters: Assessing the Importance of Subgroup Allocations in Training Data

18/07/2021

Esther Rolf, Theodora Worledge, Benjamin Recht, Michael Jordan

Keywords: Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

Abstract: Collecting more diverse and representative training data is often touted as a remedy for the disparate performance of machine learning predictors across subpopulations. However, a precise framework for understanding how dataset properties like diversity affect learning outcomes is largely lacking. By casting data collection as part of the learning process, we demonstrate that diverse representation in training data is key not only to increasing subgroup performances, but also to achieving population-level objectives. Our analysis and experiments describe how dataset compositions influence performance and provide constructive results for using trends in existing data, alongside domain knowledge, to help guide intentional, objective-aware dataset design.

The talk and the corresponding paper were published at the ICML 2021 virtual conference.

Similar Papers

19/08/2021

Regularising Knowledge Transfer by Meta Functional Learning

Pan Li, Yanwei Fu, Shaogang Gong

Machine Learning, Classification, Transfer, Adaptation, Multi-task Learning, Weakly Supervised Learning

13:41

03/05/2021

When Do Curricula Work?

Xiaoxia (Shirley) Wu, Ethan Dyer, Behnam Neyshabur

Empirical Investigation, Understanding Deep Learning, Curriculum Learning

14:37

18/07/2021

Model Performance Scaling with Multiple Data Sources

Tatsunori Hashimoto

Algorithms, Supervised Learning

4:50

06/12/2021

On Large-Cohort Training for Federated Learning

Zachary Charles, Zachary Garrett, Zhouyuan Huo, Sergei Shmulyian, Virginia Smith

optimization, fairness, federated learning

13:53

03/05/2021

Dataset Condensation with Gradient Matching

Bo ZHAO, Konda Reddy Mopuri, Hakan Bilen

dataset condensation, image generation, data-efficient learning

15:09

13/04/2021

Learn to expect the unexpected: Probably approximately correct domain generalization

Vikas Garg, Adam Tauman Kalai, Katrina Ligett, Steven Wu

3:01

06/12/2021

A Mathematical Framework for Quantifying Transferability in Multi-source Transfer Learning

Xinyi Tong, Xiangxiang Xu, Shao-Lun Huang, Lizhong Zheng

theory, deep learning, machine learning, vision, transfer learning

13:27

06/12/2020

Continuous Meta-Learning without Tasks

James Harrison, Apoorva Sharma, Chelsea Finn, Marco Pavone

3:09

26/04/2020

Federated Adversarial Domain Adaptation

Xingchao Peng, Zijun Huang, Yizhe Zhu, Kate Saenko

Federated Learning, Domain Adaptation, Transfer Learning, Feature Disentanglement

4:57

16/11/2020

Self-Paced Learning for Neural Machine Translation

Yu Wan, Baosong Yang, Derek F. Wong, Yikai Zhou, Lidia S. Chao, Haibo Zhang, Boxing Chen

neural, curriculum learning, translation tasks, nmt

6:03

06/12/2020

Information-theoretic Task Selection for Meta-Reinforcement Learning

Ricardo Luna Gutierrez, Matteo Leonetti

2:57

13/04/2021

On data efficiency of meta-learning

Maruan Al-Shedivat, Liam Li, Eric Xing, Ameet Talwalkar

3:24

13/04/2021

Nonparametric estimation of heterogeneous treatment effects: From theory to learning algorithms

Alicia Curth, Mihaela van der Schaar

3:01

23/08/2020

Spectrum-guided adversarial disparity learning

Zhe Liu, Lina Yao, Lei Bai, Xianzhi Wang, Can Wang

adversarial autoencoder, generative models, intraclass variability, activity recognition

14:30

20/07/2020

SelectNet: Learning to Sample from the Wild for Imbalanced Data Training

Yunru Liu, Tingran Gao, Haizhao Yang

20:50

06/12/2021

Adaptive Sampling for Minimax Fair Classification

Shubhanshu Shekhar, Greg Fields, Mohammad Ghavamzadeh, Tara Javidi

deep learning, machine learning, fairness

15:19

06/12/2020

What is being transferred in transfer learning?

Behnam Neyshabur, Hanie Sedghi, Chiyuan Zhang

3:20

06/12/2020

Probabilistic Active Meta-Learning

Jean Kaddour, Steindor Saemundsson, Marc Deisenroth

3:17

12/07/2020

Meta Variance Transfer: Learning to Augment from the Others

Seong-Jin Park, Seungju Han, Ji-won Baek, Insoo Kim, Juhwan Song, Hae Beom Lee, Jae-Joon Han, Sung Ju Hwang

Transfer, Multitask and Meta-learning

14:59

13/04/2021

Sample elicitation

Jiaheng Wei, Zuyue Fu, Yang Liu, Xingyu Li, Zhuoran Yang, Zhaoran Wang

3:16

12/07/2020

The Non-IID Data Quagmire of Decentralized Machine Learning

Kevin Hsieh, Amar Phanishayee, Onur Mutlu, Phillip Gibbons

Optimization - Large Scale, Parallel and Distributed

14:58

06/12/2021

Deep Learning on a Data Diet: Finding Important Examples Early in Training

Mansheej Paul, Surya Ganguli, Gintare Karolina Dziugaite

deep learning

10:18

03/05/2021

Adaptive Procedural Task Generation for Hard-Exploration Problems

Kuan Fang, Yuke Zhu, Silvio Savarese, Li Fei-Fei

reinforcement learning, task generation, procedural generation, curriculum learning

5:06

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Algorithms, Online Learning, Algorithms, Supervised Learning

4:15

04/07/2020

Empowering Active Learning to Jointly Optimize System and User Demands

Ji-Ung Lee, Christian M. Meyer, Iryna Gurevych

educational application, Active Learning, end-user application, active approach

12:00

13/04/2021

A theory of multiple-source adaptation with limited target labeled data

Yishay Mansour, Mehryar Mohri, Jae Ro, Ananda Theertha Suresh, Ke Wu

2:39

04/07/2020

Learning to Contextually Aggregate Multi-Source Supervision for Sequence Labeling

Ouyu Lan, Xiao Huang, Bill Yuchen Lin, He Jiang, Liyuan Liu, Xiang Ren

Sequence Labeling, natural problems, crowd annotation, multi-source learning

12:01

14/09/2020

Active Learning for Hierarchical Multi-Label Classification

Felipe Kenji Nakano, Ricardo Cerri, Vens Celin

15:42

06/12/2020

Structured Prediction for Conditional Meta-Learning

Ruohan Wang, Yiannis Demiris, Carlo Ciliberto

3:12

06/12/2021

No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data

Mi Luo, Fei Chen, Dapeng Hu, Yifan Zhang, Jian Liang, Jiashi Feng

optimization, machine learning, federated learning

3:27

06/12/2020

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL

Saurabh Kumar, Aviral Kumar, Sergey Levine, Chelsea Finn

3:24

06/12/2021

Revisit Multimodal Meta-Learning through the Lens of Multi-Task Learning

Milad Abdollahzadeh, Touba Malekzadeh, Ngai-Man (Man) Cheung

meta learning, few shot learning

13:08

26/08/2020

Data Generation for Neural Programming by Example

Judith Clymo, Adria Gascon, Brooks Paige, Nathanael Fijalkow, Haik Manukian

14:31

08/12/2020

Dynamic Curriculum Learning for Low-Resource Neural Machine Translation

Chen Xu, Bojie Hu, Yufan Jiang, Kai Feng, Zeyang Wang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

13:28

06/12/2020

Self-Paced Deep Reinforcement Learning

Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

3:00

03/05/2021

One Network Fits All? Modular versus Monolithic Task Formulations in Neural Networks

Atish Agarwala, Abhimanyu Das, Brendan Juba, Rina Panigrahy, Vatsal Sharan, Xin Wang, Qiuyi Zhang

deep learning theory, multi-task learning

5:18

26/04/2020

Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples

Eleni Triantafillou, Tyler Zhu, Vincent Dumoulin, Pascal Lamblin, Utku Evci, Kelvin Xu, Ross Goroshin, Carles Gelada, Kevin Swersky, Pierre-Antoine Manzagol, Hugo Larochelle

few-shot learning, meta-learning, few-shot classification

5:05

03/05/2021

Theoretical bounds on estimation error for meta-learning

James Lucas, Mengye Ren, Irene Raissa KAMENI KAMENI, Toniann Pitassi, Richard Zemel

meta learning, minimax risk, few-shot, lower bounds, learning theory

4:46

02/02/2021

Constructing a Fair Classifier with Generated Fair Data

Taeuk Jang, Feng Zheng, Xiaoqian Wang

15:58

16/11/2020

Transformer Based Multi-Source Domain Adaptation

Dustin Wright, Isabelle Augenstein

unsupervised adaptation, cnns, rnns, domain classifiers

11:30