The Importance of Modeling Data Missingness in Algorithmic Fairness: A Causal Perspective

02/02/2021

The Importance of Modeling Data Missingness in Algorithmic Fairness: A Causal Perspective

Naman Goel, Alfonso Amayuelas, Amit Deshpande, Amit Sharma

Keywords:

Abstract Paper Similar Papers

Abstract: Training datasets for machine learning often have some form of missingness. For example, to learn a model for deciding whom to give a loan, the available training data includes individuals who were given a loan in the past, but not those who were not. This missingness, if ignored, nullifies any fairness guarantee of the training procedure when the model is deployed. Using causal graphs, we characterize the missingness mechanisms in different real-world scenarios. We show conditions under which various distributions, used in popular fairness algorithms, can or can not be recovered from the training data. Our theoretical results imply that many of these algorithms can not guarantee fairness in practice. Modeling missingness also helps to identify correct design principles for fair algorithms. For example, in multi-stage settings where decisions are made in multiple screening rounds, we use our framework to derive the minimal distributions required to design a fair algorithm. Our proposed algorithm also decentralizes the decision-making process and still achieves similar performance to the optimal algorithm that requires centralization and non-recoverable distributions.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38948271

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

13/04/2021

All of the fairness for edge prediction with optimal transport

Charlotte Laclau, Ievgen Redko, Manvi Choudhary, Christine Largeron

Keywords Paper

0

0

0

0

3:09

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

0

0

0

0

4:15

13/04/2021

Learning the truth from only one side of the story

Heinrich Jiang, Qijia Jiang, Aldo Pacchiano

Keywords Paper

0

0

0

0

2:54

06/12/2021

Sample Selection for Fair and Robust Training

Yuji Roh, Kangwook Lee, Steven Whang, Changho Suh

Keywords Paper

optimization, robustness, fairness

0

0

0

0

13:44

18/07/2021

A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning

Nikunj Saunshi, Arushi Gupta, Wei Hu

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:20

06/12/2021

Adaptive Sampling for Minimax Fair Classification

Shubhanshu Shekhar, Greg Fields, Mohammad Ghavamzadeh, Tara Javidi

Keywords Paper

deep learning, machine learning, fairness

0

0

0

0

15:19

06/12/2021

Does enforcing fairness mitigate biases caused by subpopulation shift?

Subha Maity, Debarghya Mukherjee, Mikhail Yurochkin, Yuekai Sun

Keywords Paper

graph learning, domain adaptation, fairness

0

0

0

0

13:22

02/02/2021

Group Fairness by Probabilistic Modeling with Latent Fair Decisions

YooJung Choi, Meihua Dang, Guy Van den Broeck

Keywords Paper

0

0

0

0

19:30

19/10/2020

Fairness-aware learning with prejudice free representations

Ramanujam Madhavan, Mohit Wadhwa

Keywords Paper

fairness, prejudice, privacy, interpretability

0

0

0

0

7:04

18/07/2021

Whitening and Second Order Optimization Both Make Information in the Dataset Unusable During Training, and Can Reduce or Prevent Generalization

Neha Wadia, Daniel Duckworth, Samuel Schoenholz and
Ethan Dyer, Jascha Sohl-Dickstein

Keywords Paper

Optimization, Probabilistic Methods, Topic Models, Probabilistic Methods, Latent Variable Models

0

0

0

0

5:17

13/04/2021

On the generalization properties of adversarial training

Yue Xing, Qifan Song, Guang Cheng

Keywords Paper

0

0

0

0

3:05

06/12/2021

Integrated Latent Heterogeneity and Invariance Learning in Kernel Space

Jiashuo Liu, Zheyuan Hu, Peng Cui and
Bo Li, Zheyan Shen

Keywords Paper

deep learning, reinforcement learning and planning, machine learning

0

0

0

0

11:11

06/12/2020

Minimax Lower Bounds for Transfer Learning with Linear and One-hidden Layer Neural Networks

Mohammadreza Mousavi Kalan, Zalan Fabian, Salman Avestimehr, Mahdi Soltanolkotabi

Keywords Paper

0

0

0

0

3:16

06/12/2021

Subgroup Generalization and Fairness of Graph Neural Networks

Jiaqi Ma, Junwei Deng, Qiaozhu Mei

Keywords Paper

deep learning, graph learning, fairness, semi-supervised learning

0

0

0

0

14:45

02/02/2021

Exploratory Machine Learning with Unknown Unknowns

Peng Zhao, Yu-Jie Zhang, Zhi-Hua Zhou

Keywords Paper

0

0

0

0

21:39

26/04/2020

Meta-Learning without Memorization

Mingzhang Yin, George Tucker, Mingyuan Zhou and
Sergey Levine, Chelsea Finn

Keywords Paper

meta-learning, memorization, regularization, overfitting, mutually-exclusive

0

0

0

0

5:09

06/12/2021

Assessing Fairness in the Presence of Missing Data

Yiliang Zhang, Qi Long

Keywords Paper

domain adaptation, fairness

0

0

0

0

7:37

26/08/2020

Identifying and Correcting Label Bias in Machine Learning

Heinrich Jiang, Ofir Nachum

Keywords Paper

0

0

0

0

12:42

02/02/2021

Any-Precision Deep Neural Networks

Haichao Yu, Haoxiang Li, Humphrey Shi and
Thomas S. Huang, Gang Hua

Keywords Paper

0

0

0

0

14:26

13/04/2021

Towards understanding the behaviors of optimal deep active learning algorithms

Yilun Zhou, Adithya Renduchintala, Xian Li and
Sida Wang, Yashar Mehdad, Asish Ghoshal

Keywords Paper

0

0

0

0

3:00

26/04/2020

Towards Verified Robustness under Text Deletion Interventions

Johannes Welbl, Po-Sen Huang, Robert Stanforth and
Sven Gowal, Krishnamurthy (Dj) Dvijotham, Martin Szummer, Pushmeet Kohli

Keywords Paper

natural language processing, specification, verification, model undersensitivity, adversarial, interval bound propagation

0

0

0

0

5:01

12/07/2020

Supervised learning: no loss no cry

Richard Nock, Aditya Menon

Keywords Paper

Learning Theory

0

0

0

0

15:18

06/12/2021

Statistically and Computationally Efficient Linear Meta-representation Learning

Kiran Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

Keywords Paper

optimization, meta learning, representation learning, few shot learning

1

0

0

1

12:56

26/04/2020

Rényi Fair Inference

Sina Baharlouei, Maher Nouiehed, Ahmad Beirami, Meisam Razaviyayn

Keywords Paper

0

0

0

0

4:41

18/07/2021

A Distribution-dependent Analysis of Meta Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvari

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:06

06/12/2020

What Neural Networks Memorize and Why: Discovering the Long Tail via Influence Estimation

Vitaly Feldman, Chiyuan Zhang

Keywords Paper

0

0

0

0

3:22

06/12/2020

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL

Saurabh Kumar, Aviral Kumar, Sergey Levine, Chelsea Finn

Keywords Paper

0

0

0

0

3:24

09/07/2020

Implicit Bias of Gradient Descent for Wide Two-layer Neural Networks Trained with the Logistic Loss

Lénaïc Chizat, Francis Bach

Keywords Paper

Neural networks/deep learning, Non-convex optimization

0

0

0

0

14:41

18/07/2021

Selecting Data Augmentation for Simulating Interventions

Max Ilse, Jakub Tomczak, Patrick Forré

Keywords Paper

Algorithms, Supervised Learning

0

0

0

0

4:14

18/07/2021

Dash: Semi-Supervised Learning with Dynamic Thresholding

Yi Xu, Lei Shang, Jinxing Ye and
Qi Qian, Yufeng Li, Baigui Sun, Hao Li, rong jin

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

1

15:24

06/12/2020

Functional Regularization for Representation Learning: A Unified Theoretical Perspective

Siddhant Garg, Yingyu Liang

Keywords Paper

0

0

0

0

3:19

26/08/2020

Fair Decisions Despite Imperfect Predictions

Niki Kilbertus, Manuel Gomez Rodriguez, Bernhard Schölkopf and
Krikamol Muandet, Isabel Valera

Keywords Paper

0

0

0

0

14:12

18/07/2021

On Recovering from Modeling Errors Using Testing Bayesian Networks

Haiying Huang, Adnan Darwiche

Keywords Paper

Probabilistic Methods, Graphical Models

0

0

0

0

5:09

05/01/2021

Representation Learning With Statistical Independence to Mitigate Bias

Ehsan Adeli, Qingyu Zhao, Adolf Pfefferbaum and
Edith V. Sullivan, Li Fei-Fei, Juan Carlos Niebles, Kilian M. Pohl

Keywords Paper

0

0

0

0

4:33

02/02/2021

Teaching the Old Dog New Tricks: Supervised Learning with Constraints

Fabrizio Detassis, Michele Lombardi, Michela Milano

Keywords Paper

0

0

0

0

16:58

19/08/2021

Cross-Domain Few-Shot Classification via Adversarial Task Augmentation

Haoqing Wang, Zhi-Hong Deng

Keywords Paper

Computer Vision, Recognition, Adversarial Machine Learning, Deep Learning

0

0

0

0

10:39

18/07/2021

Monotonic Robust Policy Optimization with Model Discrepancy

yuankun jiang, Chenglin Li, Wenrui Dai and
Junni Zou, Hongkai Xiong

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

5:17

06/12/2021

Bridging the Gap Between Practice and PAC-Bayes Theory in Few-Shot Meta-Learning

Nan Ding, Xi Chen, Tomer Levinboim and
Sebastian Goodman, Radu Soricut

Keywords Paper

theory, meta learning, few shot learning

0

0

0

0

11:14

06/12/2020

MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures

Jeong Un Ryu, JWoong Shin, Hae Beom Lee, Sung Ju Hwang

Keywords Paper

0

0

0

0

3:32

06/12/2021

DECAF: Generating Fair Synthetic Data Using Causally-Aware Generative Networks

Boris van Breugel, Trent Kyono, Jeroen Berrevoets, Mihaela van der Schaar

Keywords Paper

machine learning, generative model, causality, fairness

0

0

0

0

9:53