Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection

04/07/2020

Shauli Ravfogel, Yanai Elazar, Hila Gonen, Michael Twiton, Yoav Goldberg

Keywords: multi-class classification, Iterative Projection, neural representation

Abstract: The ability to control for the kinds of information encoded in neural representation has a variety of use cases, especially in light of the challenge of interpreting these models. We present Iterative Null-space Projection (INLP), a novel method for removing information from neural representations. Our method is based on repeated training of linear classifiers that predict a certain property we aim to remove, followed by projection of the representations on their null-space. By doing so, the classifiers become oblivious to that target property, making it hard to linearly separate the data according to it. While applicable for multiple uses, we evaluate our method on bias and fairness use-cases, and show that our method is able to mitigate bias in word embeddings, as well as to increase fairness in a setting of multi-class classification.
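The iterate-and-project loop described in the abstract can be illustrated with a minimal sketch. It assumes a binary protected attribute, synthetic data, and a plain NumPy logistic regression standing in for the linear classifier; the names `train_linear_classifier`, `accuracy`, and `inlp` are hypothetical and are not taken from the authors' code.

```python
import numpy as np

def train_linear_classifier(X, y, steps=500, lr=0.5):
    """Fit a bias-free logistic regression by gradient descent (assumed stand-in classifier)."""
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        z = np.clip(X @ w, -30.0, 30.0)      # clip logits for numerical safety
        p = 1.0 / (1.0 + np.exp(-z))         # predicted probability of class 1
        w -= lr * X.T @ (p - y) / len(y)     # gradient step on the mean log-loss
    return w

def accuracy(X, y, w):
    """Fraction of rows whose sign(x . w) matches the binary label."""
    return float(np.mean((X @ w > 0) == (y == 1)))

def inlp(X, y, n_iters=5):
    """Return a projection matrix P and the unit directions it removes."""
    d = X.shape[1]
    P = np.eye(d)
    directions = []
    for _ in range(n_iters):
        # Train on the representations that survive the current projection.
        w = train_linear_classifier(X @ P, y)
        norm = np.linalg.norm(w)
        if norm < 1e-8:                      # classifier found nothing to use
            break
        directions.append(w / norm)
        # Orthonormal basis of all directions found so far; P projects
        # onto the intersection of the classifiers' null-spaces.
        Q, _ = np.linalg.qr(np.stack(directions, axis=1))
        P = np.eye(d) - Q @ Q.T
    return P, directions

# Toy data: the protected attribute y leaks into the first two dimensions.
rng = np.random.default_rng(0)
n, d = 400, 10
y = rng.integers(0, 2, size=n)
X = rng.normal(size=(n, d))
X[:, 0] += 2.0 * (2 * y - 1)
X[:, 1] += 1.0 * (2 * y - 1)
X -= X.mean(axis=0)                          # center, since the classifier has no bias

P, directions = inlp(X, y)
acc_before = accuracy(X, y, train_linear_classifier(X, y))
X_clean = X @ P
acc_after = accuracy(X_clean, y, train_linear_classifier(X_clean, y))
print(f"accuracy before: {acc_before:.2f}, after: {acc_after:.2f}")
```

In this sketch each new classifier is trained only on what the previous projections left behind, so the removed directions accumulate until a fresh linear classifier can do little better than chance on the projected data.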

The talk and the respective paper are published at the ACL 2020 virtual conference.

Similar Papers

06/12/2021

Can Information Flows Suggest Targets for Interventions in Neural Circuits?

Praveen Venkatesh, Sanghamitra Dutta, Neil Mehta, Pulkit Grover

Keywords: theory, deep learning, machine learning, neuroscience, fairness, interpretability

14:11

06/12/2021

Bayesian Adaptation for Covariate Shift

Aurick Zhou, Sergey Levine

Keywords: deep learning, machine learning, robustness, vision, domain adaptation

8:21

12/07/2020

Adversarial Filters of Dataset Biases

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula, Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords: Deep Learning - Algorithms

15:25

06/12/2020

Learning from Failure: De-biasing Classifier from Biased Classifier

Junhyun Nam, Hyuntak Cha, Sungsoo Ahn, Jaeho Lee, Jinwoo Shin

3:21

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

19:54

02/02/2021

Visualization of Supervised and Self-Supervised Neural Networks via Attribution Guided Factorization

Shir Gur, Ameen Ali, Lior Wolf

14:14

13/04/2021

Influence decompositions for neural network attribution

Kyle Reing, Greg Ver Steeg, Aram Galstyan

2:52

06/12/2020

Posterior Re-calibration for Imbalanced Datasets

Junjiao Tian, Yen-Cheng Liu, Nathaniel Glaser, Yen-Chang Hsu, Zsolt Kira

Keywords: Algorithms -> Few-Shot Learning, Applications -> Computer Vision

3:23

06/12/2021

Understanding Interlocking Dynamics of Cooperative Rationalization

Mo Yu, Yang Zhang, Shiyu Chang, Tommi Jaakkola

Keywords: deep learning, language, interpretability

13:41

18/07/2021

Robust Representation Learning via Perceptual Similarity Metrics

Saeid A Taghanaki, Kristy Choi, Amir Hosein Khasahmadi, Anirudh Goyal

Keywords: Deep Learning, Embedding and Representation learning

4:48

14/06/2020

Regularizing Class-Wise Predictions via Self-Knowledge Distillation

Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin

Keywords: image classification, regularization, self-knowledge distillation, generalization, calibration

1:01

06/12/2020

Adversarial Self-Supervised Contrastive Learning

Minseon Kim, Jihoon Tack, Sung Ju Hwang

3:19

14/06/2020

NestedVAE: Isolating Common Factors via Weak Supervision

Matthew J. Vowels, Necati Cihan Camgöz, Richard Bowden

Keywords: fairness, bias, representation learning, invariance, vae, variational, weakly supervised, information bottleneck

1:00

04/07/2020

Location Attention for Extrapolation to Longer Sequences

Yann Dubois, Gautier Dagan, Dieuwke Hupkes, Elia Bruni

Keywords: Extrapolation, natural processing, generalization, Lookup task

11:02

06/12/2021

Consistency Regularization for Variational Auto-Encoders

Samarth Sinha, Adji Bousso Dieng

Keywords: deep learning, machine learning, self-supervised learning, generative model, contrastive learning, representation learning

10:52

14/06/2020

Robust Design of Deep Neural Networks Against Adversarial Attacks Based on Lyapunov Theory

Arash Rahnama, Andre T. Nguyen, Edward Raff

Keywords: adversarial machine learning, robustness, control theory, lyapunov theory, spectral norm regularization, stability and robustness analysis of dnns, dissipativity and passivity theory, adversarial attacks, learning theory, mathematical analysis of dnns

1:00

26/08/2020

Non-Parametric Calibration for Classification

Jonathan Wenger, Hedvig Kjellström, Rudolph Triebel

15:29

02/02/2021

Uncertainty-Aware Multi-View Representation Learning

Yu Geng, Zongbo Han, Changqing Zhang, Qinghua Hu

14:19

06/12/2021

Predify: Augmenting deep neural networks with brain-inspired predictive coding dynamics

Bhavin Choksi, Milad Mozafari, Callum Biggs O'May, B. ADOR, Andrea Alamia, Rufin VanRullen

Keywords: deep learning, machine learning, robustness, adversarial robustness and security, neuroscience, vision

11:21

06/12/2021

A universal probabilistic spike count model reveals ongoing modulation of neural variability

David Liu, Mate Lengyel

Keywords: generative model, kernel methods

15:06

03/05/2021

Representation Learning via Invariant Causal Mechanisms

Jovana Mitrovic, Brian McWilliams, Jacob C Walker, Lars Buesing, Charles Blundell

Keywords: Self-supervised Learning, Representation Learning, Causality, Contrastive Methods

7:03

06/12/2020

Robust Federated Learning: The Case of Affine Distribution Shifts

Amirhossein Reisizadeh, Farzan Farnia, Ramtin Pedarsani, Ali Jadbabaie

3:16

03/05/2021

Removing Undesirable Feature Contributions Using Out-of-Distribution Data

Saehyung Lee, Changhwa Park, Hyungyu Lee, Jihun Yi, Jonghyun Lee, Sungroh Yoon

Keywords: adversarial training, generalization, out-of-distribution, adversarial robustness

5:05

03/05/2021

Influence Functions in Deep Learning Are Fragile

Samyadeep Basu, Phil Pope, Soheil Feizi

Keywords: Influence Functions, Interpretability

6:15

06/12/2020

A Causal View on Robustness of Neural Networks

Cheng Zhang, Kun Zhang, Yingzhen Li

Keywords: Data, Challenges, Implementations, and Software -> Virtual Environments; Deep Learning -> Memory-Augmented Neural Networks; Neu, Deep Learning

3:25

14/06/2020

Attack to Explain Deep Representation

Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

Keywords: interpreting deep learning, adversarial attack, explanation attack, explainable ai, image generation

1:01

18/07/2021

Uniform Convergence, Adversarial Spheres and a Simple Remedy

Gregor Bachmann, Seyed Moosavi, Thomas Hofmann

Keywords: Theory, Deep learning Theory

5:52

03/05/2021

Towards Robust Neural Networks via Close-loop Control

Zhuotong Chen, Qianxiao Li, Zheng Zhang

Keywords: dynamical system, neural network robustness, optimal control

4:47

03/05/2021

Counterfactual Generative Networks

Axel Sauer, Andreas Geiger

Keywords: Generative Models, Data Augmentation, Image Classification, Counterfactuals, Robustness, Causality

5:25

18/07/2021

Instabilities of Offline RL with Pre-Trained Neural Representation

Ruosong Wang, Yifan Wu, Russ Salakhutdinov, Sham Kakade

Keywords: Reinforcement Learning and Planning

5:19

07/09/2020

Transferring Pretrained Networks to Small Data via Category Decorrelation

Ying Jin, Zhangjie Cao, Mingsheng Long, Jianmin Wang

Keywords: Category Decorrelation, Under Transfer

8:39

06/12/2020

Learning efficient task-dependent representations with synaptic plasticity

Colin Bredenberg, Eero Simoncelli, Cristina Savin

3:24

18/07/2021

Nondeterminism and Instability in Neural Network Optimization

Cecilia Summers, Michael J Dinneen

Keywords: Deep Learning, Optimization for Deep Networks

5:12

06/12/2021

Learning Debiased and Disentangled Representations for Semantic Segmentation

Sanghyeok Chu, Dongwan Kim, Bohyung Han

Keywords: deep learning, machine learning, vision

15:14

26/04/2020

Disentangling neural mechanisms for perceptual grouping

Junkyung Kim, Drew Linsley, Kalpit Thakkar, Thomas Serre

Keywords: Perceptual grouping, visual cortex, recurrent feedback, horizontal connections, top-down connections

5:16

03/05/2021

Evaluation of Neural Architectures Trained With Square Loss vs Cross-Entropy in Classification Tasks

Like Hui, Misha Belkin

Keywords: classification, experimental evaluation, square loss vs cross-entropy, large scale learning

5:09

06/12/2021

Adaptive Sampling for Minimax Fair Classification

Shubhanshu Shekhar, Greg Fields, Mohammad Ghavamzadeh, Tara Javidi

Keywords: deep learning, machine learning, fairness

15:19

02/02/2021

Explainable Models with Consistent Interpretations

Vipin Pillai, Hamed Pirsiavash

16:20

12/07/2020

Training Binary Neural Networks through Learning with Noisy Supervision

Kai Han, Yunhe Wang, Yixing Xu, Chunjing Xu, Enhua Wu, Chang Xu

Keywords: Unsupervised and Semi-Supervised Learning

12:34

26/08/2020

Robust Learning from Discriminative Feature Feedback

Sanjoy Dasgupta, Sivan Sabato

14:37