Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation

Abstract: Word embeddings derived from human-generated corpora inherit strong gender bias which can be further amplified by downstream models. Some commonly adopted debiasing approaches, including the seminal Hard Debias algorithm, apply post-processing procedures that project pre-trained word embeddings into a subspace orthogonal to an inferred gender subspace. We discover that semantic-agnostic corpus regularities such as word frequency captured by the word embeddings negatively impact the performance of these algorithms. We propose a simple but effective technique, Double Hard Debias, which purifies the word embeddings against such corpus regularities prior to inferring and removing the gender subspace. Experiments on three bias mitigation benchmarks show that our approach preserves the distributional semantics of the pre-trained word embeddings while reducing gender bias to a significantly larger degree than prior approaches.

19/04/2021

video recognition, audio-visual representation, self-supervised learning, active learning, contrastive representation learning

5:22

08/12/2020

Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation

Tianlu Wang, Xi Victoria Lin, Nazneen Fatema Rajani, Bryan McCann, Vicente Ordonez, Caiming Xiong

Comments

Similar Papers

Machine translationese: Effects of algorithmic bias on linguistic complexity in machine translation

Eva Vanmassenhove, Dimitar Shterionov, Matthew Gwilliam

Keywords Abstract Paper

Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased Proximities in Word Embeddings

Vaibhav Kumar, Tenzin Bhotia, Vaibhav Kumar, Tanmoy Chakraborty

Keywords Abstract Paper

word embeddings, semantic words, coreference resolution, post-processing methods

Learning Disentangled Representation for Fair Facial Attribute Classification via Fairness-aware Information Alignment

Sungho Park, Sunhee Hwang, Dohyung Kim, Hyeran Byun

Keywords Abstract Paper

Bigram and Unigram Based Text Attack via Adaptive Monotonic Heuristic Search

Xinghao Yang, Weifeng Liu, James Bailey and Dacheng Tao, Wei Liu

Keywords Abstract Paper

Reducing Gender Bias in Neural Machine Translation as a Domain Adaptation Problem

Danielle Saunders, Bill Byrne

Keywords Abstract Paper

Reducing Bias, Neural Translation, Domain Problem, NLP tasks

Infusing Sequential Information into Conditional Masked Translation Model with Self-Review Mechanism

Pan Xie, Zhi Cui, Xiuying Chen and XiaoHui Hu, Jianwei Cui, Bin Wang

Keywords Abstract Paper

On the Importance of Word Order Information in Cross-lingual Sequence Labeling

Zihan Liu, Genta I Winata, Samuel Cahyawijaya and Andrea Madotto, Zhaojiang Lin, Pascale Fung

Keywords Abstract Paper

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords Abstract Paper

Natural Inference, data augmentation, Robustifying Models, deep models

Robust normalized squares maximization for unsupervised domain adaptation

Wenju Zhang, Xiang Zhang, Qing Liao and Wenjing Yang, Long Lan, Zhigang Luo

Keywords Abstract Paper

transfer learning, image classification, domain adaptation

Combining Subword Representations into Word-level Representations in the Transformer Architecture

Noe Casas, Marta R. Costa-jussà, José A. R. Fonollosa

Keywords Abstract Paper

Neural Translation, Subword Representations, Word-level Representations, Transformer Architecture

Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition

Yangming Li, lemao liu, Shuming Shi

Keywords Abstract Paper

Negative Sampling, Unlabeled Entity Problem, Named Entity Recognition

Stereotype and skew: Quantifying gender bias in pre-trained and fine-tuned language models

Daniel Vassimon Manela, David Errington, Thomas Fisher and Boris Breugel, Pasquale Minervini

Keywords Abstract Paper

Active Contrastive Learning of Audio-Visual Video Representations

Shuang Ma, Zhaoyang Zeng, Daniel McDuff, Yale Song

Keywords Abstract Paper

video recognition, audio-visual representation, self-supervised learning, active learning, contrastive representation learning

Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti and Anna Korhonen, Goran Glavaš

Keywords Abstract Paper

Merging Statistical Feature via Adaptive Gate for Improved Text Classification

Xianming Li, Zongxi Li, Haoran Xie, Qing Li

Keywords Abstract Paper

Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data

Lingkai Kong, Haoming Jiang, Yuchen Zhuang and Jie Lyu, Tuo Zhao, Chao Zhang

Keywords Abstract Paper

augmented training, in-distribution calibration, text classification, expectation error

Adversarially Robust Representations with Smooth Encoders

Taylan Cemgil, Sumedh Ghaisas, Krishnamurthy (Dj) Dvijotham, Pushmeet Kohli

Keywords Abstract Paper

Adversarial Learning, Robust Representations, Variational AutoEncoder, Wasserstein Distance, Variational Inference

Towards Debiasing Sentence Representations

Paul Pu Liang, Irene Mengze Li, Emily Zheng and Yao Chong Lim, Ruslan Salakhutdinov, Louis-Philippe Morency

Keywords Abstract Paper

Debiasing Representations, real-world scenarios, legal systems, debiasing

Max-Margin Incremental CCG Parsing

Miloš Stanojević, Mark Steedman

Keywords Abstract Paper

Incremental parsing, human processing, ASR, MT

Towards Robustness Against Natural Language Word Substitutions

Xinshuai Dong, Anh Tuan Luu, Rongrong Ji, Hong Liu

Keywords Abstract Paper

Adversarial Defense, Natural Language Processing

Residual Energy-Based Models for Text Generation

Yuntian Deng, Anton Bakhtin, Myle Ott and Arthur Szlam, Marc'Aurelio Ranzato

Keywords Abstract Paper

energy-based models, text generation

Keywords Paper

Keywords Paper

Keywords Paper

Xinghao Yang, Weifeng Liu, James Bailey and
Dacheng Tao, Wei Liu

Keywords Paper

Keywords Paper

Pan Xie, Zhi Cui, Xiuying Chen and
XiaoHui Hu, Jianwei Cui, Bin Wang

Keywords Paper

Zihan Liu, Genta I Winata, Samuel Cahyawijaya and
Andrea Madotto, Zhaojiang Lin, Pascale Fung

Keywords Paper

Keywords Paper

Wenju Zhang, Xiang Zhang, Qing Liao and
Wenjing Yang, Long Lan, Zhigang Luo

Keywords Paper

Keywords Paper

Keywords Paper

Daniel Vassimon Manela, David Errington, Thomas Fisher and
Boris Breugel, Pasquale Minervini

Keywords Paper

Keywords Paper

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti and
Anna Korhonen, Goran Glavaš

Keywords Paper

Keywords Paper

Lingkai Kong, Haoming Jiang, Yuchen Zhuang and
Jie Lyu, Tuo Zhao, Chao Zhang

Keywords Paper

Keywords Paper

Paul Pu Liang, Irene Mengze Li, Emily Zheng and
Yao Chong Lim, Ruslan Salakhutdinov, Louis-Philippe Morency

Keywords Paper

Keywords Paper

Keywords Paper

Yuntian Deng, Anton Bakhtin, Myle Ott and
Arthur Szlam, Marc'Aurelio Ranzato

Keywords Paper

Zhihong Chen, Chao Chen, Zhaowei Cheng and
Boyuan Jiang, Ke Fang, Xinyu Jin

Keywords Paper

Qian-Wen Zhang, Ximing Zhang, Zhao Yan and
Ruifang Liu, Yunbo Cao, Min-Ling Zhang

Keywords Paper

Keywords Paper

Jieyu Zhao, Subhabrata Mukherjee, Saghar Hosseini and
Kai-Wei Chang, Ahmed Hassan Awadallah

Keywords Paper

Mengxi Jia, Xinhua Cheng, Yunpeng Zhai and
Shijian Lu, Siwei Ma, Yonghong Tian, Jian Zhang

Keywords Paper

Keywords Paper

Keywords Paper

Mandela Patrick, Po-Yao Huang, Yuki Asano and
Florian Metze, Alexander G Hauptmann, Joao F. Henriques, Andrea Vedaldi

Keywords Paper

Keywords Paper

Keywords Paper

Joe Stacey, Pasquale Minervini, Haim Dubossarsky and
Sebastian Riedel, Tim Rocktäschel

Keywords Paper

Shubao Liu, Ke-Yue Zhang, Taiping Yao and
Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Yuan Xie, Lizhuang Ma

Keywords Paper

Keywords Paper

Keywords Paper

Edgar Schoenfeld, Vadim Sushko, Dan Zhang and
Juergen Gall, Bernt Schiele, Anna Khoreva

Keywords Paper