CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models

16/11/2020

CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models

Nikita Nangia, Clara Vania, Rasika Bhalerao, Samuel R. Bowman

Keywords: nlp tasks, pretrained models, masked models, mlms

Abstract Paper Similar Papers

Abstract: Pretrained language models, especially masked language models (MLMs) have seen success across many NLP tasks. However, there is ample evidence that they use the cultural biases that are undoubtedly present in the corpora they are trained on, implicitly creating harm with biased representations. To measure some forms of social bias in language models against protected demographic groups in the US, we introduce the Crowdsourced Stereotype Pairs benchmark (CrowS-Pairs). CrowS-Pairs has 1508 examples that cover stereotypes dealing with nine types of bias, like race, religion, and age. In CrowS-Pairs a model is presented with two sentences: one that is more stereotyping and another that is less stereotyping. The data focuses on stereotypes about historically disadvantaged groups and contrasts them with advantaged groups. We find that all three of the widely-used MLMs we evaluate substantially favor sentences that express stereotypes in every category in CrowS-Pairs. As work on building less biased models advances, this dataset can be used as a benchmark to evaluate progress.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EMNLP 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

Contextualizing Hate Speech Classifiers with Post-hoc Explanation

Brendan Kennedy, Xisen Jin, Aida Mostafazadeh Davani and
Morteza Dehghani, Xiang Ren

Keywords Paper

Contextualizing Classifiers, Post-hoc Explanation, Hate classifiers, fine-tuned classifiers

1

1

0

0

7:09

19/08/2021

An Examination of Fairness of AI Models for Deepfake Detection

Loc Trinh, Yan Liu

Keywords Paper

AI Ethics, Trust, Fairness, Fairness, Deep Learning, Biometrics, Face and Gesture Recognition

0

0

0

0

10:25

04/07/2020

Social Bias Frames: Reasoning about Social and Power Implications of Language

Maarten Sap, Saadia Gabriel, Lianhui Qin and
Dan Jurafsky, Noah A. Smith, Yejin Choi

Keywords Paper

Warning, large-scale evaluation, high-level categorization, Social Frames

0

0

0

0

11:02

08/12/2020

Monolingual and Multilingual Reduction of Gender Bias in Contextualized Representations

Sheng Liang, Philipp Dufter, Hinrich Schütze

Keywords Paper

0

0

0

0

14:20

05/01/2021

FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age for Bias Measurement and Mitigation

Kimmo Karkkainen, Jungseock Joo

Keywords Paper

0

0

0

0

4:44

19/04/2021

Machine translationese: Effects of algorithmic bias on linguistic complexity in machine translation

Eva Vanmassenhove, Dimitar Shterionov, Matthew Gwilliam

Keywords Paper

0

0

0

0

11:19

02/02/2021

The Gap on Gap: Tackling the Problem of Differing Data Distributions in Bias-Measuring Datasets

Vid Kocijan, Oana-Maria Camburu, Thomas Lukasiewicz

Keywords Paper

0

0

0

0

15:13

07/06/2021

Discovering and Categorising Language Biases in Reddit

Xavier Ferrer, Tom Van Nuenen, Jose M. Such, Natalia Criado

Keywords Paper

Qualitative and quantitative studies of social media, Social network analysis, communities identification, expertise and authority discovery, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analy

0

0

0

0

8:03

07/06/2021

Measuring Societal Biases from Text Corpora with Smoothed First-Order Co-occurrence

Navid Rekabsaz, Robert West, James Henderson, Allan Hanbury

Keywords Paper

Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of social media behavior, Text categorization, topic recognition, demographic/gender/age identification

0

0

0

0

8:05

04/07/2020

Demographics Should Not Be the Reason of Toxicity: Mitigating Discrimination in Text Classifications with Instance Weighting

Guanhua Zhang, Bing Bai, Junqi Zhang and
Kun Bai, Conghui Zhu, Tiejun Zhao

Keywords Paper

Mitigating Discrimination, Text Classifications, Discrimination, Instance Weighting

0

0

0

0

11:58

08/12/2020

An analysis of language models for metaphor recognition

Arthur Neidlein, Philip Wiesenbach, Katja Markert

Keywords Paper

0

0

0

0

13:52

16/11/2020

On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment

Zirui Wang, Zachary C. Lipton, Yulia Tsvetkov

Keywords Paper

multilingual models, meta-learning algorithm, multilingual representations, negative interference

0

0

0

0

12:03

08/12/2020

Assessing Polyseme Sense Similarity through Co-predication Acceptability and Contextualised Embedding Distance

Janosch Haber, Massimo Poesio

Keywords Paper

0

0

0

0

14:07

07/09/2020

From Saturation to Zero-Shot Visual Relationship Detection Using Local Context

Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos

Keywords Paper

Visual Relationship Detection, Scene Graph Generation, Zero-shot Classification, Local Context, Language Bias

0

0

0

0

7:17

14/09/2020

A Deep Dive into Multilingual Hate Speech Classification

Sai Saketh Aluru, Binny Mathew, Punyajoy Saha, Animesh Mukherjee

Keywords Paper

hate speech, multilingual, classification, bert, embeddings

0

0

0

0

14:20

19/08/2021

Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models

Maximilian Spliethöver, Henning Wachsmuth

Keywords Paper

AI Ethics, Trust, Fairness, Fairness, Societal Impact of AI, Natural Language Processing

0

0

0

0

12:59

16/11/2020

Speakers Fill Lexical Semantic Gaps with Context

Tiago Pimentel, Rowan Hall Maudslay, Damian Blasi, Ryan Cotterell

Keywords Paper

bert-based ambiguity, human annotation, lexical ambiguity, ambiguous words

0

0

0

0

10:05

16/11/2020

Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation

Emily Dinan, Angela Fan, Adina Williams and
Jack Urbanek, Douwe Kiela, Jason Weston

Keywords Paper

counterfactual augmentation, targeted collection, bias training, generative models

0

0

0

0

12:18

16/11/2020

Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank

Eleftheria Briakou, Marine Carpuat

Keywords Paper

detecting content, cross-lingual nlp, machine problem, annotation

0

0

0

0

11:06

19/04/2021

A unified feature representation for lexical connotations

Emily Allaway, Kathleen McKeown

Keywords Paper

0

0

0

0

12:07

16/11/2020

Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased Proximities in Word Embeddings

Vaibhav Kumar, Tenzin Bhotia, Vaibhav Kumar, Tanmoy Chakraborty

Keywords Paper

word embeddings, semantic words, coreference resolution, post-processing methods

0

0

0

0

11:56

04/07/2020

Feature Projection for Improved Text Classification

Qi Qin, Wenpeng Hu, Bing Liu

Keywords Paper

Text Classification, classification, sentiment classification, Bert classification

0

0

0

0

10:57

01/07/2020

BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection

Jihyung Moon, Won Ik Cho, Junbum Lee

Keywords Paper

0

0

0

0

10:25

08/12/2020

Learning to Decouple Relations: Few-Shot Relation Classification with Entity-Guided Attention and Confusion-Aware Training

Yingyao Wang, Junwei Bao, Guangyi Liu and
Youzheng Wu, Xiaodong He, Bowen Zhou, Tiejun Zhao

Keywords Paper

0

0

0

0

10:55

16/11/2020

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

Jian Guan, Minlie Huang

Keywords Paper

open-ended generation, story generation, evaluating generation, constructing samples

0

0

0

0

11:26

14/06/2020

Counterfactual Samples Synthesizing for Robust Visual Question Answering

Long Chen, Xin Yan, Jun Xiao and
Hanwang Zhang, Shiliang Pu, Yueting Zhuang

Keywords Paper

visual question answering, counterfactual, debias, language bias, data augmentation, visual-and-language

0

0

0

0

1:01

08/12/2020

Don’t Patronize Me! An Annotated Dataset with Patronizing and Condescending Language towards Vulnerable Communities

Carla Perez Almendros, Luis Espinosa Anke, Steven Schockaert

Keywords Paper

0

0

0

0

15:03

16/11/2020

Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA

Ieva Staliūnaitė, Ignacio Iacobacci

Keywords Paper

nlp tasks, conversational task, semantic labeling, contextualized embeddings

0

0

0

0

11:23

04/07/2020

Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer

Jieyu Zhao, Subhabrata Mukherjee, Saghar Hosseini and
Kai-Wei Chang, Ahmed Hassan Awadallah

Keywords Paper

cross-lingual transfer, multilingual embeddings, NLP applications, bias analysis

0

0

0

0

11:42

19/04/2021

Stereotype and skew: Quantifying gender bias in pre-trained and fine-tuned language models

Daniel Vassimon Manela, David Errington, Thomas Fisher and
Boris Breugel, Pasquale Minervini

Keywords Paper

0

0

0

0

6:54

16/11/2020

Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets

Nedjma Ousidhoum, Yangqiu Song, Dit-Yan Yeung

Keywords Paper

classification, data process, topic models, selection bias

0

0

0

0

12:07

18/07/2021

Towards Understanding and Mitigating Social Biases in Language Models

Paul Liang, Chiyu Wu, Louis-Philippe Morency, Russ Salakhutdinov

Keywords Paper

Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

0

0

0

0

5:43

02/02/2021

Mitigating Political Bias in Language Models through Reinforced Calibration

Ruibo Liu, Chenyan Jia, Jason Wei and
Guangxuan Xu, Lili Wang, Soroush Vosoughi

Keywords Paper

0

0

0

0

14:04

14/06/2020

Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning

Mei Wang, Weihong Deng

Keywords Paper

fairness, racial bias, face recognition, deep reinforcement learning, adaptive margin

0

0

0

0

1:00

07/06/2020

A Framework for Political Portmanteau Decomposition

Nabil Hossain, Minh Tran, Henry Kautz

Keywords Paper

building, detection, hate speech, linguistic, political, spread, terms, traditional, words

0

0

0

0

3:12

04/07/2020

Towards Debiasing Sentence Representations

Paul Pu Liang, Irene Mengze Li, Emily Zheng and
Yao Chong Lim, Ruslan Salakhutdinov, Louis-Philippe Morency

Keywords Paper

Debiasing Representations, real-world scenarios, legal systems, debiasing

0

0

0

0

12:03

01/07/2020

Demoting Racial Bias in Hate Speech Detection

Mengzhou Xia, Anjalie Field, Yulia Tsvetkov

Keywords Paper

0

0

0

0

12:41

06/12/2021

Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models

Hannah Rose Kirk, yennie jun, Filippo Volpin and
Haider Iqbal, Elias Benussi, Frederic Dreyer, Aleksandar Shtedritski, Yuki Asano

Keywords Paper

language

0

0

0

0

9:48

16/11/2020

An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models

Lifu Tu, Garima Lalwani, Spandana Gella, He He

Keywords Paper

generalization, natural inference, paraphrase identification, pre-trained models

0

0

0

0

11:55

18/07/2021

Fair Selective Classification Via Sufficiency

Joshua Lee, Yuheng Bu, Deepta Rajan and
Prasanna Sattigeri, Rameswar Panda, Subhro Das, Gregory Wornell

Keywords Paper

Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

0

0

0

0

18:20