Detecting Gender Stereotypes: Lexicon vs. Supervised Learning Methods

Abstract: Biases in language influence how we interact with each other and society at large. Language affirming gender stereotypes is often observed in various contexts today, from recommendation letters and Wikipedia entries to fiction novels and movie dialogue. Yet to date, there is little agreement on the methodology to quantify gender stereotypes in natural language (specifically the English language). Common methodology (including those adopted by companies tasked with detecting gender bias) rely on a lexicon approach largely based on the original BSRI study from 1974.In this paper, we reexamine the role of gender stereotype detection in the context of modern tools, by comparatively analyzing efficacy of lexicon-based approaches and end-to-end, ML-based approaches prevalent in state-of-the-art natural language processing systems. Our efforts using a large dataset show that even compared to an updated lexicon-based approach, end-to-end classification approaches are significantly more robust and accurate, even when trained by moderately sized corpora.

Detecting Gender Stereotypes: Lexicon vs. Supervised Learning Methods

Jenna Cryan, Shiliang Tang, Xinyi Zhang, Miriam Metzger, Haitao Zheng, Ben Zhao

Comments

Similar Papers

Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models

Hannah Rose Kirk, yennie jun, Filippo Volpin and Haider Iqbal, Elias Benussi, Frederic Dreyer, Aleksandar Shtedritski, Yuki Asano

Keywords Abstract Paper

language

Machine translationese: Effects of algorithmic bias on linguistic complexity in machine translation

Eva Vanmassenhove, Dimitar Shterionov, Matthew Gwilliam

Keywords Abstract Paper

Towards Debiasing Sentence Representations

Paul Pu Liang, Irene Mengze Li, Emily Zheng and Yao Chong Lim, Ruslan Salakhutdinov, Louis-Philippe Morency

Keywords Abstract Paper

Debiasing Representations, real-world scenarios, legal systems, debiasing

Monolingual and Multilingual Reduction of Gender Bias in Contextualized Representations

Sheng Liang, Philipp Dufter, Hinrich Schütze

Keywords Abstract Paper

Measuring Societal Biases from Text Corpora with Smoothed First-Order Co-occurrence

Navid Rekabsaz, Robert West, James Henderson, Allan Hanbury

Keywords Abstract Paper

Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of social media behavior, Text categorization, topic recognition, demographic/gender/age identification

Semi-Supervised Topic Modeling for Gender Bias Discovery in English and Swedish

Hannah Devinney, Jenny Björklund, Henrik Björklund

Keywords Abstract Paper

LINSPECTOR: Multilingual Probing Tasks for Word Representations

Gözde Gül Sahin, Clara Vania, Ilia Kuznetsov, Iryna Gurevych

Keywords Abstract Paper

Word Representations, NLP, classification tasks, probing tasks

Seeing the World through Text: Evaluating Image Descriptions for Commonsense Reasoning in Machine Reading Comprehension

Diana Galvan-Sosa, Jun Suzuki, Kyosuke Nishida and Koji Matsuda, Kentaro Inui

Keywords Abstract Paper

Stereotype and skew: Quantifying gender bias in pre-trained and fine-tuned language models

Daniel Vassimon Manela, David Errington, Thomas Fisher and Boris Breugel, Pasquale Minervini

Keywords Abstract Paper

Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation

Tianlu Wang, Xi Victoria Lin, Nazneen Fatema Rajani and Bryan McCann, Vicente Ordonez, Caiming Xiong

Keywords Abstract Paper

Tailoring Embeddings, Gender Mitigation, Double-Hard Debias, downstream models

The Gap on Gap: Tackling the Problem of Differing Data Distributions in Bias-Measuring Datasets

Vid Kocijan, Oana-Maria Camburu, Thomas Lukasiewicz

Keywords Abstract Paper

FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age for Bias Measurement and Mitigation

Kimmo Karkkainen, Jungseock Joo

Keywords Abstract Paper

PowerTransformer: Unsupervised Controllable Revision for Biased Language Correction

Xinyao Ma, Maarten Sap, Hannah Rashkin, Yejin Choi

Keywords Abstract Paper

bias correction, controllable debiasing, revision task, powertransformer

Detecting Independent Pronoun Bias with Partially-Synthetic Data Generation

Robert Munro, Alex (Carmen) Morrison

Keywords Abstract Paper

measuring models, parsers, language models, machine models

Generating similes effortlessly like a Pro: A Style Transfer Approach for Simile Generation

Tuhin Chakrabarty, Smaranda Muresan, Nanyun Peng

Keywords Abstract Paper

human imagination, simile generation, mapping properties, sequence model

XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning

Edoardo Maria Ponti, Goran Glavaš, Olga Majewska and Qianchu Liu, Ivan Vulić, Anna Korhonen

Keywords Abstract Paper

machine reasoning, cross-lingual transfer, causal reasoning, multilingual pretraining

Breeding Gender-aware Direct Speech Translation Systems

Marco Gaido, Beatrice Savoldi, Luisa Bentivogli and Matteo Negri, Marco Turchi

Keywords Abstract Paper

Type B Reflexivization as an Unambiguous Testbed for Multilingual Multi-Task Gender Bias

Ana Valeria González, Maria Barrett, Rasmus Hvingelby and Kellie Webster, Anders Søgaard

Keywords Abstract Paper

nlp tasks, russian, gender bias, coreferential reading

Toward Gender-Inclusive Coreference Resolution

Yang Trista Cao, Hal Daumé III

Keywords Abstract Paper

Gender-Inclusive Resolution, interrogating annotations, coreference systems, systemic biases

Towards Understanding and Mitigating Social Biases in Language Models

Paul Liang, Chiyu Wu, Louis-Philippe Morency, Russ Salakhutdinov

Keywords Abstract Paper

Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

The Sensitivity of Language Models and Humans to Winograd Schema Perturbations

Mostafa Abdou, Vinit Ravishankar, Maria Barrett and Yonatan Belinkov, Desmond Elliott, Anders Søgaard

Keywords Abstract Paper

human understanding, Language Models, Winograd Perturbations, Large-scale models

Hannah Rose Kirk, yennie jun, Filippo Volpin and
Haider Iqbal, Elias Benussi, Frederic Dreyer, Aleksandar Shtedritski, Yuki Asano

Keywords Paper

Keywords Paper

Paul Pu Liang, Irene Mengze Li, Emily Zheng and
Yao Chong Lim, Ruslan Salakhutdinov, Louis-Philippe Morency

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Diana Galvan-Sosa, Jun Suzuki, Kyosuke Nishida and
Koji Matsuda, Kentaro Inui

Keywords Paper

Daniel Vassimon Manela, David Errington, Thomas Fisher and
Boris Breugel, Pasquale Minervini

Keywords Paper

Tianlu Wang, Xi Victoria Lin, Nazneen Fatema Rajani and
Bryan McCann, Vicente Ordonez, Caiming Xiong

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Edoardo Maria Ponti, Goran Glavaš, Olga Majewska and
Qianchu Liu, Ivan Vulić, Anna Korhonen

Keywords Paper

Marco Gaido, Beatrice Savoldi, Luisa Bentivogli and
Matteo Negri, Marco Turchi

Keywords Paper

Ana Valeria González, Maria Barrett, Rasmus Hvingelby and
Kellie Webster, Anders Søgaard

Keywords Paper

Keywords Paper

Keywords Paper

Mostafa Abdou, Vinit Ravishankar, Maria Barrett and
Yonatan Belinkov, Desmond Elliott, Anders Søgaard

Keywords Paper

Nicolas Garneau, Mareike Hartmann, Anders Sandholm and
Sebastian Ruder, Ivan Vulić, Anders Søgaard

Keywords Paper

Emily Dinan, Angela Fan, Ledell Wu and
Jason Weston, Douwe Kiela, Adina Williams

Keywords Paper

Keywords Paper

Keywords Paper

Taesun Whang, Dongyub Lee, Dongsuk Oh and
Chanhee Lee, Kijong Han, Dong-hun Lee, Saebyeok Lee

Keywords Paper

Keywords Paper

Keywords Paper

Luisa Bentivogli, Beatrice Savoldi, Matteo Negri and
Mattia A. Di Gangi, Roldano Cattoni, Marco Turchi

Keywords Paper

Vered Shwartz, Peter West, Ronan Le Bras and
Chandra Bhagavatula, Yejin Choi

Keywords Paper

Keywords Paper

Mingda Li, Xinyue Liu, Weitong Ruan and
Luca Soldaini, Wael Hamza, Chengwei Su

Keywords Paper

Keywords Paper

Keywords Paper

Zhiqi Huang, Fenglin Liu, Xian Wu and
Shen Ge, Helin Wang, Wei Fan, Yuexian Zou

Keywords Paper

Keywords Paper