Challenges in automated debiasing for toxic language detection

19/04/2021

Challenges in automated debiasing for toxic language detection

Xuhui Zhou, Maarten Sap, Swabha Swayamdipta, Yejin Choi, Noah Smith

Keywords:

Abstract Paper Similar Papers

Abstract: Biased associations have been a challenge in the development of classifiers for detecting toxic language, hindering both fairness and accuracy. As potential solutions, we investigate recently introduced debiasing methods for text classification datasets and models, as applied to toxic language detection. Our focus is on lexical (e.g., swear words, slurs, identity mentions) and dialectal markers (specifically African American English). Our comprehensive experiments establish that existing methods are limited in their ability to prevent biased behavior in current toxicity detectors. We then propose an automatic, dialect-aware data correction method, as a proof-of-concept. Despite the use of synthetic labels, this method reduces dialectal associations with toxicity. Overall, our findings show that debiasing a model trained on biased toxic language data is not as effective as simply relabeling the data to remove existing biases.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EACL 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

03/05/2021

Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition

Yangming Li, lemao liu, Shuming Shi

Keywords Paper

Negative Sampling, Unlabeled Entity Problem, Named Entity Recognition

0

0

0

1

4:49

16/11/2020

Fortifying Toxic Speech Detectors Against Veiled Toxicity

Xiaochuang Han, Yulia Tsvetkov

Keywords Paper

detecting toxicity, toxic detectors, toxic detector, disguised language

0

0

0

0

7:00

04/07/2020

Max-Margin Incremental CCG Parsing

Miloš Stanojević, Mark Steedman

Keywords Paper

Incremental parsing, human processing, ASR, MT

0

0

0

0

11:39

08/12/2020

Learning Domain Terms - Empirical Methods to Enhance Enterprise Text Analytics Performance

Gargi Roy, Lipika Dey, Mohammad Shakir, Tirthankar Dasgupta

Keywords Paper

0

0

0

0

14:36

26/04/2020

Learning The Difference That Makes A Difference With Counterfactually-Augmented Data

Divyansh Kaushik, Eduard Hovy, Zachary Lipton

Keywords Paper

humans in the loop, annotation artifacts, text classification, sentiment analysis, natural language inference

0

0

0

0

4:25

06/12/2021

Debiased Visual Question Answering from Feature and Sample Perspectives

Zhiquan Wen, Guanghui Xu, Mingkui Tan and
Qingyao Wu, Qi Wu

Keywords Paper

vision

0

0

0

0

11:20

02/02/2021

Label Confusion Learning to Enhance Text Classification Models

Biyang Guo, Songqiao Han, Xiao Han and
Hailiang Huang, Ting Lu

Keywords Paper

0

0

0

0

15:17

12/08/2020

TextShield: Robust Text Classification Based on Multimodal Embedding and Neural Machine Translation

Jinfeng Li, Tianyu Du, Shouling Ji and
Rong Zhang, Quan Lu, Min Yang, Ting Wang

Keywords Paper

0

0

0

0

11:32

04/07/2020

Moving Down the Long Tail of Word Sense Disambiguation with Gloss Informed Bi-encoders

Terra Blevins, Luke Zettlemoyer

Keywords Paper

Word Disambiguation, Word WSD, WSD, sense disambiguation

0

0

0

0

11:18

01/07/2020

Demoting Racial Bias in Hate Speech Detection

Mengzhou Xia, Anjalie Field, Yulia Tsvetkov

Keywords Paper

0

0

0

0

12:41

04/07/2020

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords Paper

Natural Inference, data augmentation, Robustifying Models, deep models

0

0

0

0

11:34

02/02/2021

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Zhao Wang, Aron Culotta

Keywords Paper

0

0

0

0

17:39

06/12/2021

Interactive Label Cleaning with Example-based Explanations

Stefano Teso, Andrea Bontempelli, Fausto Giunchiglia, Andrea Passerini

Keywords Paper

active learning

0

0

0

0

12:23

06/12/2021

A Theory-Driven Self-Labeling Refinement Method for Contrastive Representation Learning

Pan Zhou, Caiming Xiong, Xiaotong Yuan, Steven Chu Hong Hoi

Keywords Paper

theory, machine learning, self-supervised learning, contrastive learning, representation learning

0

0

0

0

14:12

14/09/2020

Using Error Decay Prediction to Overcome Practical Issues of Deep Active Learning for Named Entity Recognition

Haw-Shiuan Chang, Shankar Vembu, Sunil Mohan and
Rheeya Uppaal, Andrew McCallum

Keywords Paper

0

0

0

0

15:03

14/06/2020

Real-World Person Re-Identification via Degradation Invariance Learning

Yukun Huang, Zheng-Jun Zha, Xueyang Fu and
Richang Hong, Liang Li

Keywords Paper

disentangled representation learning, person re-identification, generative adversarial network, image degradation, self-supervised learning

0

0

0

0

1:01

08/12/2020

Is it Great or Terrible? Preserving Sentiment in Neural Machine Translation of Arabic Reviews

Hadeel Saadany, Constantin Orasan

Keywords Paper

0

0

0

0

14:35

04/07/2020

An Analysis of the Utility of Explicit Negative Examples to Improve the Syntactic Abilities of Neural Language Models

Hiroshi Noji, Hiroya Takamura

Keywords Paper

resolving agreement, Augmentation, Augmentation sentences, Syntactic Models

0

0

0

0

12:14

16/11/2020

Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data

Shachar Rosenman, Alon Jacovi, Yoav Goldberg

Keywords Paper

data process, re collection, sota models, tacred

0

0

0

0

5:55

01/07/2020

Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference

Cemil Cengiz, Deniz Yuret

Keywords Paper

0

0

0

0

4:38

06/12/2021

Can contrastive learning avoid shortcut solutions?

Joshua Robinson, Li Sun, Ke Yu and
Kayhan Batmanghelich, Stefanie Jegelka, Suvrit Sra

Keywords Paper

self-supervised learning, contrastive learning

0

0

0

0

12:45

03/05/2021

Beyond Categorical Label Representations for Image Classification

Boyuan Chen, Yu Li, Sunand Raghupathi, Hod Lipson

Keywords Paper

Representation Learning, Image Classification, Label Representation

0

0

0

0

3:26

04/07/2020

Towards Debiasing Sentence Representations

Paul Pu Liang, Irene Mengze Li, Emily Zheng and
Yao Chong Lim, Ruslan Salakhutdinov, Louis-Philippe Morency

Keywords Paper

Debiasing Representations, real-world scenarios, legal systems, debiasing

0

0

0

0

12:03

02/02/2021

SHOT-VAE: Semi-supervised Deep Generative Models With Label-aware ELBO Approximations

Hao-Zhe Feng, Kezhi Kong, Minghao Chen and
Tianye Zhang, Minfeng Zhu, Wei Chen

Keywords Paper

0

0

0

0

14:35

06/12/2021

Making a (Counterfactual) Difference One Rationale at a Time

Mitchell Plyler, Michael Green, Min Chi

Keywords Paper

theory, generative model, language, interpretability

0

0

0

0

13:57

14/06/2020

Counterfactual Samples Synthesizing for Robust Visual Question Answering

Long Chen, Xin Yan, Jun Xiao and
Hanwang Zhang, Shiliang Pu, Yueting Zhuang

Keywords Paper

visual question answering, counterfactual, debias, language bias, data augmentation, visual-and-language

0

0

0

0

1:01

04/07/2020

How Does Selective Mechanism Improve Self-Attention Networks?

Xinwei Geng, Longyue Wang, Xing Wang and
Bing Qin, Ting Liu, Zhaopeng Tu

Keywords Paper

NLP tasks, natural inference, semantic labelling, machine translation

0

0

0

0

11:43

19/04/2021

Removing word-level spurious alignment between images and pseudo-captions in unsupervised image captioning

Ukyo Honda, Yoshitaka Ushiku, Atsushi Hashimoto and
Taro Watanabe, Yuji Matsumoto

Keywords Paper

0

0

0

0

12:30

22/11/2021

SAGAN: Adversarial Spatial-asymmetric Attention for Noisy Nona-Bayer Reconstruction

S M A Sharif, Rizwan Ali Naqvi, Mithun Biswas

Keywords Paper

Nona-Bayer Reconstruction, Joint Demosicing and Denoising, JDD, Pixel-bin Sensor, Nona-Bayer Demosaicking, Nona-Bayer Denoising, Spatial Asymmetric Attention, SAGAN, Spatial-asymmetric Attention Module, Smartphone Image JDD

0

0

0

0

3:03

02/02/2021

MASKER: Masked Keyword Regularization for Reliable Text Classification

Seung Jun Moon, Sangwoo Mo, Kimin Lee and
Jaeho Lee, Jinwoo Shin

Keywords Paper

0

0

0

0

15:05

16/11/2020

Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding

Samson Tan, Shafiq Joty, Lav Varshney, Min-Yen Kan

Keywords Paper

comprehension, fine-tuning models, downstream tasks, nlp systems

0

0

0

0

10:22

12/07/2020

Fundamental Tradeoffs between Invariance and Sensitivity to Adversarial Perturbations

Florian Tramer, Jens Behrmann, Nicholas Carlini and
Nicolas Papernot, Joern-Henrik Jacobsen

Keywords Paper

Adversarial Examples

0

0

0

0

15:22

03/05/2021

You Only Need Adversarial Supervision for Semantic Image Synthesis

Edgar Schoenfeld, Vadim Sushko, Dan Zhang and
Juergen Gall, Bernt Schiele, Anna Khoreva

Keywords Paper

GANs, Semantic Image Synthesis, Image Generation, Deep Learning

0

0

0

0

5:11

19/04/2021

From toxicity in online comments to incivility in American news: Proceed with caution

Anushree Hede, Oshin Agarwal, Linda Lu and
Diana C. Mutz, Ani Nenkova

Keywords Paper

0

0

0

0

10:10

19/04/2021

Stereotype and skew: Quantifying gender bias in pre-trained and fine-tuned language models

Daniel Vassimon Manela, David Errington, Thomas Fisher and
Boris Breugel, Pasquale Minervini

Keywords Paper

0

0

0

0

6:54

07/06/2020

Feature-Based Explanations Don’t Help People Detect Misclassifications of Online Toxicity

Samuel Carton, Qiaozhu Mei, Paul Resnick

Keywords Paper

behaviors, changes, humans, impact, learning, measures, performance, predictions, terms, toxic, toxicity

0

0

0

0

10:24

03/05/2021

Explaining the Efficacy of Counterfactually Augmented Data

Divyansh Kaushik, Amrith Setlur, Eduard H Hovy, Zachary Lipton

Keywords Paper

sentiment analysis, text classification, natural language inference, annotation artifacts, humans in the loop

0

0

0

0

5:11

02/02/2021

Learning from Noisy Labels with Complementary Loss Functions

Deng-Bao Wang, Yong Wen, Lujia Pan, Min-Ling Zhang

Keywords Paper

0

0

0

0

14:00

16/11/2020

Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!

Jack Hessel, Lillian Lee

Keywords Paper

modeling interactions, multimodal tasks, visual answering, multimodal learning

0

0

0

0

12:02

11/10/2020

Data Cleansing with Contrastive Learning for Vocal Note Event Annotations

Gabriel Meseguer Brocal, Rachel Bittner, Simon Durand, Brian Brost

Keywords Paper

Domain knowledge, Machine learning/Artificial intelligence for music, Evaluation, datasets, and reproducibility, Novel datasets and use cases, MIR tasks, Music transcription and annotation

0

0

0

0

3:51