Exploiting emojis for abusive language detection

19/04/2021

Exploiting emojis for abusive language detection

Michael Wiegand, Josef Ruppenhofer

Keywords:

Abstract Paper Similar Papers

Abstract: We propose to use abusive emojis, such as the “middle finger” or “face vomiting”, as a proxy for learning a lexicon of abusive words. Since it represents extralinguistic information, a single emoji can co-occur with different forms of explicitly abusive utterances. We show that our approach generates a lexicon that offers the same performance in cross-domain classification of abusive microposts as the most advanced lexicon induction method. Such an approach, in contrast, is dependent on manually annotated seed words and expensive lexical resources for bootstrapping (e.g. WordNet). We demonstrate that the same emojis can also be effectively used in languages other than English. Finally, we also show that emojis can be exploited for classifying mentions of ambiguous words, such as “fuck” and “bitch”, into generally abusive and just profane usages.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EACL 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

25/07/2020

Think beyond the word: Understanding the implied textual meaning by digesting context, local, and noise

Guoxiu He, Zhe Gao, Zhuoren Jiang and
Yangyang Kang, Changlong Sun, Xiaozhong Liu, Wei Lu

Keywords Paper

deep neural networks, text classification, semantic representation, implied textual meaning

0

0

0

0

19:57

19/04/2021

Implicitly abusive comparisons – a new dataset and linguistic analysis

Michael Wiegand, Maja Geulig, Josef Ruppenhofer

Keywords Paper

0

0

0

0

10:52

08/12/2020

ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation

Dario Stojanovski, Benno Krojer, Denis Peskov, Alexander Fraser

Keywords Paper

0

0

0

0

14:09

08/12/2020

ERRANT: Assessing and Improving Grammatical Error Type Classification

Katerina Korre, John Pavlopoulos

Keywords Paper

0

0

0

0

8:37

07/06/2021

Discovering and Categorising Language Biases in Reddit

Xavier Ferrer, Tom Van Nuenen, Jose M. Such, Natalia Criado

Keywords Paper

Qualitative and quantitative studies of social media, Social network analysis, communities identification, expertise and authority discovery, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analy

0

0

0

0

8:03

04/07/2020

Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension

Fei Yuan, Linjun Shou, Xuanyu Bai and
Ming Gong, Yaobo Liang, Nan Duan, Yan Fu, Daxin Jiang

Keywords Paper

Multilingual Comprehension, multilingual MRC, MRC, sentence tasks

0

0

0

0

8:30

07/06/2020

A Framework for Political Portmanteau Decomposition

Nabil Hossain, Minh Tran, Henry Kautz

Keywords Paper

building, detection, hate speech, linguistic, political, spread, terms, traditional, words

0

0

0

0

3:12

16/11/2020

Hate-Speech and Offensive Language Detection in Roman Urdu

Hammad Rizwan, Muhammad Haroon Shakeel, Asim Karim

Keywords Paper

automatic detection, hate-speech detection, language models, transfer learning

0

0

0

0

10:55

02/02/2021

The Gap on Gap: Tackling the Problem of Differing Data Distributions in Bias-Measuring Datasets

Vid Kocijan, Oana-Maria Camburu, Thomas Lukasiewicz

Keywords Paper

0

0

0

0

15:13

04/07/2020

Probing for Referential Information in Language Models

Ionut-Teodor Sorodoc, Kristina Gulordava, Gemma Boleda

Keywords Paper

Probing, probe tasks, Language Models, LSTM architectures

0

0

0

0

11:31

16/11/2020

BAE: BERT-based Adversarial Examples for Text Classification

Siddhant Garg, Goutham Ramakrishnan

Keywords Paper

nlp, generating examples, automatic evaluations, modern models

0

0

0

0

6:45

14/09/2020

A Deep Dive into Multilingual Hate Speech Classification

Sai Saketh Aluru, Binny Mathew, Punyajoy Saha, Animesh Mukherjee

Keywords Paper

hate speech, multilingual, classification, bert, embeddings

0

0

0

0

14:20

16/11/2020

Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets

Nedjma Ousidhoum, Yangqiu Song, Dit-Yan Yeung

Keywords Paper

classification, data process, topic models, selection bias

0

0

0

0

12:07

01/07/2020

Challenges in Emotion Style Transfer: An Exploration with a Lexical Substitution Pipeline

David Helbig, Enrica Troiano, Roman Klinger

Keywords Paper

0

0

0

0

17:44

04/07/2020

Contextualizing Hate Speech Classifiers with Post-hoc Explanation

Brendan Kennedy, Xisen Jin, Aida Mostafazadeh Davani and
Morteza Dehghani, Xiang Ren

Keywords Paper

Contextualizing Classifiers, Post-hoc Explanation, Hate classifiers, fine-tuned classifiers

1

1

0

0

7:09

01/07/2020

A Metric Learning Approach to Misogyny Categorization

Juan Manuel Coria, Sahar Ghannay, Sophie Rosset, Hervé Bredin

Keywords Paper

0

0

0

0

4:45

04/07/2020

Should All Cross-Lingual Embeddings Speak English?

Antonios Anastasopoulos, Graham Neubig

Keywords Paper

cross-lingual embeddings, lexicon tagging, lexicon dictionaries, cross-lingual baselines

0

0

0

0

9:25

08/12/2020

Informative Manual Evaluation of Machine Translation Output

Maja Popović

Keywords Paper

0

0

0

0

15:26

01/07/2020

Supertagging with CCG primitives

Aditya Bhargava, Gerald Penn

Keywords Paper

0

0

0

0

5:00

02/02/2021

Contextualized Rewriting for Text Summarization

Guangsheng Bao, Yue Zhang

Keywords Paper

0

0

0

0

17:38

01/07/2020

Sarcasm Detection using Context Separators in Online Discourse

Tanvi Dadu, Kartikey Pant

Keywords Paper

0

0

0

0

4:15

02/02/2021

Abusive Language Detection in Heterogeneous Contexts: Dataset Collection and the Role of Supervised Attention

Hongyu Gong, Alberto Valido, Katherine M. Ingram and
Giulia Fanti, Suma Bhat, Dorothy L. Espelage

Keywords Paper

0

0

0

0

15:07

19/04/2021

“laughing at you or with you”: The role of sarcasm in shaping the disagreement space

Debanjan Ghosh, Ritvik Shrivastava, Smaranda Muresan

Keywords Paper

0

0

0

0

10:54

19/04/2021

“are you kidding me?”: Detecting unpalatable questions on Reddit

Sunyam Bagga, Andrew Piper, Derek Ruths

Keywords Paper

0

0

0

0

11:46

08/12/2020

Monolingual and Multilingual Reduction of Gender Bias in Contextualized Representations

Sheng Liang, Philipp Dufter, Hinrich Schütze

Keywords Paper

0

0

0

0

14:20

16/11/2020

BERT-ATTACK: Adversarial Attack Against BERT Using BERT

Linyang Li, Ruotian Ma, Qipeng Guo and
Xiangyang Xue, Xipeng Qiu

Keywords Paper

adversarial attacks, downstream tasks, calculation, gradient-based methods

0

0

0

0

11:36

19/04/2021

Dictionary-based debiasing of pre-trained word embeddings

Masahiro Kaneko, Danushka Bollegala

Keywords Paper

0

0

0

0

8:20

04/07/2020

LINSPECTOR: Multilingual Probing Tasks for Word Representations

Gözde Gül Sahin, Clara Vania, Ilia Kuznetsov, Iryna Gurevych

Keywords Paper

Word Representations, NLP, classification tasks, probing tasks

0

0

0

0

11:51

04/07/2020

Joint Modelling of Emotion and Abusive Language Detection

Santhosh Rajamanickam, Pushkar Mishra, Helen Yannakoudakis, Ekaterina Shutova

Keywords Paper

Joint Detection, abuse detection, abusive detection, multi-task framework

0

0

0

0

11:16

04/07/2020

Cross-Linguistic Syntactic Evaluation of Word Prediction Models

Aaron Mueller, Garrett Nicolai, Panayiota Petrou-Zeniou and
Natalia Talmina, Tal Linzen

Keywords Paper

Cross-Linguistic Syntax, Syntax, Cross-Linguistic Models, neural models

0

0

0

0

10:48

04/07/2020

Unknown Intent Detection Using Gaussian Mixture Model with an Application to Zero-shot Intent Classification

Guangfeng Yan, Lu Fan, Qimai Li and
Han Liu, Xiaotong Zhang, Xiao-Ming Wu, Albert Y.S. Lam

Keywords Paper

Unknown Detection, Zero-shot Classification, User classification, dialogue systems

0

0

0

0

10:27

16/11/2020

Multi-resolution Annotations for Emoji Prediction

Weicheng Ma, Ruibo Liu, Lili Wang, Soroush Vosoughi

Keywords Paper

natural tasks, emojis, linguistic components, multi-class setting

0

0

0

0

11:52

06/12/2021

BARTScore: Evaluating Generated Text as Text Generation

Weizhe Yuan, Graham Neubig, Pengfei Liu

Keywords Paper

0

0

0

0

13:47

03/05/2021

Probing BERT in Hyperbolic Spaces

Boli Chen, Yao Fu, Guangwei Xu and
Pengjun Xie, Chuanqi Tan, Mosha Chen, Liping Jing

Keywords Paper

Sentiment, Syntax, Probe, BERT, Hyperbolic

0

0

0

0

5:10

04/07/2020

Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation

Tianlu Wang, Xi Victoria Lin, Nazneen Fatema Rajani and
Bryan McCann, Vicente Ordonez, Caiming Xiong

Keywords Paper

Tailoring Embeddings, Gender Mitigation, Double-Hard Debias, downstream models

0

0

0

0

11:04

16/11/2020

Multi-Dimensional Gender Bias Classification

Emily Dinan, Angela Fan, Ledell Wu and
Jason Weston, Douwe Kiela, Adina Williams

Keywords Paper

detecting bias, machine models, nlp models, fine-grained framework

0

0

0

0

12:02

08/12/2020

Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for Low Resource Languages

Efsun Sarioglu Kayi, Linyong Nan, Bohan Qu and
Mona Diab, Kathleen McKeown

Keywords Paper

0

0

0

0

14:37

08/12/2020

Refining Implicit Argument Annotation for UCCA

Ruixiang Cui, Daniel Hershcovich

Keywords Paper

0

0

0

0

11:38

19/04/2021

Machine translationese: Effects of algorithmic bias on linguistic complexity in machine translation

Eva Vanmassenhove, Dimitar Shterionov, Matthew Gwilliam

Keywords Paper

0

0

0

0

11:19

08/12/2020

Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer’s Disease Detection

Zhiqiang Guo, Zhaoci Liu, Zhenhua Ling and
Shijin Wang, Lingjing Jin, Yunxia Li

Keywords Paper

0

0

0

0

13:12