Abusive Language Detection in Heterogeneous Contexts: Dataset Collection and the Role of Supervised Attention

02/02/2021

Abusive Language Detection in Heterogeneous Contexts: Dataset Collection and the Role of Supervised Attention

Hongyu Gong, Alberto Valido, Katherine M. Ingram, Giulia Fanti, Suma Bhat, Dorothy L. Espelage

Keywords:

Abstract Paper Similar Papers

Abstract: Abusive language is a massive problem in online social platforms. Existing abusive language detection techniques are particularly ill-suited to comments containing heterogeneous abusive language patterns, i.e., both abusive and non-abusive parts. This is due in part to the lack of datasets that explicitly annotate heterogeneity in abusive language. We tackle this challenge by providing an annotated dataset of abusive language in over 11,000 comments from YouTube. We account for heterogeneity in this dataset by separately annotating both the comment as a whole and the individual sentences that comprise each comment. We then propose an algorithm that uses a supervised attention mechanism to detect and categorize abusive content using multi-task learning. We empirically demonstrate the challenges of using traditional techniques on heterogeneous content and the comparative gains in performance of the proposed approach over state-of-the-art methods.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38951027

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

19/04/2021

“are you kidding me?”: Detecting unpalatable questions on Reddit

Sunyam Bagga, Andrew Piper, Derek Ruths

Keywords Paper

0

0

0

0

11:46

04/07/2020

Joint Modelling of Emotion and Abusive Language Detection

Santhosh Rajamanickam, Pushkar Mishra, Helen Yannakoudakis, Ekaterina Shutova

Keywords Paper

Joint Detection, abuse detection, abusive detection, multi-task framework

0

0

0

0

11:16

19/08/2021

Adapting Meta Knowledge with Heterogeneous Information Network for COVID-19 Themed Malicious Repository Detection

Yiyue Qian, Yiming Zhang, Yanfang Ye, Chuxu Zhang

Keywords Paper

Multidisciplinary Topics and Applications, Security and Privacy, Classification, Mining Graphs, Semi Structured Data, Complex Data

0

0

0

0

13:28

14/09/2020

PS3: Partition-based Skew-Specialized Sampling for Batch Mode Active Learning in Imbalanced Text Data

Ricky Fajri, Samaneh Khoshrou, Robert Peharz, Mykola Pechenizkiy

Keywords Paper

batch-mode active learning, imbalance data, hate-speech recognition

0

0

0

0

15:16

25/07/2020

Learning to transfer graph embeddings for inductive graph based recommendation

Le Wu, Yonghui Yang, Lei Chen and
Defu Lian, Richang Hong, Meng Wang

Keywords Paper

graph neural network, content based recommendation, inductive graph learning

0

0

0

0

15:15

07/06/2021

Discovering and Categorising Language Biases in Reddit

Xavier Ferrer, Tom Van Nuenen, Jose M. Such, Natalia Criado

Keywords Paper

Qualitative and quantitative studies of social media, Social network analysis, communities identification, expertise and authority discovery, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analy

0

0

0

0

8:03

07/06/2020

Aggressive, Repetitive, Intentional, Visible, and Imbalanced: Refining Representations for Cyberbullying Classification

Caleb Ziems, Ymir Vigfusson, Fred Morstatter

Keywords Paper

behaviors, cases, classification, classifiers, communities, detection, factors, large_scale, learning, linguistic, linguistic aspects, networks, performance, representations

0

0

1

0

9:53

16/11/2020

Hate-Speech and Offensive Language Detection in Roman Urdu

Hammad Rizwan, Muhammad Haroon Shakeel, Asim Karim

Keywords Paper

automatic detection, hate-speech detection, language models, transfer learning

0

0

0

0

10:55

08/12/2020

Towards Preemptive Detection of Depression and Anxiety in Twitter

David Owen, Jose Camacho-Collados, Luis Espinosa Anke

Keywords Paper

0

0

0

0

8:15

08/12/2020

Team Oulu at SemEval-2020 Task 12: Multilingual Identification of Offensive Language, Type and Target of Twitter Post Using Translated Datasets

Md Saroar Jahan

Keywords Paper

0

0

0

0

10:36

07/06/2021

“Call me sexist, but...” : Revisiting Sexism Detection Using Psychological Scales and Adversarial Samples

Mattia Samory, Indira Sen, Julian Kohne and
Fabian Flöck, Claudia Wagner

Keywords Paper

Psychological, personality-based and ethnographic studies of social media, Qualitative and quantitative studies of social media, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of social

0

0

0

0

8:00

02/02/2021

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Ziyi Liu, Le Wang, Wei Tang and
Junsong Yuan, Nanning Zheng, Gang Hua

Keywords Paper

0

0

0

0

19:49

14/06/2020

Learning Meta Face Recognition in Unseen Domains

Jianzhu Guo, Xiangyu Zhu, Chenxu Zhao and
Dong Cao, Zhen Lei, Stan Z. Li

Keywords Paper

face recognition, meta learning, domain generalization, metric learning

0

0

0

0

5:01

07/06/2021

You Don't Know How I Feel: Insider-Outsider Perspective Gaps in Cyberbullying Risk Detection

Seunghyun Kim, Afsaneh Razi, Gianluca Stringhini and
Pamela J. Wisniewski, Munmun De Choudhury

Keywords Paper

Qualitative and quantitative studies of social media, Human computer interaction, social media tools, navigation and visualization, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of soc

0

0

0

0

7:05

19/10/2020

Detection of novel social bots by ensembles of specialized classifiers

Mohsen Sayyadiharikandeh, Onur Varol, Kai-Cheng Yang and
Alessandro Flammini, Filippo Menczer

Keywords Paper

social bots, recall, social media, machine learning, cross-domain

0

0

0

0

10:01

07/06/2020

Empirical Analysis of Multi-Task Learning for Reducing Identity Bias in Toxic Comment Detection

Ameya Vaidya, Feng Mai, Yue Ning

Keywords Paper

attention, bias, deep learning, detection, groups, identities, learning, sources, toxic, toxicity

0

0

0

0

9:59

22/09/2020

Revisiting adversarially learned injection attacks against recommender systems

Jiaxi Tang, Hongyi Wen, Ke Wang

Keywords Paper

Recommender System, Security and Privacy, Adversarial Machine Learning

0

0

0

0

2:13

14/09/2020

A Deep Dive into Multilingual Hate Speech Classification

Sai Saketh Aluru, Binny Mathew, Punyajoy Saha, Animesh Mukherjee

Keywords Paper

hate speech, multilingual, classification, bert, embeddings

0

0

0

0

14:20

02/02/2021

Non-Autoregressive Coarse-to-Fine Video Captioning

Bang Yang, Yuexian Zou, Fenglin Liu, Can Zhang

Keywords Paper

0

0

0

0

18:21

25/07/2020

DVGAN: A minimax game for search result diversification combining explicit and implicit features

Jiongnan Liu, Zhicheng Dou, Xiaojie Wang and
Shuqi Lu, Ji-Rong Wen

Keywords Paper

generative adversarial network, search result diversification

0

0

0

0

12:46

04/07/2020

Integrating Semantic and Structural Information with Graph Convolutional Network for Controversy Detection

Lei Zhong, Juan Cao, Qiang Sheng and
Junbo Guo, Ziang Wang

Keywords Paper

Controversy Detection, Identifying posts, mining sentiment, assessing events

0

0

0

0

10:32

07/06/2021

Misinformation Adoption or Rejection in the Era of COVID-19

Maxwell Weinzierl, Suellen Hopfer, Sanda M. Harabagiu

Keywords Paper

Qualitative and quantitative studies of social media, Credibility of online content, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of social media behavior, Organizational and group be

0

0

0

0

7:51

12/08/2020

A Tale of Two Headers: A Formal Analysis of Inconsistent Click-Jacking Protection on the Web

Stefano Calzavara, Sebastian Roth, Alvise Rabitti and
Michael Backes, Ben Stock

Keywords Paper

0

0

0

0

12:00

25/07/2020

Think beyond the word: Understanding the implied textual meaning by digesting context, local, and noise

Guoxiu He, Zhe Gao, Zhuoren Jiang and
Yangyang Kang, Changlong Sun, Xiaozhong Liu, Wei Lu

Keywords Paper

deep neural networks, text classification, semantic representation, implied textual meaning

0

0

0

0

19:57

19/04/2021

“laughing at you or with you”: The role of sarcasm in shaping the disagreement space

Debanjan Ghosh, Ritvik Shrivastava, Smaranda Muresan

Keywords Paper

0

0

0

0

10:54

26/04/2020

Adversarially Robust Representations with Smooth Encoders

Taylan Cemgil, Sumedh Ghaisas, Krishnamurthy (Dj) Dvijotham, Pushmeet Kohli

Keywords Paper

Adversarial Learning, Robust Representations, Variational AutoEncoder, Wasserstein Distance, Variational Inference

0

0

0

0

5:16

02/02/2021

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

Binny Mathew, Punyajoy Saha, Seid Muhie Yimam and
Chris Biemann, Pawan Goyal, Animesh Mukherjee

Keywords Paper

0

0

0

0

18:43

07/09/2020

On Modality Bias in the TVQA Dataset

Thomas Winterbottom, Sarah Xiao, Alistair McLean, Noura Al Moubayed

Keywords Paper

Multimodality, Unimodal Bias, Dataset Bias, TVQA, Video-QA, BERT, Bilinear Pooling, TVQA+

0

0

0

0

10:02

30/11/2020

Learning to Adapt to Unseen Abnormal Activities under Weak Supervision

JaeYoo Park, Junha Kim, Bohyung Han

Keywords Paper

0

0

0

0

8:23

08/12/2020

Is it Great or Terrible? Preserving Sentiment in Neural Machine Translation of Arabic Reviews

Hadeel Saadany, Constantin Orasan

Keywords Paper

0

0

0

0

14:35

06/12/2021

CLIP-It! Language-Guided Video Summarization

Medhini Narasimhan, Anna Rohrbach, Trevor Darrell

Keywords Paper

transformers

0

0

0

0

6:14

06/12/2020

The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes

Douwe Kiela, Hamed Firooz, Aravind Mohan and
Vedanuj Goswami, Amanpreet Singh, Pratik Ringshia, Davide Testuggine

Keywords Paper

0

0

0

0

3:18

01/07/2020

A Metric Learning Approach to Misogyny Categorization

Juan Manuel Coria, Sahar Ghannay, Sophie Rosset, Hervé Bredin

Keywords Paper

0

0

0

0

4:45

14/06/2020

MetaIQA: Deep Meta-Learning for No-Reference Image Quality Assessment

Hancheng Zhu, Leida Li, Jinjian Wu and
Weisheng Dong, Guangming Shi

Keywords Paper

image quality assessment, convolutional neural networks, gradient optimization-based deep meta-learning, highly generalizable

0

0

0

0

0:57

22/11/2021

A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark

Zhenxi Zhu, Limin Wang, Sheng Guo, Gangshan Wu

Keywords Paper

few-shot learning, classifier-based baseline, new benchmark, action recognition

0

0

0

0

2:58

06/12/2021

Curriculum Disentangled Recommendation with Noisy Multi-feedback

Hong Chen, Yudong Chen, Xin Wang and
Ruobing Xie, Rui Wang, Feng Xia, Wenwu Zhu

Keywords Paper

representation learning, interpretability

0

0

0

0

6:03

19/08/2021

Tool- and Domain-Agnostic Parameterization of Style Transfer Effects Leveraging Pretrained Perceptual Metrics

Hiromu Yakura, Yuki Koyama, Masataka Goto

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Human-AI Collaboration, Intelligent User Interfaces

0

0

0

0

12:00

25/07/2020

Sampling bias due to near-duplicates in learning to rank

Maik Fröbe, Janek Bevendorff, Jan Heinrich Reimer and
Martin Potthast, Matthias Hagen

Keywords Paper

near-duplicate-detection, selection bias, learning to rank, novelty principle

0

0

0

0

10:59

14/09/2020

Early Detection of Fake News with Multi-Source Weak Social Supervision

Kai Shu, Guoqing Zheng, Yichuan Li and
Subhabrata Mukherjee, Ahmed Hassan Awadallah, Scott Ruston, Huan Liu

Keywords Paper

fake news, weak social supervision, meta learning

0

0

0

0

14:03

04/07/2020

Reasoning with Multimodal Sarcastic Tweets via Modeling Cross-Modality Contrast and Semantic Association

Nan Xu, Zhixiong Zeng, Wenji Mao

Keywords Paper

Reasoning, sarcasm, multimodal detection, Sarcasm

0

0

0

0

10:57