A Framework for Political Portmanteau Decomposition

07/06/2020

Nabil Hossain, Minh Tran, Henry Kautz

Keywords: building, detection, hate speech, linguistic, political, spread, terms, traditional, words

Abstract: Portmanteaus are new words formed by combining the sounds and meanings of two words. Given their sticky nature, portmanteaus are often used to create political and personal attacks by combining a target entity with derogatory terms, which can then be spread online to promote hate speech and defamation. In this paper, we present a framework to decompose political portmanteaus used online into their component words. Using our annotated dataset of political portmanteaus, we train a system that decomposes 76.2% of the political portmanteaus into their component words. Furthermore, for 93.4% of the political portmanteaus, our system finds the correct component words in its top 10 results, suggesting that better ranking methods can lead to stronger results. This work provides a framework both for understanding an intriguing linguistic phenomenon and for building hate-speech filters that could catch novel words that would bypass traditional hate speech detection approaches.
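To make the task concrete, here is a minimal sketch of portmanteau decomposition and top-k scoring. It is not the authors' system, whose design the abstract does not describe. It assumes a naive string-overlap heuristic: a blend is explained by a word pair if a prefix of the blend comes from the first word and a suffix comes from the second. The function names (`decompose`, `top_k_hit`), the toy vocabulary, and the scoring rule are all hypothetical.

```python
from itertools import permutations
from typing import List, Tuple


def shared_prefix_len(a: str, b: str) -> int:
    """Length of the longest common prefix of a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def decompose(blend: str, vocabulary: List[str], k: int = 10) -> List[Tuple[str, str]]:
    """Rank ordered word pairs whose prefix and suffix together cover the blend.

    Assumed heuristic: score = shared prefix length with the first word
    plus shared suffix length with the second word.
    """
    blend = blend.lower()
    scored = []
    for first, second in permutations(vocabulary, 2):
        p = shared_prefix_len(blend, first)
        # Shared suffix length, computed as a shared prefix of the reversed strings.
        s = shared_prefix_len(blend[::-1], second[::-1])
        if p > 0 and s > 0 and p + s >= len(blend):
            scored.append((p + s, first, second))
    scored.sort(key=lambda t: -t[0])  # highest overlap first; ties keep input order
    return [(first, second) for _, first, second in scored[:k]]


def top_k_hit(ranked: List[Tuple[str, str]], gold: Tuple[str, str], k: int) -> bool:
    """True if the gold component pair appears among the first k candidates."""
    return gold in ranked[:k]


vocab = ["breakfast", "lunch", "smoke", "fog", "motor", "hotel"]
print(decompose("brunch", vocab))  # [('breakfast', 'lunch')]
```

Averaging `top_k_hit` over an annotated dataset with k=1 and k=10 would give figures of the same kind as the 76.2% and 93.4% reported in the abstract. A real system would need a far larger vocabulary and a learned ranking model rather than raw character overlap.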

The talk and the corresponding paper were published at the ICWSM 2020 virtual conference.

Similar Papers

04/07/2020

Analyzing Political Parody in Social Media

Antonios Maronikolakis, Danae Sánchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras

Keywords: comedic purposes, computational parody, automatically tweets, fact checking

11:30

19/04/2021

Exploiting emojis for abusive language detection

Michael Wiegand, Josef Ruppenhofer

11:18

16/11/2020

Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets

Nedjma Ousidhoum, Yangqiu Song, Dit-Yan Yeung

Keywords: classification, data process, topic models, selection bias

12:07

04/07/2020

Paraphrase-Sense-Tagged Sentences

Anne Cocos, Chris Callison-Burch

Keywords: natural tasks, ranking sentences, hypernym prediction, sense-aware models

9:29

16/11/2020

CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models

Nikita Nangia, Clara Vania, Rasika Bhalerao, Samuel R. Bowman

Keywords: nlp tasks, pretrained models, masked models, mlms

10:56

14/06/2020

Global-Local GCN: Large-Scale Label Noise Cleansing for Face Recognition

Yaobin Zhang, Weihong Deng, Mei Wang, Jiani Hu, Xian Li, Dongyue Zhao, Dongchao Wen

Keywords: face recognition, label noise, graph convolutional network, global-local

1:00

04/07/2020

Unknown Intent Detection Using Gaussian Mixture Model with an Application to Zero-shot Intent Classification

Guangfeng Yan, Lu Fan, Qimai Li, Han Liu, Xiaotong Zhang, Xiao-Ming Wu, Albert Y.S. Lam

Keywords: Unknown Detection, Zero-shot Classification, User classification, dialogue systems

10:27

01/07/2020

Supertagging with CCG primitives

Aditya Bhargava, Gerald Penn

5:00

08/12/2020

Exploring Cross-sentence Contexts for Named Entity Recognition with BERT

Jouni Luoma, Sampo Pyysalo

14:39

04/07/2020

ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations

Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot, Lucia Specia

Keywords: Tuning Models, rewriting transformations, automatic simplification, splitting

12:11

14/09/2020

The Temporal Dictionary Ensemble (TDE) Classifier for Time Series Classification

Matthew Middlehurst, James Large, Gavin Cawley, Anthony Bagnall

Keywords: time series, classification, bag of words, hive-cote

14:49

16/11/2020

BAE: BERT-based Adversarial Examples for Text Classification

Siddhant Garg, Goutham Ramakrishnan

Keywords: nlp, generating examples, automatic evaluations, modern models

6:45

04/07/2020

A Retrieve-and-Rewrite Initialization Method for Unsupervised Machine Translation

Shuo Ren, Yu Wu, Shujie Liu, Ming Zhou, Shuai Ma

Keywords: Unsupervised Translation, translation, Retrieve-and-Rewrite Method, translation models

6:31

16/11/2020

Homophonic Pun Generation with Lexically Constrained Rewriting

Zhiwei Yu, Hongyu Zang, Xiaojun Wan

Keywords: punning, error analysis, computational models, homophones

7:03

04/07/2020

Joint Modelling of Emotion and Abusive Language Detection

Santhosh Rajamanickam, Pushkar Mishra, Helen Yannakoudakis, Ekaterina Shutova

Keywords: Joint Detection, abuse detection, abusive detection, multi-task framework

11:16

16/11/2020

Generating similes effortlessly like a Pro: A Style Transfer Approach for Simile Generation

Tuhin Chakrabarty, Smaranda Muresan, Nanyun Peng

Keywords: human imagination, simile generation, mapping properties, sequence model

11:11

08/12/2020

Native-like Expression Identification by Contrasting Native and Proficient Second Language Speakers

Oleksandr Harust, Yugo Murawaki, Sadao Kurohashi

14:13

05/01/2021

Facial Emotion Recognition With Noisy Multi-Task Annotations

Siwei Zhang, Zhiwu Huang, Danda Pani Paudel, Luc Van Gool

4:48

19/04/2021

WER-BERT: Automatic WER estimation with BERT in a balanced ordinal classification paradigm

Akshay Krishna Sheshadri, Anvesh Rao Vijjini, Sukhdeep Kharbanda

11:45

07/06/2021

Measuring Societal Biases from Text Corpora with Smoothed First-Order Co-occurrence

Navid Rekabsaz, Robert West, James Henderson, Allan Hanbury

Keywords: Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of social media behavior, Text categorization, topic recognition, demographic/gender/age identification

8:05

01/07/2020

Filtering conversations through dialogue acts labels for improving corpus-based convergence studies

Simone Fuscone, Benoit Favre, Laurent Prévot

7:58

07/06/2021

Political Depolarization of News Articles Using Attribute-Aware Word Embeddings

Ruibo Liu, Lili Wang, Chenyan Jia, Soroush Vosoughi

Keywords: Qualitative and quantitative studies of social media, Trust, reputation, recommendation systems, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of social media behavior, Measuring predi

6:25

04/07/2020

Phonetic and Visual Priors for Decipherment of Informal Romanization

Maria Ryskina, Matthew R. Gormley, Taylor Berg-Kirkpatrick

Keywords: Decipherment Romanization, Informal romanization, idiosyncratic process, noisy-channel model

11:47

08/12/2020

Team Oulu at SemEval-2020 Task 12: Multilingual Identification of Offensive Language, Type and Target of Twitter Post Using Translated Datasets

Md Saroar Jahan

10:36

03/05/2021

On Learning Universal Representations Across Languages

Xiangpeng Wei, Rongxiang Weng, Yue Hu, Luxi Xing, Heng Yu, Weihua Luo

Keywords: hierarchical contrastive learning, cross-lingual pretraining, universal representation learning

3:51

06/12/2021

FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition

Yichong Leng, Xu Tan, Linchen Zhu, Jin Xu, Renqian Luo, Linquan Liu, Tao Qin, Xiangyang Li, Edward Lin, Tie-Yan Liu

13:44

19/10/2020

Learning to create better ads: Generation and ranking approaches for ad creative refinement

Shaunak Mishra, Manisha Verma, Yichao Zhou, Kapil Thadani, Wei Wang

Keywords: natural language processing, keyword ranking, computational advertising, advertisement generation

10:00

02/02/2021

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps

Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu

15:58

02/02/2021

Bigram and Unigram Based Text Attack via Adaptive Monotonic Heuristic Search

Xinghao Yang, Weifeng Liu, James Bailey, Dacheng Tao, Wei Liu

17:17

07/06/2021

Discovering and Categorising Language Biases in Reddit

Xavier Ferrer, Tom Van Nuenen, Jose M. Such, Natalia Criado

Keywords: Qualitative and quantitative studies of social media, Social network analysis, communities identification, expertise and authority discovery, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analy

8:03

08/12/2020

SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP

Katsuki Chousa, Masaaki Nagata, Masaaki Nishino

14:39

16/11/2020

Multilingual Offensive Language Identification with Cross-lingual Embeddings

Tharindu Ranasinghe, Marcos Zampieri

Keywords: bengali, cross-lingual embeddings, transfer learning, cyberaggression

7:00

04/07/2020

Contextualizing Hate Speech Classifiers with Post-hoc Explanation

Brendan Kennedy, Xisen Jin, Aida Mostafazadeh Davani, Morteza Dehghani, Xiang Ren

Keywords: Contextualizing Classifiers, Post-hoc Explanation, Hate classifiers, fine-tuned classifiers

7:09

14/09/2020

A Deep Dive into Multilingual Hate Speech Classification

Sai Saketh Aluru, Binny Mathew, Punyajoy Saha, Animesh Mukherjee

Keywords: hate speech, multilingual, classification, bert, embeddings

14:20

16/11/2020

Semantic Role Labeling Guided Multi-turn Dialogue ReWriter

Kun Xu, Haochen Tan, Linfeng Song, Han Wu, Haisong Zhang, Linqi Song, Dong Yu

Keywords: multi-turn rewriting, attentive models, semantic labeling, semantic

7:04

02/02/2021

The Gap on Gap: Tackling the Problem of Differing Data Distributions in Bias-Measuring Datasets

Vid Kocijan, Oana-Maria Camburu, Thomas Lukasiewicz

15:13

16/11/2020

Adversarial Attack and Defense of Structured Prediction Models

Wenjuan Han, Liwen Zhang, Yong Jiang, Kewei Tu

Keywords: adversarial attacks, classification problems, structured tasks, nlp tasks

11:06

08/12/2020

An analysis of language models for metaphor recognition

Arthur Neidlein, Philip Wiesenbach, Katja Markert

13:52

04/07/2020

"The Boating Store Had Its Best Sail Ever": Pronunciation-attentive Contextualized Pun Recognition

Yichao Zhou, Jyun-Yu Jiang, Jieyu Zhao, Kai-Wei Chang, Wei Wang

Keywords: Pronunciation-attentive Recognition, human languages, intelligence systems, pun tasks

11:46

04/07/2020

Large Scale Multi-Actor Generative Dialog Modeling

Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro

Keywords: Large Modeling, generation, style matching, automatic evaluations

11:49