Adversarial stylometry in the wild: Transferable lexical substitution attacks on author profiling

Abstract: Written language contains stylistic cues that can be exploited to automatically infer a variety of potentially sensitive author information. Adversarial stylometry intends to attack such models by rewriting an author’s text. Our research proposes several components to facilitate deployment of these adversarial attacks in the wild, where neither data nor target models are accessible. We introduce a transformer-based extension of a lexical replacement attack, and show it achieves high transferability when trained on a weakly labeled corpus—decreasing target model performance below chance. While not completely inconspicuous, our more successful attacks also prove notably less detectable by humans. Our framework therefore provides a promising direction for future privacy-preserving adversarial attacks.

16/11/2020

Adversarial stylometry in the wild: Transferable lexical substitution attacks on author profiling

Chris Emmery, Ákos Kádár, Grzegorz Chrupała

Comments

Similar Papers

BERT-ATTACK: Adversarial Attack Against BERT Using BERT

Linyang Li, Ruotian Ma, Qipeng Guo and Xiangyang Xue, Xipeng Qiu

Keywords Abstract Paper

adversarial attacks, downstream tasks, calculation, gradient-based methods

BESA: BERT-based Simulated Annealing for Adversarial Text Attacks

Xinghao Yang, Weifeng Liu, Dacheng Tao, Wei Liu

Keywords Abstract Paper

Machine Learning, Adversarial Machine Learning, Natural Language Processing

Bigram and Unigram Based Text Attack via Adaptive Monotonic Heuristic Search

Xinghao Yang, Weifeng Liu, James Bailey and Dacheng Tao, Wei Liu

Keywords Abstract Paper

Word-level Textual Adversarial Attacking as Combinatorial Optimization

Yuan Zang, Fanchao Qi, Chenghao Yang and Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun

Keywords Abstract Paper

Textual attacking, Word-level attacking, combinatorial problem, Word-level Attacking

Large Scale Author Obfuscation Using Siamese Variational Auto-Encoder: The SiamAO System

Chakaveh Saedi, Mark Dras

Keywords Abstract Paper

The Curious Case of Neural Text Degeneration

Ari Holtzman, Jan Buys, Li Du and Maxwell Forbes, Yejin Choi

Keywords Abstract Paper

generation, text, NLG, NLP, natural language, natural language generation, language model, neural, neural language model

BAE: BERT-based Adversarial Examples for Text Classification

Siddhant Garg, Goutham Ramakrishnan

Keywords Abstract Paper

nlp, generating examples, automatic evaluations, modern models

T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack

Boxin Wang, Hengzhi Pei, Boyuan Pan and Qian Chen, Shuohang Wang, Bo Li

Keywords Abstract Paper

adversarial generation, nlp tasks, sentiment analysis, qa

Revisiting adversarially learned injection attacks against recommender systems

Jiaxi Tang, Hongyi Wen, Ke Wang

Keywords Abstract Paper

Recommender System, Security and Privacy, Adversarial Machine Learning

CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation

Tianlu Wang, Xuezhi Wang, Yao Qin and Ben Packer, Kang Li, Jilin Chen, Alex Beutel, Ed Chi

Keywords Abstract Paper

sentiment classification, model re-training, nlp models, cat-gen model

Paraphrase Generation by Learning How to Edit from Samples

Amirhossein Kazemnejad, Mohammadreza Salehi, Mahdieh Soleymani Baghshah

Keywords Abstract Paper

Paraphrase Generation, Neural sequence, sequence generation, retrieval-based method

Adversarial Attack and Defense of Structured Prediction Models

Wenjuan Han, Liwen Zhang, Yong Jiang, Kewei Tu

Keywords Abstract Paper

adversarial attacks, classification problems, structured tasks, nlp tasks

Improving Adversarial Text Generation by Modeling the Distant Future

Ruiyi Zhang, Changyou Chen, Zhe Gan and Wenlin Wang, Dinghan Shen, Guoyin Wang, Zheng Wen, Lawrence Carin

Keywords Abstract Paper

Adversarial Generation, long generation, next-word prediction, generator optimization

Generating Label Cohesive and Well-Formed Adversarial Claims

Pepa Atanasova, Dustin Wright, Isabelle Augenstein

Keywords Abstract Paper

inference tasks, fact checking, universal generation, adversarial attacks

Lower Your Guards: A Compositional Pattern-Match Coverage Checker

Sebastian Graf, Simon Peyton Jones, Ryan Scott

Keywords Abstract Paper

guards, Haskell, pattern matching, strictness

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords Abstract Paper

Natural Inference, data augmentation, Robustifying Models, deep models

Educating Text Autoencoders: Latent Representation Guidance via Denoising

Tianxiao Shen, Jonas Mueller, Regina Barzilay, Tommi Jaakkola

Keywords Abstract Paper

Deep Learning - Generative Models and Autoencoders

Sketch and Customize: A Counterfactual Story Generator

Changying Hao, Liang Pang, Yanyan Lan and Yan Wang, Jiafeng Guo, Xueqi Cheng

Keywords Abstract Paper

Generating CCG Categories

Yufang Liu, Tao Ji, Yuanbin Wu, Man Lan

Keywords Abstract Paper

Towards Robustness Against Natural Language Word Substitutions

Xinshuai Dong, Anh Tuan Luu, Rongrong Ji, Hong Liu

Keywords Abstract Paper

Adversarial Defense, Natural Language Processing

Linyang Li, Ruotian Ma, Qipeng Guo and
Xiangyang Xue, Xipeng Qiu

Keywords Paper

Keywords Paper

Xinghao Yang, Weifeng Liu, James Bailey and
Dacheng Tao, Wei Liu

Keywords Paper

Yuan Zang, Fanchao Qi, Chenghao Yang and
Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun

Keywords Paper

Keywords Paper

Ari Holtzman, Jan Buys, Li Du and
Maxwell Forbes, Yejin Choi

Keywords Paper

Keywords Paper

Boxin Wang, Hengzhi Pei, Boyuan Pan and
Qian Chen, Shuohang Wang, Bo Li

Keywords Paper

Keywords Paper

Tianlu Wang, Xuezhi Wang, Yao Qin and
Ben Packer, Kang Li, Jilin Chen, Alex Beutel, Ed Chi

Keywords Paper

Keywords Paper

Keywords Paper

Ruiyi Zhang, Changyou Chen, Zhe Gan and
Wenlin Wang, Dinghan Shen, Guoyin Wang, Zheng Wen, Lawrence Carin

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Changying Hao, Liang Pang, Yanyan Lan and
Yan Wang, Jiafeng Guo, Xueqi Cheng

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Vidhisha Balachandran, Artidoro Pagnoni, Jay Yoon Lee and
Dheeraj Rajagopal, Jaime Carbonell, Yulia Tsvetkov

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

Xiaoxiao Guo, Mo Yu, Yupeng Gao and
Chuang Gan, Murray Campbell, Shiyu Chang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zhenfang Chen, Peng Wang, Lin Ma and
Kwan-Yee K. Wong, Qi Wu

Keywords Paper

Guoxiu He, Zhe Gao, Zhuoren Jiang and
Yangyang Kang, Changlong Sun, Xiaozhong Liu, Wei Lu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper