BAE: BERT-based Adversarial Examples for Text Classification

Abstract: Modern text classification models are susceptible to adversarial examples, perturbed versions of the original text indiscernible by humans which get misclassified by the model. Recent works in NLP use rule-based synonym replacement strategies to generate adversarial examples. These strategies can lead to out-of-context and unnaturally complex token replacements, which are easily identifiable by humans. We present BAE, a black box attack for generating adversarial examples using contextual perturbations from a BERT masked language model. BAE replaces and inserts tokens in the original text by masking a portion of the text and leveraging the BERT-MLM to generate alternatives for the masked tokens. Through automatic and human evaluations, we show that BAE performs a stronger attack, in addition to generating adversarial examples with improved grammaticality and semantic coherence as compared to prior work.

16/11/2020

BAE: BERT-based Adversarial Examples for Text Classification

Siddhant Garg, Goutham Ramakrishnan

Comments

Similar Papers

Generating Label Cohesive and Well-Formed Adversarial Claims

Pepa Atanasova, Dustin Wright, Isabelle Augenstein

Keywords Abstract Paper

inference tasks, fact checking, universal generation, adversarial attacks

Improving Text Generation with Dynamic Masking and Recovering

Zhidong Liu, Junhui Li, Muhua Zhu

Keywords Abstract Paper

Natural Language Processing, Machine Translation, Natural Language Generation

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye and Xisen Jin, Xiang Ren

Keywords Abstract Paper

machine learning, fairness, language

BERT-ATTACK: Adversarial Attack Against BERT Using BERT

Linyang Li, Ruotian Ma, Qipeng Guo and Xiangyang Xue, Xipeng Qiu

Keywords Abstract Paper

adversarial attacks, downstream tasks, calculation, gradient-based methods

T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack

Boxin Wang, Hengzhi Pei, Boyuan Pan and Qian Chen, Shuohang Wang, Bo Li

Keywords Abstract Paper

adversarial generation, nlp tasks, sentiment analysis, qa

Neural Multi-task Text Normalization and Sanitization with Pointer-Generator

Hoang Nguyen, Sandro Cavallari

Keywords Abstract Paper

Understanding Neural Abstractive Summarization Models via Uncertainty

Jiacheng Xu, Shrey Desai, Greg Durrett

Keywords Abstract Paper

analyzing models, seqseq models, summarization decoders, pegasus

Towards Reversal-Based Textual Data Augmentation for NLI Problems with Opposable Classes

Alexey Tarasov

Keywords Abstract Paper

Adversarial stylometry in the wild: Transferable lexical substitution attacks on author profiling

Chris Emmery, Ákos Kádár, Grzegorz Chrupała

Keywords Abstract Paper

LIREx: Augmenting Language Inference with Relevant Explanations

Xinyan Zhao, V.G.Vinod Vydiswaran

Keywords Abstract Paper

Considering Likelihood in NLP Classiﬁcation Explanations with Occlusion and Language Modeling

David Harbecke, Christoph Alt

Keywords Abstract Paper

NLP, NLP Explanations, Language Modeling, NLP models

Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank

Eleftheria Briakou, Marine Carpuat

Keywords Abstract Paper

detecting content, cross-lingual nlp, machine problem, annotation

Refining Implicit Argument Annotation for UCCA

Ruixiang Cui, Daniel Hershcovich

Keywords Abstract Paper

Mixed Supervised Object Detection by Transferring Mask Prior and Semantic Similarity

Yan Liu, Zhijie Zhang, Li Niu and Junjie Chen, Liqing Zhang

Keywords Abstract Paper

vision, transfer learning

Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder

Daya Guo, Duyu Tang, Nan Duan and Jian Yin, Daxin Jiang, Ming Zhou

Keywords Abstract Paper

Evidence-Aware Generation, Generating texts, generation, generation texts

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search

Yuxuan Han, Jiaolong Yang, Ying Fu

Keywords Abstract Paper

Computer Vision, 2D and 3D Computer Vision, Explainable/Interpretable Machine Learning

Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

Ieva Staliūnaitė, Philip John Gorinski, Ignacio Iacobacci

Keywords Abstract Paper

CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation

Tianlu Wang, Xuezhi Wang, Yao Qin and Ben Packer, Kang Li, Jilin Chen, Alex Beutel, Ed Chi

Keywords Abstract Paper

sentiment classification, model re-training, nlp models, cat-gen model

MASKER: Masked Keyword Regularization for Reliable Text Classification

Seung Jun Moon, Sangwoo Mo, Kimin Lee and Jaeho Lee, Jinwoo Shin

Keywords Abstract Paper

FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization

Esin Durmus, He He, Mona Diab

Keywords Abstract Paper

Faithfulness Assessment, Abstractive Summarization, evaluating summary, reading comprehension

Reformulating Unsupervised Style Transfer as Paraphrase Generation

Kalpesh Krishna, John Wieting, Mohit Iyyer

Keywords Abstract Paper

Keywords Paper

Keywords Paper

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

Linyang Li, Ruotian Ma, Qipeng Guo and
Xiangyang Xue, Xipeng Qiu

Keywords Paper

Boxin Wang, Hengzhi Pei, Boyuan Pan and
Qian Chen, Shuohang Wang, Bo Li

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yan Liu, Zhijie Zhang, Li Niu and
Junjie Chen, Liqing Zhang

Keywords Paper

Daya Guo, Duyu Tang, Nan Duan and
Jian Yin, Daxin Jiang, Ming Zhou

Keywords Paper

Keywords Paper

Keywords Paper

Tianlu Wang, Xuezhi Wang, Yao Qin and
Ben Packer, Kang Li, Jilin Chen, Alex Beutel, Ed Chi

Keywords Paper

Seung Jun Moon, Sangwoo Mo, Kimin Lee and
Jaeho Lee, Jinwoo Shin

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yubo Chen, Chuhan Wu, Tao Qi and
Zhigang Yuan, Yongfeng Huang

Keywords Paper

Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara and
Chenhui Chu, Yuki Arase

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Guangfeng Yan, Lu Fan, Qimai Li and
Han Liu, Xiaotong Zhang, Xiao-Ming Wu, Albert Y.S. Lam

Keywords Paper

Keywords Paper

Yue Dong, Shuohang Wang, Zhe Gan and
Yu Cheng, Jackie Chi Kit Cheung, Jingjing Liu

Keywords Paper

Adhiguna Kuncoro, Lingpeng Kong, Daniel Fried and
Dani Yogatama, Laura Rimell, Chris Dyer, Phil Blunsom

Keywords Paper

Xiangpeng Wei, Rongxiang Weng, Yue Hu and
Luxi Xing, Heng Yu, Weihua Luo

Keywords Paper

Marius Mosbach, Stefania Degaetano-Ortlieb, Marie-Pauline Krielke and
Badr M. Abdullah, Dietrich Klakow

Keywords Paper

Vidhisha Balachandran, Artidoro Pagnoni, Jay Yoon Lee and
Dheeraj Rajagopal, Jaime Carbonell, Yulia Tsvetkov

Keywords Paper

Wentao Ma, Yiming Cui, Chenglei Si and
Ting Liu, Shijin Wang, Guoping Hu

Keywords Paper

Keywords Paper