Generating Natural Language Attacks in a Hard Label Black Box Setting

02/02/2021

Rishabh Maheshwary, Saket Maheshwary, Vikram Pudi

Abstract: We study the important and challenging task of attacking natural language processing models in a hard label black box setting. We propose a decision-based attack strategy that crafts high-quality adversarial examples on text classification and entailment tasks. Our proposed attack strategy leverages a population-based optimization algorithm to craft plausible and semantically similar adversarial examples by observing only the top label predicted by the target model. At each iteration, the optimization procedure allows word replacements that maximize the overall semantic similarity between the original and the adversarial text. Further, our approach does not rely on substitute models or any kind of training data. We demonstrate the efficacy of our proposed approach through extensive experimentation and ablation studies on five state-of-the-art target models across seven benchmark datasets. In comparison to attacks proposed in prior literature, we achieve a higher success rate with a lower word perturbation percentage, even in a highly restricted setting.
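The idea in the abstract (a population-based search that sees only the top predicted label and prefers candidates closest to the original text) can be illustrated with a small sketch. This is not the authors' implementation: the classifier `top_label`, the `SYNONYMS` table, and the word-overlap `similarity` score are hypothetical toy stand-ins, and the mutation and crossover steps are one plausible reading of "population-based optimization", assumed here for illustration only.

```python
import random

# Hypothetical toy stand-ins; none of these come from the paper.
SYNONYMS = {
    "good": ["fine", "decent"],
    "great": ["grand", "big"],
    "movie": ["film", "picture"],
    "plot": ["story", "storyline"],
}
POSITIVE = {"good", "great"}


def top_label(words):
    """Hard-label oracle: returns only the predicted class, never scores."""
    return "pos" if any(w in POSITIVE for w in words) else "neg"


def similarity(original, candidate):
    """Toy proxy for semantic similarity: fraction of unchanged words."""
    same = sum(o == c for o, c in zip(original, candidate))
    return same / len(original)


def random_init(original, label, rng, tries=200):
    """Substitute words at random until the top label flips."""
    for _ in range(tries):
        cand = [
            rng.choice(SYNONYMS[w]) if (w in SYNONYMS and rng.random() < 0.5) else w
            for w in original
        ]
        if top_label(cand) != label:
            return cand
    return None


def mutate(original, cand, label, rng):
    """Restore one substituted word, keeping the change only if still adversarial."""
    changed = [i for i, (o, c) in enumerate(zip(original, cand)) if o != c]
    if not changed:
        return cand
    i = rng.choice(changed)
    child = cand[:i] + [original[i]] + cand[i + 1:]
    return child if top_label(child) != label else cand


def crossover(a, b, rng):
    """Pick each word from one of the two parents at random."""
    return [rng.choice(pair) for pair in zip(a, b)]


def attack(original, pop_size=8, generations=10, seed=0):
    rng = random.Random(seed)
    label = top_label(original)
    seeds = (random_init(original, label, rng) for _ in range(pop_size))
    population = [c for c in seeds if c is not None]
    if not population:
        return None
    for _ in range(generations):
        children = []
        for _ in range(pop_size):
            child = crossover(rng.choice(population), rng.choice(population), rng)
            child = mutate(original, child, label, rng)
            if top_label(child) != label:  # keep only adversarial children
                children.append(child)
        # Elitist selection: keep the candidates most similar to the original.
        ranked = sorted(population + children,
                        key=lambda c: similarity(original, c), reverse=True)
        population = ranked[:pop_size]
    return population[0]
```

A call such as `attack("the movie was good and the plot was great".split())` returns a word list that the toy classifier labels differently from the input. Every candidate kept in the population is adversarial, so selection only ever trades off how close the candidate stays to the original text, which is the role semantic similarity plays in the abstract.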

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38948632

The talk and the respective paper were published at the AAAI 2021 virtual conference.

Similar Papers

19/08/2021

BESA: BERT-based Simulated Annealing for Adversarial Text Attacks

Xinghao Yang, Weifeng Liu, Dacheng Tao, Wei Liu

Keywords: Machine Learning, Adversarial Machine Learning, Natural Language Processing

14:01

08/12/2020

Fine-grained Information Status Classification Using Discourse Context-Aware BERT

Yufang Hou

13:13

06/12/2020

Learning Black-Box Attackers with Transferable Priors and Query Feedback

Jiancheng YANG, Yangzhou Jiang, Xiaoyang Huang, Bingbing Ni, Chenglong Zhao

3:22

18/07/2021

Backdoor Scanning for Deep Neural Networks through K-Arm Optimization

Guangyu Shen, Yingqi Liu, Guanhong Tao, Shengwei An, Qiuling Xu, Siyuan Cheng, Shiqing Ma, Xiangyu Zhang

Keywords: Social Aspects of Machine Learning, Privacy, Anonymity, and Security

5:12

02/02/2021

Re-TACRED: Addressing Shortcomings of the TACRED Dataset

George Stoica, Emmanouil Antonios Platanios, Barnabas Poczos

16:45

16/11/2020

Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!

Jack Hessel, Lillian Lee

Keywords: modeling interactions, multimodal tasks, visual answering, multimodal learning

12:02

14/06/2020

Counterfactual Samples Synthesizing for Robust Visual Question Answering

Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, Shiliang Pu, Yueting Zhuang

Keywords: visual question answering, counterfactual, debias, language bias, data augmentation, visual-and-language

1:01

16/11/2020

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference

Jianguo Zhang, Kazuma Hashimoto, Wenhao Liu, Chien-Sheng Wu, Yao Wan, Philip Yu, Richard Socher, Caiming Xiong

Keywords: intent detection, detecting intents, oos detection, large-scale task

11:43

30/11/2020

Towards Robust Fine-grained Recognition by Maximal Separation of Discriminative Features

Krishna Kanth Nakka, Mathieu Salzmann

9:28

26/04/2020

I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifiers Adaptively

Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma

Keywords: model comparison

4:53

04/07/2020

Word-level Textual Adversarial Attacking as Combinatorial Optimization

Yuan Zang, Fanchao Qi, Chenghao Yang, Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun

Keywords: Textual attacking, Word-level attacking, combinatorial problem

9:34

02/02/2021

Bigram and Unigram Based Text Attack via Adaptive Monotonic Heuristic Search

Xinghao Yang, Weifeng Liu, James Bailey, Dacheng Tao, Wei Liu

17:17

14/06/2020

Multi-Scale Interactive Network for Salient Object Detection

Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Keywords: saliency detection, salient object detection, feature interaction strategy, scale-insensitive loss, multi-scale features, multi-level features, fully convolutional network, deep learning

1:01

16/11/2020

XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning

Edoardo Maria Ponti, Goran Glavaš, Olga Majewska, Qianchu Liu, Ivan Vulić, Anna Korhonen

Keywords: machine reasoning, cross-lingual transfer, causal reasoning, multilingual pretraining

9:31

02/02/2021

Do Response Selection Models Really Know What’s Next? Utterance Manipulation Strategies for Multi-turn Response Selection

Taesun Whang, Dongyub Lee, Dongsuk Oh, Chanhee Lee, Kijong Han, Dong-hun Lee, Saebyeok Lee

17:37

02/02/2021

EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet Allocation

Qi Zhou, Haipeng Chen, Yitao Zheng, Zhen Wang

19:28

02/02/2021

Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection

Alexander Podolskiy, Dmitry Lipin, Andrey Bout, Ekaterina Artemova, Irina Piontkovskaya

16:08

05/01/2021

Class-Wise Metric Scaling for Improved Few-Shot Classification

Ge Liu, Linglan Zhao, Wei Li, Dashan Guo, Xiangzhong Fang

5:01

02/02/2021

MANGO: A Mask Attention Guided One-Stage Scene Text Spotter

Liang Qiao, Ying Chen, Zhanzhan Cheng, Yunlu Xu, Yi Niu, Shiliang Pu, Fei Wu

16:32

04/07/2020

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords: Natural Inference, data augmentation, Robustifying Models, deep models

11:34

26/04/2020

Sign Bits Are All You Need for Black-Box Attacks

Abdullah Al-Dujaili, Una-May O'Reilly

Keywords: Black-box adversarial attack models, Deep Nets, Adversarial Examples, Black-Box Optimization, Zeroth-Order Optimization

3:50

06/12/2021

Few-Shot Object Detection via Association and DIscrimination

Yuhang Cao, Jiaqi Wang, Ying Jin, Tong Wu, Kai Chen, Ziwei Liu, Dahua Lin

Keywords: deep learning, machine learning, vision

10:31

03/05/2021

Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples

Ziang Yan, Yiwen Guo, Jian Liang, Changshui Zhang

Keywords: hard-label attack, adversarial attack, black-box attack, reinforcement learning

4:55

03/05/2021

Towards Robustness Against Natural Language Word Substitutions

Xinshuai Dong, Anh Tuan Luu, Rongrong Ji, Hong Liu

Keywords: Adversarial Defense, Natural Language Processing

6:06

04/07/2020

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords: Cross-Lingual Classification, sentiment classification, unsupervised system, classification

12:23

06/12/2021

Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning

Jongjin Park, Younggyo Seo, Chang Liu, Li Zhao, Tao Qin, Jinwoo Shin, Tie-Yan Liu

Keywords: reinforcement learning and planning, causality

12:12

02/02/2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan, Bin Wang, Yinghui Xu, Rong Jin

15:15

14/09/2020

Using Error Decay Prediction to Overcome Practical Issues of Deep Active Learning for Named Entity Recognition

Haw-Shiuan Chang, Shankar Vembu, Sunil Mohan, Rheeya Uppaal, Andrew McCallum

15:03

02/02/2021

Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning

Elad Amrani, Rami Ben-Ari, Daniel Rotman, Alex Bronstein

14:04

02/02/2021

Generating CCG Categories

Yufang Liu, Tao Ji, Yuanbin Wu, Man Lan

15:20

14/09/2020

FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems

Lu Chen, Jiao Sun, Wei Xu

Keywords: watermark, ocr model, targeted white-box attack

15:14

03/05/2021

Sample-Efficient Automated Deep Reinforcement Learning

Jörg Franke, Gregor Koehler, André Biedenkapp, Frank Hutter

Keywords: Neuroevolution, Hyperparameter Optimization, Deep Reinforcement Learning, AutoRL

4:36

02/02/2021

Interpretable NLG for Task-oriented Dialogue Systems with Heterogeneous Rendering Machines

Yangming Li, Kaisheng Yao

17:05

02/02/2021

Error-Correcting Output Codes with Ensemble Diversity for Robust Learning in Neural Networks

Yang Song, Qiyu Kang, Wee Peng Tay

15:04

18/07/2021

Delving into Deep Imbalanced Regression

Yuzhe Yang, Kaiwen Zha, YINGCONG CHEN, Hao Wang, Dina Katabi

Keywords: Applications

16:37

14/06/2020

DR Loss: Improving Object Detection by Distributional Ranking

Qi Qian, Lei Chen, Hao Li, Rong Jin

Keywords: object detection, imbalance, distributional ranking loss

1:01

04/07/2020

LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation

Dong-Ho Lee, Rahul Khanna, Bill Yuchen Lin, Seyeon Lee, Qinyuan Ye, Elizabeth Boschee, Leonardo Neves, Xiang Ren

Keywords: sequence tasks, NLP tasks, named recognition, relation extraction

11:44

26/04/2020

Query-efficient Meta Attack to Deep Neural Networks

Jiawei Du, Hu Zhang, Joey Tianyi Zhou, Yi Yang, Jiashi Feng

Keywords: Adversarial attack, Meta learning

5:17

12/07/2020

Dual-Path Distillation: A Unified Framework to Improve Black-Box Attacks

Yonggang Zhang, Ya Li, Tongliang Liu, Xinmei Tian

Keywords: Adversarial Examples

11:33

16/11/2020

Named Entity Recognition Only from Word Embeddings

Ying Luo, Hai Zhao, Junlang Zhan

Keywords: named recognition, entity detection, type prediction, deep models

9:54