Interpretability is a kind of safety: An interpreter-based ensemble for adversary defense

23/08/2020

Interpretability is a kind of safety: An interpreter-based ensemble for adversary defense

Jingyuan Wang, Yufan Wu, Mingxuan Li, Xin Lin, Junjie Wu, Chao Li

Keywords: DNN interpretation, adversarial example defense, ensemble

Abstract Paper Similar Papers

Abstract: While having achieved great success in rich real-life applications, deep neural network (DNN) models have long been criticized for their vulnerability to adversarial attacks. Tremendous research efforts have been dedicated to mitigating the threats of adversarial attacks, but the essential trait of adversarial examples is not yet clear, and most existing methods are yet vulnerable to hybrid attacks and suffer from counterattacks. In light of this, in this paper, we first reveal a gradient-based correlation between sensitivity analysis-based DNN interpreters and the generation process of adversarial examples, which indicates the Achilles’s heel of adversarial attacks and sheds light on linking together the two long-standing challenges of DNN: fragility and unexplainability. We then propose an interpreter-based ensemble framework called X-Ensemble for robust adversary defense. X-Ensemble adopts a novel detection-rectification process and features in building multiple sub-detectors and a rectifier upon various types of interpretation information toward target classifiers. Moreover, X-Ensemble employs the Random Forests (RF) model to combine sub-detectors into an ensemble detector for adversarial hybrid attacks defense. The non-differentiable property of RF further makes it a precious choice against the counterattack of adversaries. Extensive experiments under various types of state-of-the-art attacks and diverse attack scenarios demonstrate the advantages of X-Ensemble to competitive baseline methods.

The video of this talk cannot be embedded. You can watch it here:

https://dl.acm.org/doi/10.1145/3394486.3403044#sec-supp

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at KDD 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Disrupting Deep Uncertainty Estimation Without Harming Accuracy

Ido Galil, Ran El-Yaniv

Keywords Paper

deep learning, machine learning, adversarial robustness and security

0

0

0

0

9:14

18/07/2021

Towards Defending against Adversarial Examples via Attack-Invariant Features

Dawei Zhou, Tongliang Liu, Bo Han and
Nannan Wang, Chunlei Peng, Xinbo Gao

Keywords Paper

Algorithms, Adversarial Examples

0

0

0

0

5:19

18/07/2021

Double-Win Quant: Aggressively Winning Robustness of Quantized Deep Neural Networks via Random Precision Training and Inference

Yonggan Fu, Qixuan Yu, Meng Li and
Vikas Chandra, Yingyan Lin

Keywords Paper

Algorithms, Adversarial Examples

0

0

0

0

5:20

06/12/2021

Backdoor Attack with Imperceptible Input and Latent Modification

Khoa D Doan, Yingjie Lao, Ping Li

Keywords Paper

deep learning, optimization, adversarial robustness and security, generative model

0

0

0

0

12:27

14/06/2020

A Self-supervised Approach for Adversarial Robustness

Muzammal Naseer, Salman Khan, Munawar Hayat and
Fahad Shahbaz Khan, Fatih Porikli

Keywords Paper

self-supervision, defense, attack, perceptual-features, classification, segmentation, object-detection, adversarial-learning, dynamic-defense, zero-shot

0

0

0

0

5:01

12/08/2020

Interpretable Deep Learning under Fire

Xinyang Zhang, Ningfei Wang, Hua Shen and
Shouling Ji, Xiapu Luo, Ting Wang

Keywords Paper

0

0

0

0

11:35

06/12/2021

Adversarial Feature Desensitization

Pouya Bashivan, Reza Bayat, Adam Ibrahim and
Kartik Ahuja, Mojtaba Faramarzi, Touraj Laleh, Blake Richards, Irina Rish

Keywords Paper

deep learning, robustness, adversarial robustness and security, domain adaptation

0

0

0

0

13:27

14/06/2020

Learn2Perturb: An End-to-End Feature Perturbation Learning to Improve Adversarial Robustness

Ahmadreza Jeddi, Mohammad Javad Shafiee, Michelle Karg and
Christian Scharfenberger, Alexander Wong

Keywords Paper

adversarial robustness, network randomization, alternative back-propagation, trainable noise, adversarial training

0

0

0

0

1:01

26/04/2020

Improving Adversarial Robustness Requires Revisiting Misclassified Examples

Yisen Wang, Difan Zou, Jinfeng Yi and
James Bailey, Xingjun Ma, Quanquan Gu

Keywords Paper

Robustness, Adversarial Defense, Adversarial Training

0

0

0

0

5:02

14/06/2020

One Man’s Trash Is Another Man’s Treasure: Resisting Adversarial Examples by Adversarial Examples

Chang Xiao, Changxi Zheng

Keywords Paper

adversarial defense, adversarial examples, adversarial learning, deep learning

0

0

0

0

1:01

30/11/2020

Double Targeted Universal Adversarial Perturbations

Philipp Benz, Chaoning Zhang, Tooba Imtiaz, In So Kweon

Keywords Paper

0

0

0

0

9:28

12/07/2020

Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Jang-Hyun Kim, Wonho Choo, Hyun Oh Song

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

13:40

12/07/2020

On Breaking Deep Generative Model-based Defenses and Beyond

Yanzhi Chen, Renjie Xie, Zhanxing Zhu

Keywords Paper

Adversarial Examples

0

0

0

0

10:33

04/07/2020

Word-level Textual Adversarial Attacking as Combinatorial Optimization

Yuan Zang, Fanchao Qi, Chenghao Yang and
Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun

Keywords Paper

Textual attacking, Word-level attacking, combinatorial problem, Word-level Attacking

0

0

0

0

9:34

14/06/2020

Adversarial Camouflage: Hiding Physical-World Attacks With Natural Styles

Ranjie Duan, Xingjun Ma, Yisen Wang and
James Bailey, A. K. Qin, Yun Yang

Keywords Paper

adversarial example, physical attack, camouflage

0

0

0

0

1:01

12/07/2020

Adversarial Neural Pruning with Latent Vulnerability Suppression

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Keywords Paper

Adversarial Examples

0

0

0

0

14:43

06/12/2020

Robustness of Bayesian Neural Networks to Gradient-Based Attacks

Ginevra Carbone, Matthew Wicker, Luca Laurenti and
Andrea Patane', Luca Bortolussi, Guido Sanguinetti

Keywords Paper

0

0

0

0

3:08

02/02/2021

Adversarial Robustness through Disentangled Representations

Shuo Yang, Tianyu Guo, Yunhe Wang, Chang Xu

Keywords Paper

0

0

0

0

15:00

14/06/2020

Boosting the Transferability of Adversarial Samples via Attention

Weibin Wu, Yuxin Su, Xixian Chen and
Shenglin Zhao, Irwin King, Michael R. Lyu, Yu-Wing Tai

Keywords Paper

adversarial learning, adversarial attack methods

0

0

0

0

1:01

14/06/2020

Benchmarking Adversarial Robustness on Image Classification

Yinpeng Dong, Qi-An Fu, Xiao Yang and
Tianyu Pang, Hang Su, Zihao Xiao, Jun Zhu

Keywords Paper

adversarial robustness, benchmark, evaluation, security, attack, defense, image classification

0

0

0

0

4:59

14/06/2020

Conditional Gaussian Distribution Learning for Open Set Recognition

Xin Sun, Zhenning Yang, Chi Zhang and
Keck-Voon Ling, Guohao Peng

Keywords Paper

open set recognition, conditional variational auto-encoder, gaussian distribution learning, probabilistic ladder architecture.

0

0

0

0

1:01

02/02/2021

Detecting Adversarial Examples from Sensitivity Inconsistency of Spatial-Transform Domain

Jinyu Tian, Jiantao Zhou, Yuanman Li, Jia Duan

Keywords Paper

0

0

0

0

18:59

06/12/2021

Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks

Yonggan Fu, Qixuan Yu, Yang Zhang and
Shang Wu, Xu Ouyang, David Cox, Yingyan Lin

Keywords Paper

deep learning, robustness, adversarial robustness and security

0

0

0

0

12:48

26/04/2020

Defending Against Physically Realizable Attacks on Image Classification

Tong Wu, Liang Tong, Yevgeniy Vorobeychik

Keywords Paper

defense against physical attacks, adversarial machine learning

0

0

0

0

4:57

06/12/2020

Adversarial Example Games

Joey Bose, Gauthier Gidel, Hugo Berard and
Andre Cianflone, Pascal Vincent, Simon Lacoste-Julien, Will Hamilton

Keywords Paper

0

0

0

0

3:22

06/12/2021

Adversarial Robustness with Non-uniform Perturbations

Ecenaz Erdemir, Jeffrey Bickford, Luca Melis, Sergul Aydore

Keywords Paper

deep learning, optimization, machine learning, robustness, adversarial robustness and security

0

0

0

0

15:05

12/07/2020

Proper Network Interpretability Helps Adversarial Robustness in Classification

Akhilan Boopathy, Sijia Liu, Gaoyuan Zhang and
Cynthia Liu, Pin-Yu Chen, Shiyu Chang, Luca Daniel

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

15:01

18/07/2021

Towards Better Robust Generalization with Shift Consistency Regularization

Shufei Zhang, Zhuang Qian, Kaizhu Huang and
Qiufeng Wang, Rui Zhang, Xinping Yi

Keywords Paper

Algorithms, Adversarial Examples

0

0

0

0

5:44

14/06/2020

Enhancing Cross-Task Black-Box Transferability of Adversarial Examples With Dispersion Reduction

Yantao Lu, Yunhan Jia, Jianyu Wang and
Bai Li, Weiheng Chai, Lawrence Carin, Senem Velipasalar

Keywords Paper

adversarial example, black-box attack, cross tasks, transferability, deep neural network

0

0

0

0

1:01

18/07/2021

Weight-covariance alignment for adversarially robust neural networks

Panagiotis Eustratiadis, Henry Gouk, Da Li, Timothy Hospedales

Keywords Paper

Deep Learning, Adversarial Networks

0

0

0

0

4:46

06/12/2021

Random Noise Defense Against Query-Based Black-Box Attacks

Zeyu Qin, Yanbo Fan, Hongyuan Zha, Baoyuan Wu

Keywords Paper

machine learning, robustness, adversarial robustness and security

0

0

0

0

12:53

03/05/2021

Protecting DNNs from Theft using an Ensemble of Diverse Models

Sanjay Kariyappa, Atul Prakash, Moinuddin K Qureshi

Keywords Paper

machine learning security, Model stealing

0

0

0

0

5:05

14/06/2020

Towards Verifying Robustness of Neural Networks Against A Family of Semantic Perturbations

Jeet Mohapatra, Tsui-Wei Weng, Pin-Yu Chen and
Sijia Liu, Luca Daniel

Keywords Paper

robustness verification, semantic attacks, adversarial examples, adversarial robustness, deep learning

0

0

0

0

5:00

06/12/2021

Towards Efficient and Effective Adversarial Training

Gaurang Sriramanan, Sravanti Addepalli, Arya Baburaj, Venkatesh Babu R

Keywords Paper

deep learning, optimization, robustness, adversarial robustness and security

0

0

0

0

14:24

12/07/2020

Adversarial Robustness Against the Union of Multiple Threat Models

Pratyush Maini, Eric Wong, Zico Kolter

Keywords Paper

Adversarial Examples

0

0

0

0

15:02

14/06/2020

What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images

Xing Xu, Jiefu Chen, Jinhui Xiao and
Lianli Gao, Fumin Shen, Heng Tao Shen

Keywords Paper

adversarial attack, scene text recognition, white-box attack, targeted attack, untargeted attack

0

0

0

0

5:00

18/07/2021

Neural Tangent Generalization Attacks

Jimmy Yuan, Shan-Hung (Brandon) Wu

Keywords Paper

Algorithms, Adversarial Examples

0

0

0

0

5:14

06/12/2020

Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and Robustness

Long Zhao, Ting Liu, Xi Peng, Dimitris Metaxas

Keywords Paper

0

0

0

0

3:22

02/02/2021

Proactive Privacy-preserving Learning for Retrieval

Peng-Fei Zhang, Zi Huang, Xin-Shun Xu

Keywords Paper

0

0

0

0

17:14

06/12/2021

FINE Samples for Learning with Noisy Labels

Taehyeon Kim, Jongwoo Ko, sangwook Cho and
JinHwan Choi, Se-Young Yun

Keywords Paper

theory, deep learning, machine learning, vision, semi-supervised learning

0

1

0

0

11:09