Proper Network Interpretability Helps Adversarial Robustness in Classification

12/07/2020

Proper Network Interpretability Helps Adversarial Robustness in Classification

Akhilan Boopathy, Sijia Liu, Gaoyuan Zhang, Cynthia Liu, Pin-Yu Chen, Shiyu Chang, Luca Daniel

Keywords: Accountability, Transparency and Interpretability

Abstract Paper Similar Papers

Abstract: Recent works have empirically shown that there exist adversarial examples that can be hidden from neural network interpretability (namely, making network interpretation maps visually similar), and interpretability is itself susceptible to adversarial attacks. In this paper, we theoretically show that with a proper measurement of interpretation, it is actually difficult to prevent prediction-evasion adversarial attacks from causing interpretability discrepancy, as confirmed by experiments on MNIST, CIFAR-10 and Restricted ImageNet. Spurred by that, we develop an interpretability-aware defensive scheme built only on robust interpretation (without the need of resorting to adversarial loss minimization). We show that our defense achieves both robust classification and robust interpretation, outperforming state-of-the-art adversarial training methods against attacks of large perturbation in particular.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

A Self-supervised Approach for Adversarial Robustness

Muzammal Naseer, Salman Khan, Munawar Hayat and
Fahad Shahbaz Khan, Fatih Porikli

Keywords Paper

self-supervision, defense, attack, perceptual-features, classification, segmentation, object-detection, adversarial-learning, dynamic-defense, zero-shot

0

0

0

0

5:01

06/12/2021

Adversarial Robustness with Non-uniform Perturbations

Ecenaz Erdemir, Jeffrey Bickford, Luca Melis, Sergul Aydore

Keywords Paper

deep learning, optimization, machine learning, robustness, adversarial robustness and security

0

0

0

0

15:05

14/06/2020

One Man’s Trash Is Another Man’s Treasure: Resisting Adversarial Examples by Adversarial Examples

Chang Xiao, Changxi Zheng

Keywords Paper

adversarial defense, adversarial examples, adversarial learning, deep learning

0

0

0

0

1:01

06/12/2021

Adversarial Feature Desensitization

Pouya Bashivan, Reza Bayat, Adam Ibrahim and
Kartik Ahuja, Mojtaba Faramarzi, Touraj Laleh, Blake Richards, Irina Rish

Keywords Paper

deep learning, robustness, adversarial robustness and security, domain adaptation

0

0

0

0

13:27

14/06/2020

Towards Achieving Adversarial Robustness by Enforcing Feature Consistency Across Bit Planes

Sravanti Addepalli, Vivek B.S., Arya Baburaj and
Gaurang Sriramanan, R. Venkatesh Babu

Keywords Paper

adversarial robustness, adversarial defense, adversarial training, fast adversarial training, adversary-free training, adversarial attacks, efficient adversarial training, generalization, feature consistency, deep neural networks

0

0

0

0

1:01

12/07/2020

Adversarial Neural Pruning with Latent Vulnerability Suppression

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Keywords Paper

Adversarial Examples

0

0

0

0

14:43

26/08/2020

Robustness for Non-Parametric Classification: A Generic Attack and Defense

Yao-Yuan Yang, Cyrus Rashtchian, Yizhen Wang, Kamalika Chaudhuri

Keywords Paper

0

0

0

0

14:42

26/04/2020

Jacobian Adversarially Regularized Networks for Robustness

Alvin Chan, Yi Tay, Yew Soon Ong, Jie Fu

Keywords Paper

adversarial examples, robust machine learning, deep learning

0

0

0

1

5:20

03/05/2021

Improving Adversarial Robustness via Channel-wise Activation Suppressing

Yang Bai, Yuyuan Zeng, Yong Jiang and
Shu-Tao Xia, Daniel Ma, Yisen Wang

Keywords Paper

channel suppressing, Adversarial robustness, activation strategy.

0

0

0

0

15:53

26/04/2020

GAT: Generative Adversarial Training for Adversarial Example Detection and Classification

Xuwang Yin, Soheil Kolouri, Gustavo K Rohde

Keywords Paper

adversarial example detection, adversarial examples classification, robust optimization, ML security, generative modeling, generative classification

0

0

0

0

5:14

14/06/2020

Enhancing Cross-Task Black-Box Transferability of Adversarial Examples With Dispersion Reduction

Yantao Lu, Yunhan Jia, Jianyu Wang and
Bai Li, Weiheng Chai, Lawrence Carin, Senem Velipasalar

Keywords Paper

adversarial example, black-box attack, cross tasks, transferability, deep neural network

0

0

0

0

1:01

02/02/2021

Efficient Certification of Spatial Robustness

Anian Ruoss, Maximilian Baader, Mislav Balunović, Martin Vechev

Keywords Paper

0

1

0

0

16:30

18/11/2020

Towards understanding and improving the transferability of adversarial examples in deep neural networks

Lei Wu, Zhanxing Zhu

Keywords Paper

0

0

0

0

10:28

06/12/2020

Input-Aware Dynamic Backdoor Attack

Tuan Anh Nguyen, Anh Tran

Keywords Paper

0

0

0

0

3:21

14/06/2020

What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images

Xing Xu, Jiefu Chen, Jinhui Xiao and
Lianli Gao, Fumin Shen, Heng Tao Shen

Keywords Paper

adversarial attack, scene text recognition, white-box attack, targeted attack, untargeted attack

0

0

0

0

5:00

03/05/2021

Stochastic Security: Adversarial Defense Using Long-Run Dynamics of Energy-Based Models

Mitch Hill, Jonathan Mitchell, Song-Chun Zhu

Keywords Paper

energy-based model, adversarial defense, adversarial attack, Langevin sampling, Markov chain Monte Carlo, adversarial robustness

0

0

0

0

4:51

23/08/2020

Interpretability is a kind of safety: An interpreter-based ensemble for adversary defense

Jingyuan Wang, Yufan Wu, Mingxuan Li and
Xin Lin, Junjie Wu, Chao Li

Keywords Paper

DNN interpretation, adversarial example defense, ensemble

0

0

0

0

10:04

06/12/2021

Disrupting Deep Uncertainty Estimation Without Harming Accuracy

Ido Galil, Ran El-Yaniv

Keywords Paper

deep learning, machine learning, adversarial robustness and security

0

0

0

0

9:14

12/08/2020

Interpretable Deep Learning under Fire

Xinyang Zhang, Ningfei Wang, Hua Shen and
Shouling Ji, Xiapu Luo, Ting Wang

Keywords Paper

0

0

0

0

11:35

19/08/2021

A Rule Mining-based Advanced Persistent Threats Detection System

Sidahmed Benabderrahmane, Ghita Berrada, James Cheney, Petko Valtchev

Keywords Paper

Multidisciplinary Topics and Applications, Security and Privacy, Frequent Pattern Mining, Anomaly/Outlier Detection

0

0

0

0

15:27

30/11/2020

Attended-Auxiliary Supervision Representation for Face Anti-spoofing

Son Minh Nguyen, Linh Duy Tran, Masayuki Arai

Keywords Paper

0

0

0

0

6:50

14/06/2020

Towards Verifying Robustness of Neural Networks Against A Family of Semantic Perturbations

Jeet Mohapatra, Tsui-Wei Weng, Pin-Yu Chen and
Sijia Liu, Luca Daniel

Keywords Paper

robustness verification, semantic attacks, adversarial examples, adversarial robustness, deep learning

0

0

0

0

5:00

26/04/2020

Defending Against Physically Realizable Attacks on Image Classification

Tong Wu, Liang Tong, Yevgeniy Vorobeychik

Keywords Paper

defense against physical attacks, adversarial machine learning

0

0

0

0

4:57

02/02/2021

Adversarial Robustness through Disentangled Representations

Shuo Yang, Tianyu Guo, Yunhe Wang, Chang Xu

Keywords Paper

0

0

0

0

15:00

02/02/2021

Defending against Backdoors in Federated Learning with Robust Learning Rate

Mustafa Safa Ozdayi, Murat Kantarcioglu, Yulia R. Gel

Keywords Paper

0

0

0

0

16:19

02/02/2021

A Unified Multi-Scenario Attacking Network for Visual Object Tracking

Xuesong Chen, Canmiao Fu, Feng Zheng and
Yong Zhao, Hongsheng Li, Ping Luo, Guo-Jun Qi

Keywords Paper

0

0

0

0

18:19

06/12/2021

Backdoor Attack with Imperceptible Input and Latent Modification

Khoa D Doan, Yingjie Lao, Ping Li

Keywords Paper

deep learning, optimization, adversarial robustness and security, generative model

0

0

0

0

12:27

16/11/2020

Towards Debiasing NLU Models from Unknown Biases

Prasetya Ajie Utama, Nafise Sadat Moosavi, Iryna Gurevych

Keywords Paper

nlu tasks, nlu models, debiasing methods, self-debiasing framework

0

0

0

0

10:40

06/12/2020

Adversarial Example Games

Joey Bose, Gauthier Gidel, Hugo Berard and
Andre Cianflone, Pascal Vincent, Simon Lacoste-Julien, Will Hamilton

Keywords Paper

0

0

0

0

3:22

12/08/2020

SmartVerif: Push the Limit of Automation Capability of Verifying Security Protocols by Dynamic Strategies

Yan Xiong, Cheng Su, Wenchao Huang and
Fuyou Miao, Wansen Wang, Hengyi Ouyang

Keywords Paper

0

0

0

0

11:18

14/06/2020

The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks

Yuheng Zhang, Ruoxi Jia, Hengzhi Pei and
Wenxiao Wang, Bo Li, Dawn Song

Keywords Paper

model inversion attack, deep neural network, generative adversarial network, differential privacy

0

0

0

0

5:01

05/01/2021

Defense-Friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty

Camilo Pestana, Wei Liu, David Glance, Ajmal Mian

Keywords Paper

0

0

0

0

4:56

06/12/2020

AdvFlow: Inconspicuous Black-box Adversarial Attacks using Normalizing Flows

Hadi Mohaghegh Dolatabadi, Sarah Erfani, Christopher Leckie

Keywords Paper

0

0

0

0

3:59

18/07/2021

Weight-covariance alignment for adversarially robust neural networks

Panagiotis Eustratiadis, Henry Gouk, Da Li, Timothy Hospedales

Keywords Paper

Deep Learning, Adversarial Networks

0

0

0

0

4:46

03/05/2021

Provably robust classification of adversarial examples with detection

Fatemeh Sheikholeslami, Ali Lotfi, Zico Kolter

Keywords Paper

Adversarial robustness, robust deep learning

0

1

0

0

5:01

12/07/2020

Adversarial Robustness Against the Union of Multiple Threat Models

Pratyush Maini, Eric Wong, Zico Kolter

Keywords Paper

Adversarial Examples

0

0

0

0

15:02

14/06/2020

Boosting the Transferability of Adversarial Samples via Attention

Weibin Wu, Yuxin Su, Xixian Chen and
Shenglin Zhao, Irwin King, Michael R. Lyu, Yu-Wing Tai

Keywords Paper

adversarial learning, adversarial attack methods

0

0

0

0

1:01

14/06/2020

QEBA: Query-Efficient Boundary-Based Blackbox Attack

Huichen Li, Xiaojun Xu, Xiaolu Zhang and
Shuang Yang, Bo Li

Keywords Paper

adversarial machine learning, black-box attack, boundary-based attack, attacking public api

0

0

0

0

1:01

02/02/2021

Sparsity Aware Normalization for GANs

Idan Kligvasser, Tomer Michaeli

Keywords Paper

0

0

0

0

17:09

12/07/2020

Defense Through Diverse Directions

Christopher Bender, Yang Li, Yifeng Shi and
Michael K. Reiter, Junier Oliva

Keywords Paper

Adversarial Examples

0

0

0

0

15:06