Improving Adversarial Robustness via Channel-wise Activation Suppressing

Abstract: The study of adversarial examples and their activations have attracted significant attention for secure and robust learning with deep neural networks (DNNs). Different from existing works, in this paper, we highlight two new characteristics of adversarial examples from the channel-wise activation perspective: 1) the activation magnitudes of adversarial examples are higher than that of natural examples; and 2) the channels are activated more uniformly by adversarial examples than natural examples. We find that, while the state-of-the-art defense adversarial training has addressed the first issue of high activation magnitude via training on adversarial examples, the second issue of uniform activation remains. This motivates us to suppress redundant activations from being activated by adversarial perturbations during the adversarial training process, via a Channel-wise Activation Suppressing (CAS) training strategy. We show that CAS can train a model that inherently suppresses adversarial activations, and can be easily applied to existing defense methods to further improve their robustness. Our work provides a simplebut generic training strategy for robustifying the intermediate layer activations of DNNs.

Improving Adversarial Robustness via Channel-wise Activation Suppressing

Yang Bai, Yuyuan Zeng, Yong Jiang, Shu-Tao Xia, Daniel Ma, Yisen Wang

Comments

Similar Papers

Adversarial Feature Desensitization

Pouya Bashivan, Reza Bayat, Adam Ibrahim and Kartik Ahuja, Mojtaba Faramarzi, Touraj Laleh, Blake Richards, Irina Rish

Keywords Abstract Paper

deep learning, robustness, adversarial robustness and security, domain adaptation

Improving Adversarial Robustness Requires Revisiting Misclassified Examples

Yisen Wang, Difan Zou, Jinfeng Yi and James Bailey, Xingjun Ma, Quanquan Gu

Keywords Abstract Paper

Robustness, Adversarial Defense, Adversarial Training

GAT: Generative Adversarial Training for Adversarial Example Detection and Classification

Xuwang Yin, Soheil Kolouri, Gustavo K Rohde

Keywords Abstract Paper

adversarial example detection, adversarial examples classification, robust optimization, ML security, generative modeling, generative classification

Towards Achieving Adversarial Robustness by Enforcing Feature Consistency Across Bit Planes

Sravanti Addepalli, Vivek B.S., Arya Baburaj and Gaurang Sriramanan, R. Venkatesh Babu

Keywords Abstract Paper

adversarial robustness, adversarial defense, adversarial training, fast adversarial training, adversary-free training, adversarial attacks, efficient adversarial training, generalization, feature consistency, deep neural networks

Learn2Perturb: An End-to-End Feature Perturbation Learning to Improve Adversarial Robustness

Ahmadreza Jeddi, Mohammad Javad Shafiee, Michelle Karg and Christian Scharfenberger, Alexander Wong

Keywords Abstract Paper

adversarial robustness, network randomization, alternative back-propagation, trainable noise, adversarial training

Enhancing Intrinsic Adversarial Robustness via Feature Pyramid Decoder

Guanlin Li, Shuya Ding, Jun Luo, Chang Liu

Keywords Abstract Paper

adversarial defense, feature denoise, multi-task learning, self-supervised learning, image restoration, lipschitz constant constraint, feature pyramid decoder

Towards Better Robust Generalization with Shift Consistency Regularization

Shufei Zhang, Zhuang Qian, Kaizhu Huang and Qiufeng Wang, Rui Zhang, Xinping Yi

Keywords Abstract Paper

Algorithms, Adversarial Examples

Single-Step Adversarial Training With Dropout Scheduling

Vivek B.S., R. Venkatesh Babu

Keywords Abstract Paper

adversarial training, robustness, efficient training, representation learning, generalization, supervised learning, recognition, classification, neural networks, deep learning

Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks

Christoph Raab, Philipp Väth, Peter Meier, Frank-Michael Schleif

Keywords Abstract Paper

Concise Explanations of Neural Networks using Adversarial Training

Prasad Chalasani, Jiefeng Chen, Amrita Roy Chowdhury and Xi Wu, Somesh Jha

Keywords Abstract Paper

Accountability, Transparency and Interpretability

Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and Robustness

Long Zhao, Ting Liu, Xi Peng, Dimitris Metaxas

Keywords Abstract Paper

Improving Gradient Regularization using Complex-Valued Neural Networks

Eric Yeats, Yiran Chen, Hai Li

Keywords Abstract Paper

Algorithms, Adversarial Examples

Stochastic Security: Adversarial Defense Using Long-Run Dynamics of Energy-Based Models

Mitch Hill, Jonathan Mitchell, Song-Chun Zhu

Keywords Abstract Paper

energy-based model, adversarial defense, adversarial attack, Langevin sampling, Markov chain Monte Carlo, adversarial robustness

Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee

Wei Hu, Zhiyuan Li, Dingli Yu

Keywords Abstract Paper

deep learning theory, regularization, noisy labels

Adversarial Neural Pruning with Latent Vulnerability Suppression

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Keywords Abstract Paper

Adversarial Examples

Proper Network Interpretability Helps Adversarial Robustness in Classification

Akhilan Boopathy, Sijia Liu, Gaoyuan Zhang and Cynthia Liu, Pin-Yu Chen, Shiyu Chang, Luca Daniel

Keywords Abstract Paper

Accountability, Transparency and Interpretability

Hold me tight! Influence of discriminative features on deep network boundaries

Guillermo Ortiz-Jimenez, Apostolos Modas, Seyed Moosavi-Dezfooli, Pascal Frossard

Keywords Abstract Paper

Robust Design of Deep Neural Networks Against Adversarial Attacks Based on Lyapunov Theory

Arash Rahnama, Andre T. Nguyen, Edward Raff

Keywords Abstract Paper

adversarial machine learning, robustness, control theory, lyapunov theory, spectral norm regularization, stability and robustness analysis of dnns, dissipativity and passivity theory, adversarial attacks, learning theory, mathematical analysis of dnns

Deep Residual Flow for Out of Distribution Detection

Ev Zisselman, Aviv Tamar

Keywords Abstract Paper

neural-networks, out-of-distribution detection, flow models, neural generative models, machine learning architectures.

Training GANs with Stronger Augmentations via Contrastive Discriminator

Jongheon Jeong, Jinwoo Shin

Keywords Abstract Paper

Pouya Bashivan, Reza Bayat, Adam Ibrahim and
Kartik Ahuja, Mojtaba Faramarzi, Touraj Laleh, Blake Richards, Irina Rish

Keywords Paper

Yisen Wang, Difan Zou, Jinfeng Yi and
James Bailey, Xingjun Ma, Quanquan Gu

Keywords Paper

Keywords Paper

Sravanti Addepalli, Vivek B.S., Arya Baburaj and
Gaurang Sriramanan, R. Venkatesh Babu

Keywords Paper

Ahmadreza Jeddi, Mohammad Javad Shafiee, Michelle Karg and
Christian Scharfenberger, Alexander Wong

Keywords Paper

Keywords Paper

Shufei Zhang, Zhuang Qian, Kaizhu Huang and
Qiufeng Wang, Rui Zhang, Xinping Yi

Keywords Paper

Keywords Paper

Keywords Paper

Prasad Chalasani, Jiefeng Chen, Amrita Roy Chowdhury and
Xi Wu, Somesh Jha

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Akhilan Boopathy, Sijia Liu, Gaoyuan Zhang and
Cynthia Liu, Pin-Yu Chen, Shiyu Chang, Luca Daniel

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Muzammal Naseer, Salman Khan, Munawar Hayat and
Fahad Shahbaz Khan, Fatih Porikli

Keywords Paper

Zifeng Wang, Tong Jian, Aria Masoomi and
Stratis Ioannidis, Jennifer Dy

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Matthew Wicker, Luca Laurenti, Andrea Patane and
Zhuotong Chen, Zheng Zhang, Marta Kwiatkowska

Keywords Paper

Keywords Paper

Keywords Paper

Awais Muhammad, Fengwei Zhou, Chuanlong Xie and
Jiawei Li, Sung-Ho Bae, Zhenguo Li

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper