Dual Manifold Adversarial Robustness: Defense against Lp and non-Lp Adversarial Attacks

Abstract: Adversarial training is a popular defense strategy against attack threat models with bounded Lp norms. However, it often degrades the model performance on normal images and more importantly, the defense does not generalize well to novel attacks. Given the success of deep generative models such as GANs and VAEs in characterizing the underlying manifold of images, we investigate whether or not the aforementioned deficiencies of adversarial training can be remedied by exploiting the underlying manifold information. To partially answer this question, we consider the scenario when the manifold information of the underlying data is available. We use a subset of ImageNet natural images where an approximate underlying manifold is learned using StyleGAN. We also construct an ``On-Manifold ImageNet'' (OM-ImageNet) dataset by projecting the ImageNet samples onto the learned manifold. For OM-ImageNet, the underlying manifold information is exact. Using OM-ImageNet, we first show that on-manifold adversarial training improves both standard accuracy and robustness to on-manifold attacks. However, since no out-of-manifold perturbations are realized, the defense can be broken by Lp adversarial attacks. We further propose Dual Manifold Adversarial Training (DMAT) where adversarial perturbations in both latent and image spaces are used in robustifying the model. Our DMAT improves performance on normal images, and achieves comparable robustness to the standard adversarial training against Lp attacks. In addition, we observe that models defended by DMAT achieve improved robustness against novel attacks which manipulate images by global color shifts or various types of image filtering. Interestingly, similar improvements are also achieved when the defended models are tested on (out-of-manifold) natural images. These results demonstrate the potential benefits of using manifold information in enhancing robustness of deep learning models against various types of novel adversarial attacks.

Dual Manifold Adversarial Robustness: Defense against Lp and non-Lp Adversarial Attacks

Wei-An Lin, Chun Pong Lau, Alexander Levine, Rama Chellappa, Soheil Feizi

Comments

Similar Papers

Single-Step Adversarial Training With Dropout Scheduling

Vivek B.S., R. Venkatesh Babu

Keywords Abstract Paper

adversarial training, robustness, efficient training, representation learning, generalization, supervised learning, recognition, classification, neural networks, deep learning

AdvFlow: Inconspicuous Black-box Adversarial Attacks using Normalizing Flows

Hadi Mohaghegh Dolatabadi, Sarah Erfani, Christopher Leckie

Keywords Abstract Paper

Nesterov Accelerated Gradient and Scale Invariance for Adversarial Attacks

Jiadong Lin, Chuanbiao Song, Kun He and Liwei Wang, John E. Hopcroft

Keywords Abstract Paper

adversarial examples, adversarial attack, transferability, Nesterov accelerated gradient, scale invariance

Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification

Yixiao Ge, Dapeng Chen, Hongsheng Li

Keywords Abstract Paper

Label Refinery, Unsupervised Domain Adaptation, Person Re-identification

Attack to Explain Deep Representation

Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

Keywords Abstract Paper

interpreting deep learning, adversarial attack, explanation attack, explainable ai, image generation

Alleviating Noisy-label Effects in Image Classification via Probability Transition Matrix

Ziqi Zhang, Yuexiang Li, Hongxin Wei and Kai Ma, Tao Xu, Yefeng Zheng

Keywords Abstract Paper

noisy labels, image classification, instance selection, robust learning, inter-class correlation, soft label, medical image

Boosting Adversarial Transferability through Enhanced Momentum

Xiaosen Wang, Jiadong Lin, Han Hu and Jingdong Wang, Kun He

Keywords Abstract Paper

adversarial transferability, adversarial attack, adversarial examples, optimization

You Only Need Adversarial Supervision for Semantic Image Synthesis

Edgar Schoenfeld, Vadim Sushko, Dan Zhang and Juergen Gall, Bernt Schiele, Anna Khoreva

Keywords Abstract Paper

GANs, Semantic Image Synthesis, Image Generation, Deep Learning

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases

Stéphane d'Ascoli, Hugo Touvron, Matthew Leavitt and Ari Morcos, Giulio Biroli, Levent Sagun

Keywords Abstract Paper

Deep Learning, Architectures

Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild

Weijia Wu, Ning Lu, Enze Xie and Yuxing Wang, Wenwen Yu, Cheng Yang, Hong Zhou

Keywords Abstract Paper

Negative Data Augmentation

Abhishek Sinha, Kumar Ayush, Jiaming Song and Burak Uzkent, Hongxia Jin, Stefano Ermon

Keywords Abstract Paper

self-supervised learning, anomaly detection, generative models, data augmentation

Designing Counterfactual Generators using Deep Model Inversion

Jayaraman Thiagarajan, Vivek Sivaraman Narayanaswamy, Deepta Rajan and Jia Liang, Akshay Chaudhari, Andreas Spanias

Keywords Abstract Paper

optimization, representation learning, interpretability

Better Safe Than Sorry: Preventing Delusive Adversaries with Adversarial Training

Lue Tao, Lei Feng, Jinfeng Yi and Sheng-Jun Huang, Songcan Chen

Keywords Abstract Paper

adversarial robustness and security

Achieving Robustness in the Wild via Adversarial Mixing With Disentangled Representations

Sven Gowal, Chongli Qin, Po-Sen Huang and Taylan Cemgil, Krishnamurthy Dvijotham, Timothy Mann, Pushmeet Kohli

Keywords Abstract Paper

real-world robustness, adversarial examples, disentangled latents, generative models, spurious correlations

Transformation GAN for Unsupervised Image Synthesis and Representation Learning

Jiayu Wang, Wengang Zhou, Guo-Jun Qi and Zhongqian Fu, Qi Tian, Houqiang Li

Keywords Abstract Paper

gan, unsupervised learning, representation learning

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

Keywords Abstract Paper

Object-aware Contrastive Learning for Debiased Scene Representation

Sangwoo Mo, Hyunwoo Kang, Kihyuk Sohn and Chun-Liang Li, Jinwoo Shin

Keywords Abstract Paper

self-supervised learning, contrastive learning, representation learning

Rebooting ACGAN: Auxiliary Classifier GANs with Stable Training

Minguk Kang, Woohyeon Shim, Minsu Cho, Jaesik Park

Keywords Abstract Paper

generative model

Robust Local Features for Improving the Generalization of Adversarial Training

Chuanbiao Song, Kun He, Jiadong Lin and Liwei Wang, John E. Hopcroft

Keywords Abstract Paper

adversarial robustness, adversarial training, adversarial example, deep learning

Data-Efficient Instance Generation from Instance Discrimination

Ceyuan Yang, Yujun Shen, Yinghao Xu, Bolei Zhou

Keywords Abstract Paper

Keywords Paper

Keywords Paper

Jiadong Lin, Chuanbiao Song, Kun He and
Liwei Wang, John E. Hopcroft

Keywords Paper

Keywords Paper

Keywords Paper

Ziqi Zhang, Yuexiang Li, Hongxin Wei and
Kai Ma, Tao Xu, Yefeng Zheng

Keywords Paper

Xiaosen Wang, Jiadong Lin, Han Hu and
Jingdong Wang, Kun He

Keywords Paper

Edgar Schoenfeld, Vadim Sushko, Dan Zhang and
Juergen Gall, Bernt Schiele, Anna Khoreva

Keywords Paper

Stéphane d'Ascoli, Hugo Touvron, Matthew Leavitt and
Ari Morcos, Giulio Biroli, Levent Sagun

Keywords Paper

Weijia Wu, Ning Lu, Enze Xie and
Yuxing Wang, Wenwen Yu, Cheng Yang, Hong Zhou

Keywords Paper

Abhishek Sinha, Kumar Ayush, Jiaming Song and
Burak Uzkent, Hongxia Jin, Stefano Ermon

Keywords Paper

Jayaraman Thiagarajan, Vivek Sivaraman Narayanaswamy, Deepta Rajan and
Jia Liang, Akshay Chaudhari, Andreas Spanias

Keywords Paper

Lue Tao, Lei Feng, Jinfeng Yi and
Sheng-Jun Huang, Songcan Chen

Keywords Paper

Sven Gowal, Chongli Qin, Po-Sen Huang and
Taylan Cemgil, Krishnamurthy Dvijotham, Timothy Mann, Pushmeet Kohli

Keywords Paper

Jiayu Wang, Wengang Zhou, Guo-Jun Qi and
Zhongqian Fu, Qi Tian, Houqiang Li

Keywords Paper

Keywords Paper

Sangwoo Mo, Hyunwoo Kang, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin

Keywords Paper

Keywords Paper

Chuanbiao Song, Kun He, Jiadong Lin and
Liwei Wang, John E. Hopcroft

Keywords Paper

Keywords Paper

Denis Yarats, Amy Zhang, Ilya Kostrikov and
Brandon Amos, Joelle Pineau, Rob Fergus

Keywords Paper

Tianyu Pang, Kun Xu, Jun Zhu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Sravanti Addepalli, Vivek B.S., Arya Baburaj and
Gaurang Sriramanan, R. Venkatesh Babu

Keywords Paper

Huan Zhang, Hongge Chen, Chaowei Xiao and
Bo Li, Mingyan Liu, Duane Boning, Cho-Jui Hsieh

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yifan Zhang, Bryan Hooi, Dapeng Hu and
Jian Liang, Jiashi Feng

Keywords Paper

Kaiyang Cheng, Francesco Calivá, Rutwik Shah and
Misung Han, Sharmila Majumdar, Valentina Pedoia

Keywords Paper

Keywords Paper

Keywords Paper