Adaptive Verifiable Training Using Pairwise Class Similarity

Abstract: Verifiable training has shown success in creating neural networks that are provably robust to a given amount of noise. However, despite only enforcing a single robustness criterion, its performance scales poorly with dataset complexity. On CIFAR10, a non-robust LeNet model has a 21.63% error rate, while a model created using verifiable training and a L-infinity robustness criterion of 8/255, has an error rate of 57.10%. Upon examination, we find that when labeling visually similar classes, the model's error rate is as high as 61.65%. Thus, we attribute the loss in performance to inter-class similarity. Classes that are similar (i.e., close in the feature space) increase the difficulty of learning a robust model. While it may be desirable to train a model to be robust for a large robustness region, pairwise class similarities limit the potential gains. Furthermore, consideration must be made regarding the relative cost of mistaking one class for another. In security or safety critical tasks, similar classes are likely to belong to the same group, and thus are equally sensitive. In this work, we propose a new approach that utilizes inter-class similarity to improve the performance of verifiable training and create robust models with respect to multiple adversarial criteria. First, we cluster similar classes using agglomerate clustering and assign robustness criteria based on the degree of similarity between clusters. Next, we propose two methods to apply our approach: (1) the Inter-Group Robustness Prioritization method, which uses a custom loss term to create a single model with multiple robustness guarantees and (2) the neural decision tree method, which trains multiple sub-classifiers with different robustness guarantees and combines them in a decision tree architecture. Our experiments on Fashion-MNIST and CIFAR10 demonstrate that by prioritizing the robustness between the most dissimilar groups, we improve clean performance by up to 9.63% and 30.89% respectively. Furthermore, on CIFAR100, our approach reduces the clean error rate by 26.32%.

06/12/2021

adversarial training, adversarially robust generalization, mixup, adversarial defense, adversarial examples, adversarial robustness, security

5:01

25/07/2020

Adaptive Verifiable Training Using Pairwise Class Similarity

Shiqi Wang, Kevin Eykholt, Taesung Lee, Jiyong Jang, Ian Molloy

Comments

Similar Papers

Network-to-Network Regularization: Enforcing Occam's Razor to Improve Generalization

Rohan Ghosh, Mehul Motani

Keywords Abstract Paper

theory, deep learning, machine learning

When Do Neural Networks Outperform Kernel Methods?

Behrooz Ghorbani, Song Mei, Theodor Misiakiewicz, Andrea Montanari

Keywords Abstract Paper

Regularizing Class-Wise Predictions via Self-Knowledge Distillation

Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin

Keywords Abstract Paper

image classification, regularization, self-knowledge distillation, generalization, calibration

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Kanil Patel, William H Beluch, Bin Yang and Michael Pfeiffer, Dan Zhang

Keywords Abstract Paper

deep neural networks, histogram binning, post-hoc calibration, uncertainty calibration, mutual information

Learning from Noisy Labels with Complementary Loss Functions

Deng-Bao Wang, Yong Wen, Lujia Pan, Min-Ling Zhang

Keywords Abstract Paper

Examining and Combating Spurious Features under Distribution Shift

Chunting Zhou, Xuezhe Ma, Paul Michel, Graham Neubig

Keywords Abstract Paper

Deep Learning, Embedding and Representation learning

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

Keywords Abstract Paper

Distributionally Robust Neural Networks

Shiori Sagawa*, Pang Wei Koh*, Tatsunori B. Hashimoto, Percy Liang

Keywords Abstract Paper

distributionally robust optimization, deep learning, robustness, generalization, regularization

Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets

Max Ryabinin, Andrey Malinin, Mark Gales

Keywords Abstract Paper

machine learning

Adversarial Filters of Dataset Biases

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula and Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords Abstract Paper

Deep Learning - Algorithms

Adversarial Vertex Mixup: Toward Better Adversarially Robust Generalization

Saehyung Lee, Hyungyu Lee, Sungroh Yoon

Keywords Abstract Paper

adversarial training, adversarially robust generalization, mixup, adversarial defense, adversarial examples, adversarial robustness, security

Jointly non-sampling learning for knowledge graph enhanced recommendation

Chong Chen, Min Zhang, Weizhi Ma and Yiqun Liu, Shaoping Ma

Keywords Abstract Paper

recommender systems, non-sampling learning, knowledge graph, implicit feedback, efficient

Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels

Zhaowei Zhu, Yiwen Song, Yang Liu

Keywords Abstract Paper

Deep Learning

Adversarial Learning for Robust Deep Clustering

Xu Yang, Cheng Deng, Kun Wei and Junchi Yan, Wei Liu

Keywords Abstract Paper

Robust Unsupervised Learning via L-statistic Minimization

Andreas Maurer, Daniela Angela Parletta, Andrea Paudice, Massimiliano Pontil

Keywords Abstract Paper

Theory, Statistical Learning Theory

Towards Understanding the Dynamics of the First-Order Adversaries

Zhun Deng, Hangfeng He, Jiaoyang Huang, Weijie Su

Keywords Abstract Paper

Adversarial Examples

Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization

Hongxin Wei, Lei Feng, Xiangyu Chen, Bo An

Keywords Abstract Paper

agreement, noisy labels, co-regularization, weakly supervised learning, joint training, disagreement, contrastive loss

Boost Neural Networks by Checkpoints

Feng Wang, Guoyizhe Wei, Qiao Liu and Jinxiang Ou, xian wei, Hairong Lv

Keywords Abstract Paper

deep learning

Discrete-Valued Neural Communication

Dianbo Liu, Alex Lamb, Kenji Kawaguchi and Anirudh Goyal ALIAS PARTH GOYAL, Chen Sun, Michael Mozer, Yoshua Bengio

Keywords Abstract Paper

deep learning, robustness, transformers, generative model, graph learning

Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds

Yujia Huang, Huan Zhang, Yuanyuan Shi and J. Zico Kolter, Anima Anandkumar

Keywords Abstract Paper

deep learning, robustness, adversarial robustness and security

Keywords Paper

Keywords Paper

Keywords Paper

Kanil Patel, William H Beluch, Bin Yang and
Michael Pfeiffer, Dan Zhang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Shiori Sagawa, Pang Wei Koh, Tatsunori B. Hashimoto, Percy Liang

Keywords Paper

Keywords Paper

Ronan Le Bras, Swabha Swayamdipta, Chandra Bhagavatula and
Rowan Zellers, Matthew Peters, Ashish Sabharwal, Yejin Choi

Keywords Paper

Keywords Paper

Chong Chen, Min Zhang, Weizhi Ma and
Yiqun Liu, Shaoping Ma

Keywords Paper

Keywords Paper

Xu Yang, Cheng Deng, Kun Wei and
Junchi Yan, Wei Liu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Feng Wang, Guoyizhe Wei, Qiao Liu and
Jinxiang Ou, xian wei, Hairong Lv

Keywords Paper

Dianbo Liu, Alex Lamb, Kenji Kawaguchi and
Anirudh Goyal ALIAS PARTH GOYAL, Chen Sun, Michael Mozer, Yoshua Bengio

Keywords Paper

Yujia Huang, Huan Zhang, Yuanyuan Shi and
J. Zico Kolter, Anima Anandkumar

Keywords Paper

Keywords Paper

Bo Liu, Xingchao Liu, Xiaojie Jin and
Peter Stone, Qiang Liu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Xu Chu, Yang Lin, Xiting Wang and
Xin Gao, Qi Tong, Hailong Yu, Yasha Wang

Keywords Paper

Miao Zhang, Huiqi Li, Shirui Pan and
Xiaojun Chang, Steven Su

Keywords Paper

Keywords Paper

Ge Liu, Linglan Zhao, Wei Li and
Dashan Guo, Xiangzhong Fang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jongheon Jeong, Sejun Park, Minkyu Kim and
Heung-Chang Lee, Do-Guk Kim, Jinwoo Shin

Keywords Paper