Detecting Adversarial Samples Using Influence Functions and Nearest Neighbors

14/06/2020

Gilad Cohen, Guillermo Sapiro, Raja Giryes

Keywords: adversarial detection, k-nearest neighbors, influence functions

Abstract: Deep neural networks (DNNs) are notorious for their vulnerability to adversarial attacks, which are small perturbations added to their input images to mislead their prediction. Detection of adversarial examples is, therefore, a fundamental requirement for robust classification frameworks. In this work, we present a method for detecting such adversarial attacks, which is suitable for any pre-trained neural network classifier. We use influence functions to measure the impact of every training sample on the validation set data. From the influence scores, we find the most supportive training samples for any given validation example. A k-nearest neighbor (k-NN) model fitted on the DNN's activation layers is employed to search for the ranking of these supporting training samples. We observe that these samples are highly correlated with the nearest neighbors of the normal inputs, while this correlation is much weaker for adversarial inputs. We train an adversarial detector using the k-NN ranks and distances and show that it successfully distinguishes adversarial examples, getting state-of-the-art results on six attack methods with three datasets. Code is available at https://github.com/giladcohen/NNIF_adv_defense.
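The rank-and-distance features described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see the linked repository for that). It assumes a single activation layer, Euclidean distance, and that the indices of the most supportive training samples have already been obtained from influence scores; the function name `nn_rank_and_distance` and the toy arrays are hypothetical.

```python
import numpy as np


def nn_rank_and_distance(train_feats, query_feat, helpful_idx):
    """Nearest-neighbor rank and L2 distance of each supportive training sample.

    train_feats: (n_train, d) activations of the training set at one layer.
    query_feat:  (d,) activation of the input under test at the same layer.
    helpful_idx: indices of the most supportive training samples
                 (assumed to be precomputed from influence scores).
    """
    # Euclidean distance from the query to every training activation.
    dists = np.linalg.norm(train_feats - query_feat, axis=1)
    # order[r] is the training index at rank r (rank 0 is the closest).
    order = np.argsort(dists, kind="stable")
    # Invert the permutation so that ranks[i] is the rank of training sample i.
    ranks = np.empty_like(order)
    ranks[order] = np.arange(len(order))
    helpful_idx = np.asarray(helpful_idx)
    return ranks[helpful_idx], dists[helpful_idx]


# Toy data: four training activations in 2-D and one query at the origin.
train_feats = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0], [3.0, 4.0]])
query_feat = np.array([0.0, 0.0])
helpful_idx = [1, 3]  # hypothetical output of the influence-function step

ranks, dists = nn_rank_and_distance(train_feats, query_feat, helpful_idx)
# Feature vector that a downstream detector could be trained on.
features = np.concatenate([ranks, dists])
```

Under the abstract's observation, a normal input would tend to yield small ranks and distances for its supportive samples, while an adversarial input would tend to yield larger ones; a simple classifier trained on such feature vectors could then act as the detector.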

The talk and the corresponding paper were published at the CVPR 2020 virtual conference.

Similar Papers

26/04/2020

Improving Adversarial Robustness Requires Revisiting Misclassified Examples

Yisen Wang, Difan Zou, Jinfeng Yi, James Bailey, Xingjun Ma, Quanquan Gu

Keywords: Robustness, Adversarial Defense, Adversarial Training

5:02

26/04/2020

GAT: Generative Adversarial Training for Adversarial Example Detection and Classification

Xuwang Yin, Soheil Kolouri, Gustavo K Rohde

Keywords: adversarial example detection, adversarial examples classification, robust optimization, ML security, generative modeling, generative classification

5:14

13/04/2021

Quantifying the privacy risks of learning high-dimensional graphical models

Sasi Kumar Murakonda, Reza Shokri, George Theodorakopoulos

3:15

06/12/2021

Adversarial Feature Desensitization

Pouya Bashivan, Reza Bayat, Adam Ibrahim, Kartik Ahuja, Mojtaba Faramarzi, Touraj Laleh, Blake Richards, Irina Rish

Keywords: deep learning, robustness, adversarial robustness and security, domain adaptation

13:27

18/07/2021

RNNRepair: Automatic RNN Repair via Model-based Analysis

Xiaofei Xie, Wenbo Guo, Lei Ma, Wei Le, Jian Wang, Lingjun Zhou, Yang Liu, Xinyu Xing

Keywords: Social Aspects of Machine Learning, Privacy, Anonymity, and Security

5:21

14/06/2020

A Self-supervised Approach for Adversarial Robustness

Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Fatih Porikli

Keywords: self-supervision, defense, attack, perceptual-features, classification, segmentation, object-detection, adversarial-learning, dynamic-defense, zero-shot

5:01

12/07/2020

Adversarial Neural Pruning with Latent Vulnerability Suppression

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Keywords: Adversarial Examples

14:43

14/06/2020

One Man’s Trash Is Another Man’s Treasure: Resisting Adversarial Examples by Adversarial Examples

Chang Xiao, Changxi Zheng

Keywords: adversarial defense, adversarial examples, adversarial learning, deep learning

1:01

03/05/2021

Protecting DNNs from Theft using an Ensemble of Diverse Models

Sanjay Kariyappa, Atul Prakash, Moinuddin K Qureshi

Keywords: machine learning security, Model stealing

5:05

06/12/2021

Anti-Backdoor Learning: Training Clean Models on Poisoned Data

Yige Li, Xixiang Lyu, Nodens Koren, Lingjuan Lyu, Bo Li, Xingjun Ma

Keywords: deep learning

7:58

06/12/2021

Network-to-Network Regularization: Enforcing Occam's Razor to Improve Generalization

Rohan Ghosh, Mehul Motani

Keywords: theory, deep learning, machine learning

14:07

18/07/2021

Double-Win Quant: Aggressively Winning Robustness of Quantized Deep Neural Networks via Random Precision Training and Inference

Yonggan Fu, Qixuan Yu, Meng Li, Vikas Chandra, Yingyan Lin

Keywords: Algorithms, Adversarial Examples

5:20

06/12/2021

Excess Capacity and Backdoor Poisoning

Naren Manoj, Avrim Blum

Keywords: theory, machine learning, robustness, adversarial robustness and security

10:38

06/12/2021

Qu-ANTI-zation: Exploiting Quantization Artifacts for Achieving Adversarial Outcomes

Sanghyun Hong, Michael-Andrei Panaitescu-Liess, Yigitcan Kaya, Tudor Dumitras

Keywords: deep learning, adversarial robustness and security, federated learning

12:20

02/02/2021

Error-Correcting Output Codes with Ensemble Diversity for Robust Learning in Neural Networks

Yang Song, Qiyu Kang, Wee Peng Tay

15:04

04/07/2020

End-to-End Bias Mitigation by Modelling Biases in Corpora

Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson

Keywords: End-to-End Mitigation, real-world scenarios, training, large-scale benchmarks

10:57

06/12/2021

Robust Deep Reinforcement Learning through Adversarial Loss

Tuomas Oikarinen, Wang Zhang, Alexandre Megretski, Luca Daniel, Tsui-Wei Weng

Keywords: reinforcement learning and planning, robustness, adversarial robustness and security

14:15

06/12/2021

Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks

Yonggan Fu, Qixuan Yu, Yang Zhang, Shang Wu, Xu Ouyang, David Cox, Yingyan Lin

Keywords: deep learning, robustness, adversarial robustness and security

12:48

05/01/2021

Defense-Friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty

Camilo Pestana, Wei Liu, David Glance, Ajmal Mian

4:56

19/04/2021

Evaluating neural model robustness for machine comprehension

Winston Wu, Dustin Arendt, Svitlana Volkova

11:41

14/06/2020

Old Is Gold: Redefining the Adversarially Learned One-Class Classifier Training Paradigm

Muhammad Zaigham Zaheer, Jin-Ha Lee, Marcella Astrid, Seung-Ik Lee

Keywords: anomaly detection, adversarial learning, one-class classification, autoencoder, novelty detection, outlier detection, semi supervised learning, ucsd pedestrian2, mnist, caltech-256

1:01

02/02/2021

Cascade Network with Guided Loss and Hybrid Attention for Finding Good Correspondences

Zhi Chen, Fan Yang, Wenbing Tao

17:32

06/12/2021

Backdoor Attack with Imperceptible Input and Latent Modification

Khoa D Doan, Yingjie Lao, Ping Li

Keywords: deep learning, optimization, adversarial robustness and security, generative model

12:27

18/07/2021

Towards Better Robust Generalization with Shift Consistency Regularization

Shufei Zhang, Zhuang Qian, Kaizhu Huang, Qiufeng Wang, Rui Zhang, Xinping Yi

Keywords: Algorithms, Adversarial Examples

5:44

06/12/2020

Learning from Failure: De-biasing Classifier from Biased Classifier

Junhyun Nam, Hyuntak Cha, Sungsoo Ahn, Jaeho Lee, Jinwoo Shin

3:21

14/06/2020

Benchmarking Adversarial Robustness on Image Classification

Yinpeng Dong, Qi-An Fu, Xiao Yang, Tianyu Pang, Hang Su, Zihao Xiao, Jun Zhu

Keywords: adversarial robustness, benchmark, evaluation, security, attack, defense, image classification

4:59

18/07/2021

Instabilities of Offline RL with Pre-Trained Neural Representation

Ruosong Wang, Yifan Wu, Russ Salakhutdinov, Sham Kakade

Keywords: Reinforcement Learning and Planning

5:19

12/07/2020

Alleviating Privacy Attacks via Causal Learning

Shruti Tople, Amit Sharma, Aditya Nori

Keywords: Privacy-preserving Statistics and Machine Learning

14:45

02/02/2021

Improving Sample Efficiency in Model-Free Reinforcement Learning from Images

Denis Yarats, Amy Zhang, Ilya Kostrikov, Brandon Amos, Joelle Pineau, Rob Fergus

12:19

06/12/2020

Posterior Re-calibration for Imbalanced Datasets

Junjiao Tian, Yen-Cheng Liu, Nathaniel Glaser, Yen-Chang Hsu, Zsolt Kira

Keywords: Algorithms -> Few-Shot Learning, Applications -> Computer Vision

3:23

03/05/2021

How Benign is Benign Overfitting?

Amartya Sanyal, Puneet Dokania, Varun Kanade, Philip Torr

Keywords: generalization, memorization, benign overfitting, adversarial robustness

10:56

26/04/2020

Nesterov Accelerated Gradient and Scale Invariance for Adversarial Attacks

Jiadong Lin, Chuanbiao Song, Kun He, Liwei Wang, John E. Hopcroft

Keywords: adversarial examples, adversarial attack, transferability, Nesterov accelerated gradient, scale invariance

3:59

18/07/2021

Learning to Generate Noise for Multi-Attack Robustness

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Keywords: Applications, Privacy, Anonymity, and Security, Probabilistic Methods, MCMC, Algorithms, Adversarial Examples

5:12

06/12/2020

Hold me tight! Influence of discriminative features on deep network boundaries

Guillermo Ortiz-Jimenez, Apostolos Modas, Seyed Moosavi-Dezfooli, Pascal Frossard

3:18

26/04/2020

Defending Against Physically Realizable Attacks on Image Classification

Tong Wu, Liang Tong, Yevgeniy Vorobeychik

Keywords: defense against physical attacks, adversarial machine learning

4:57

04/07/2020

Word-level Textual Adversarial Attacking as Combinatorial Optimization

Yuan Zang, Fanchao Qi, Chenghao Yang, Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun

Keywords: Textual attacking, Word-level attacking, combinatorial problem, Word-level Attacking

9:34

14/06/2020

Learn2Perturb: An End-to-End Feature Perturbation Learning to Improve Adversarial Robustness

Ahmadreza Jeddi, Mohammad Javad Shafiee, Michelle Karg, Christian Scharfenberger, Alexander Wong

Keywords: adversarial robustness, network randomization, alternative back-propagation, trainable noise, adversarial training

1:01

08/12/2020

Enhancing Neural Models with Vulnerability via Adversarial Attack

Rong Zhang, Qifei Zhou, Bo An, Weiping Li, Tong Mo, Bo Wu

14:47

14/06/2020

A Programmatic and Semantic Approach to Explaining and Debugging Neural Network Based Object Detectors

Edward Kim, Divya Gopinath, Corina Păsăreanu, Sanjit A. Seshia

Keywords: population-level explanation, testing, perception, neural network, blackbox, scenario, object detection, machine learning, autonomous driving

4:58

02/02/2021

Membership Privacy for Machine Learning Models Through Knowledge Transfer

Virat Shejwalkar, Amir Houmansadr

18:01