Deep Feature Space Trojan Attack of Neural Networks by Controlled Detoxification

Abstract: A trojan (backdoor) attack is a form of adversarial attack on deep neural networks in which the attacker provides victims with a model trained or retrained on malicious data. The backdoor is activated when a normal input is stamped with a certain pattern, called the trigger, causing misclassification. Many existing trojan attacks use triggers that are input-space patches or objects (e.g., a polygon with solid color) or simple input transformations such as Instagram filters. These simple triggers are susceptible to recent backdoor detection algorithms. We propose a novel deep feature space trojan attack with five characteristics: effectiveness, stealthiness, controllability, robustness, and reliance on deep features. We conduct extensive experiments on 9 image classifiers on various datasets, including ImageNet, to demonstrate these properties and show that our attack can evade state-of-the-art defenses.

14/06/2020

04/07/2020

06/12/2021

Siyuan Cheng, Yingqi Liu, Shiqing Ma, Xiangyu Zhang

Similar Papers

TBT: Targeted Neural Network Attack With Bit Trojan

Adnan Siraj Rakin, Zhezhi He, Deliang Fan

Keywords: trojan, targeted weight attack, bit-flip, row-hammer, security of dnn

Topological Detection of Trojaned Neural Networks

Songzhu Zheng, Yikai Zhang, Hubert Wagner, Mayank Goswami, Chao Chen

Keywords: deep learning

Multi-Target Invisibly Trojaned Networks for Visual Recognition and Detection

Xinzhe Zhou, Wenhao Jiang, Sheng Qi, Yadong Mu

Keywords: Machine Learning, Adversarial Machine Learning

Scalable Backdoor Detection in Neural Networks

Haripriya Harikumar, Vuong Le, Santu Rana, Sourangshu Bhattacharya, Sunil Gupta, Svetha Venkatesh

Keywords: trojan attack, backdoor detection, deep learning model, optimisation

Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits

Jiawang Bai, Baoyuan Wu, Yong Zhang, Yiming Li, Zhifeng Li, Shu-Tao Xia

Keywords: weight attack, bit-flip, targeted attack

Deep Neural Network Fingerprinting by Conferrable Adversarial Examples

Nils Lukas, Yuxuan Zhang, Florian Kerschbaum

Keywords: Adversarial Examples, Conferrability, Transferability, Fingerprinting

Backdoor Attack with Imperceptible Input and Latent Modification

Khoa D Doan, Yingjie Lao, Ping Li

Keywords: deep learning, optimization, adversarial robustness and security, generative model

Private Image Reconstruction from System Side Channels Using Generative Models

Yuanyuan Yuan, Shuai Wang, Junping Zhang

Keywords: side channel analysis

A Unified Multi-Scenario Attacking Network for Visual Object Tracking

Xuesong Chen, Canmiao Fu, Feng Zheng, Yong Zhao, Hongsheng Li, Ping Luo, Guo-Jun Qi

Adversarial Example Games

Joey Bose, Gauthier Gidel, Hugo Berard, Andre Cianflone, Pascal Vincent, Simon Lacoste-Julien, Will Hamilton

DeepHammer: Depleting the Intelligence of Deep Neural Networks through Targeted Chain of Bit Flips

Fan Yao, Adnan Siraj Rakin, Deliang Fan

Variational Model Inversion Attacks

Kuan-Chieh Wang, YAN FU, Ke Li, Ashish Khisti, Richard Zemel, Alireza Makhzani

Keywords: deep learning, generative model

Input-Aware Dynamic Backdoor Attack

Tuan Anh Nguyen, Anh Tran

Enhancing Cross-Task Black-Box Transferability of Adversarial Examples With Dispersion Reduction

Yantao Lu, Yunhan Jia, Jianyu Wang, Bai Li, Weiheng Chai, Lawrence Carin, Senem Velipasalar

Keywords: adversarial example, black-box attack, cross tasks, transferability, deep neural network

Defending Against Model Stealing Attacks With Adaptive Misinformation

Sanjay Kariyappa, Moinuddin K. Qureshi

Keywords: model stealing, security, machine learning, deep learning

Practical No-box Adversarial Attacks against DNNs

Qizhang Li, Yiwen Guo, Hao Chen

Multi-Teacher Single-Student Visual Transformer with Multi-Level Attention for Face Spoofing Detection

Yao-Hui Huang, Jun-Wei Hsieh, Ming-Ching Chang, Lipeng Ke, Siwei Lyu, Arpita Samanta Santra

Keywords: Image Processing, liveness detection, Face Anti-Spoofing, presentation attacks

AdvFlow: Inconspicuous Black-box Adversarial Attacks using Normalizing Flows

Hadi Mohaghegh Dolatabadi, Sarah Erfani, Christopher Leckie

LG-GAN: Label Guided Adversarial Network for Flexible Targeted Attack of Point Cloud Based Deep Networks

Hang Zhou, Dongdong Chen, Jing Liao, Kejiang Chen, Xiaoyi Dong, Kunlin Liu, Weiming Zhang, Gang Hua, Nenghai Yu

Keywords: adversarial attack, point cloud recognition, generative models

MVP: Detecting Vulnerabilities using Patch-Enhanced Vulnerability Signatures

Yang Xiao, Bihuan Chen, Chendong Yu, Zhengzi Xu, Zimu Yuan, Feng Li, Binghong Liu, Yang Liu, Wei Huo, Wei Zou, Wenchang Shi

KOOBE: Towards Facilitating Exploit Generation of Kernel Out-Of-Bounds Write Vulnerabilities

Weiteng Chen, Xiaochen Zou, Guoren Li, Zhiyun Qian
