Understanding the Effect of Bias in Deep Anomaly Detection

Abstract: Anomaly detection presents a unique challenge in machine learning due to the scarcity of labeled anomaly data. Recent work attempts to mitigate this problem by augmenting the training of deep anomaly detection models with additional labeled anomaly samples. However, the labeled data often does not align with the target distribution and introduces harmful bias into the trained model. In this paper, we aim to understand the effect of a biased anomaly set on anomaly detection. Concretely, we view anomaly detection as a supervised learning task where the objective is to optimize the recall at a given false positive rate. We formally study the relative scoring bias of an anomaly detector, defined as the difference in performance with respect to a baseline anomaly detector. We establish the first finite sample rates for estimating the relative scoring bias for deep anomaly detection, and empirically validate our theoretical results on both synthetic and real-world datasets. We also provide an extensive empirical study of how a biased training anomaly set affects the anomaly score function and therefore the detection performance on different anomaly classes. Our study demonstrates scenarios in which the biased anomaly set can be useful or problematic, and provides a solid benchmark for future research.
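The two quantities named in the abstract, recall at a given false positive rate and the relative scoring bias, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the thresholding rule (an order statistic of the normal scores), the sign convention (detector minus baseline), and the toy scores are all assumptions made for illustration.

```python
import numpy as np


def recall_at_fpr(normal_scores, anomaly_scores, target_fpr):
    """Recall on anomalies at a threshold whose empirical false positive
    rate on the normal scores is at most `target_fpr`.

    Assumes higher scores mean "more anomalous".
    """
    normal = np.sort(np.asarray(normal_scores, dtype=float))
    anomalies = np.asarray(anomaly_scores, dtype=float)
    n = normal.size
    # Smallest order statistic with at most target_fpr * n normal scores strictly above it.
    k = int(np.ceil((1.0 - target_fpr) * n)) - 1
    k = min(max(k, 0), n - 1)
    threshold = normal[k]
    return float(np.mean(anomalies > threshold))


def relative_scoring_bias(detector, baseline, target_fpr):
    """Recall of `detector` minus recall of `baseline` at the same target FPR.

    Each argument is a (normal_scores, anomaly_scores) pair. The sign
    convention (detector minus baseline) is an assumption of this sketch.
    """
    return recall_at_fpr(*detector, target_fpr) - recall_at_fpr(*baseline, target_fpr)


# Toy scores, made up purely for illustration.
baseline = ([0.1, 0.2, 0.3, 0.9], [0.25, 0.35, 0.5, 0.95])
biased = ([0.1, 0.2, 0.3, 0.4], [0.2, 0.25, 0.6, 0.8])

bias = relative_scoring_bias(biased, baseline, target_fpr=0.25)
print(bias)  # -0.25
```

In this toy case the baseline recalls 3 of 4 anomalies and the second detector recalls 2 of 4 at the same false positive rate, so the estimated bias is negative: the second detector does worse than the baseline on this anomaly set.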

18/07/2021

anomaly detection, adversarial learning, one-class classification, autoencoder, novelty detection, outlier detection, semi-supervised learning, ucsd pedestrian2, mnist, caltech-256

Ziyu Ye, Yuxin Chen, Haitao Zheng

Similar Papers

RATT: Leveraging Unlabeled Data to Guarantee Generalization

Saurabh Garg, Sivaraman Balakrishnan, Zico Kolter, Zachary Lipton

Probabilistic Methods, Graphical Models, Theory, Computational Complexity, Theory, Models of Learning and Generalization

Good classifiers are abundant in the interpolating regime

Ryan Theisen, Jason Klusowski, Michael Mahoney

Understanding the failure modes of out-of-distribution generalization

Vaishnavh Nagarajan, Anders J Andreassen, Behnam Neyshabur

theoretical study, spurious correlations, out-of-distribution generalization, empirical risk minimization

Cross-Domain Few-Shot Classification via Adversarial Task Augmentation

Haoqing Wang, Zhi-Hong Deng

Computer Vision, Recognition, Adversarial Machine Learning, Deep Learning

STEP: Out-of-Distribution Detection in the Presence of Limited In-Distribution Labeled Data

Zhi Zhou, Lan-Zhe Guo, Zhanzhan Cheng, Yu-Feng Li, Shiliang Pu

optimization, semi-supervised learning

Unbiased Classification through Bias-Contrastive and Bias-Balanced Learning

Youngkyu Hong, Eunho Yang

machine learning, contrastive learning, fairness

DaST: Data-Free Substitute Training for Adversarial Attacks

Mingyi Zhou, Jing Wu, Yipeng Liu, Shuaicheng Liu, Ce Zhu

adversarial attacks, machine learning, generative adversarial networks, computer vision

VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain

Jinsung Yoon, Yao Zhang, James Jordon, Mihaela van der Schaar

Representation Learning With Statistical Independence to Mitigate Bias

Ehsan Adeli, Qingyu Zhao, Adolf Pfefferbaum, Edith V. Sullivan, Li Fei-Fei, Juan Carlos Niebles, Kilian M. Pohl

Fair Generative Modeling via Weak Supervision

Kristy Choi, Aditya Grover, Trisha Singh, Rui Shu, Stefano Ermon

Deep Learning - Generative Models and Autoencoders

RCA: A Deep Collaborative Autoencoder Approach for Anomaly Detection

Boyang Liu, Ding Wang, Kaixiang Lin, Pang-Ning Tan, Jiayu Zhou

Data Mining, Anomaly/Outlier Detection, Unsupervised Learning

What Neural Networks Memorize and Why: Discovering the Long Tail via Influence Estimation

Vitaly Feldman, Chiyuan Zhang

Learning Semantic Context from Normal Samples for Unsupervised Anomaly Detection

Xudong Yan, Huaidong Zhang, Xuemiao Xu, Xiaowei Hu, Pheng-Ann Heng

Old Is Gold: Redefining the Adversarially Learned One-Class Classifier Training Paradigm

Muhammad Zaigham Zaheer, Jin-Ha Lee, Marcella Astrid, Seung-Ik Lee

anomaly detection, adversarial learning, one-class classification, autoencoder, novelty detection, outlier detection, semi-supervised learning, ucsd pedestrian2, mnist, caltech-256

RNNRepair: Automatic RNN Repair via Model-based Analysis

Xiaofei Xie, Wenbo Guo, Lei Ma, Wei Le, Jian Wang, Lingjun Zhou, Yang Liu, Xinyu Xing

Social Aspects of Machine Learning, Privacy, Anonymity, and Security

Error-Bounded Correction of Noisy Labels

Songzhu Zheng, Pengxiang Wu, Aman Goswami, Mayank Goswami, Dimitris Metaxas, Chao Chen

Deep Learning - Algorithms

Identifying and Correcting Label Bias in Machine Learning

Heinrich Jiang, Ofir Nachum

On Recovering from Modeling Errors Using Testing Bayesian Networks

Haiying Huang, Adnan Darwiche

Probabilistic Methods, Graphical Models

Does learning require memorization? A short tale about a long tail

Vitaly Feldman

Long-tailed Distribution, Privacy-preserving Learning, Interpolation, Overfitting, Generalization

Few-Shot Learning via Feature Hallucination With Variational Inference

Qinxuan Luo, Lingfeng Wang, Jingguo Lv, Shiming Xiang, Chunhong Pan

G2D: Generate to Detect Anomaly

Masoud Pourreza, Bahram Mohammadi, Mostafa Khaki, Samir Bouindour, Hichem Snoussi, Mohammad Sabokrou

Zhen Fang, Jie Lu, Anjin Liu, Feng Liu, Guangquan Zhang

James Lucas, Mengye Ren, Irene Raissa KAMENI KAMENI, Toniann Pitassi, Richard Zemel

Ragav Sachdeva, Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro
