Group masked autoencoder based density estimator for audio anomaly detection

02/11/2020

Ritwik Giri, Fangzhou Cheng, Karim Helwani, Srikanth V. Tenneti, Umut Isik, Arvindh Krishnaswamy

Abstract: In this paper, we address the problem of detecting previously unseen anomalous audio events when the training dataset itself does not contain any examples of anomalies. While traditional density estimation techniques, such as the Gaussian Mixture Model (GMM), have shown promise in the past for this problem, recent advances in neural density estimation have made neural approaches suitable for the anomaly detection task. In this work, we develop a novel neural density estimation technique based on the Group-Masked Autoencoder that estimates the density of an audio time series by taking into account the intra-frame statistics of the signal. Our proposed approach has been validated using the DCASE 2020 challenge dataset (Task 2 - Unsupervised Detection of Anomalous Sounds for Machine Condition Monitoring). We demonstrate the effectiveness of our approach by comparing it against the baseline autoencoder model and against the recently proposed Interpolating Deep Neural Network (IDNN) model.
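
The general idea of density-based anomaly detection mentioned in the abstract (fit a density model on normal-only data, then flag inputs with low likelihood) can be illustrated with a minimal sketch. This is not the authors' Group-Masked Autoencoder, nor a GMM: it swaps in a single diagonal Gaussian as the simplest possible density model, and the synthetic "frames", the feature dimension, and the function names `fit_gaussian` and `anomaly_score` are all assumptions made for illustration.

```python
import numpy as np

def fit_gaussian(frames):
    """Fit a diagonal Gaussian to normal-only training frames (rows = frames)."""
    mean = frames.mean(axis=0)
    var = frames.var(axis=0) + 1e-6  # small floor keeps every variance positive
    return mean, var

def anomaly_score(frames, mean, var):
    """Average negative log-likelihood per frame; higher means more anomalous."""
    nll = 0.5 * (np.log(2 * np.pi * var) + (frames - mean) ** 2 / var)
    return float(nll.sum(axis=1).mean())

# Synthetic stand-ins for audio feature frames (assumption: 8 features per frame).
rng = np.random.default_rng(0)
train_normal = rng.normal(0.0, 1.0, size=(2000, 8))    # training set: normal only
test_normal = rng.normal(0.0, 1.0, size=(200, 8))      # unseen normal clip
test_anomalous = rng.normal(3.0, 1.0, size=(200, 8))   # clip from a shifted distribution

mean, var = fit_gaussian(train_normal)
score_normal = anomaly_score(test_normal, mean, var)
score_anomalous = anomaly_score(test_anomalous, mean, var)
```

In this toy setup the shifted clip receives a higher score than the unseen normal clip, so thresholding the score separates the two. A neural density estimator would replace `fit_gaussian` and the likelihood computation while keeping the same scoring logic.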

The talk and the respective paper were published at the DCASE 2020 virtual conference.

Similar Papers

02/11/2020

Self-supervised classification for detecting anomalous sounds

Ritwik Giri, Srikanth V. Tenneti, Fangzhou Cheng, Karim Helwani, Umut Isik, Arvindh Krishnaswamy

13:28

03/05/2021

Neural Synthesis of Binaural Speech From Mono Audio

Alexander Richard, Dejan Markovic, Israel Gebru, Steven Krenn, Gladstone A Butler, Fernando Torre, Yaser Sheikh

Keywords: speech generation, speech processing, binaural speech, neural sound synthesis, sound spatialization, binaural audio

15:00

06/12/2021

Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport

Hsin-Yi Lin, Huan-Hsin Tseng, Xugang Lu, Yu Tsao

Keywords: theory, machine learning, adversarial robustness and security, domain adaptation, optimal transport

14:40

02/11/2020

Anomalous sound detection as a simple binary classification problem with careful selection of proxy outlier examples

Paul Primus, Verena Haunschmid, Patrick Praher, Gerhard Widmer

15:23

26/04/2020

From Variational to Deterministic Autoencoders

Partha Ghosh, Mehdi S. M. Sajjadi, Antonio Vergari, Michael Black, Bernhard Scholkopf

Keywords: Unsupervised learning, Generative Models, Variational Autoencoders, Regularization

4:59

18/07/2021

Non-Negative Bregman Divergence Minimization for Deep Direct Density Ratio Estimation

Masahiro Kato, Takeshi Teshima

Keywords: Algorithms, Semi-Supervised Learning

5:17

26/04/2020

High Fidelity Speech Synthesis with Adversarial Networks

Mikołaj Bińkowski, Jeff Donahue, Sander Dieleman, Aidan Clark, Erich Elsen, Norman Casagrande, Luis C. Cobo, Karen Simonyan

Keywords: text-to-speech, speech synthesis, audio synthesis, GANs, generative adversarial networks, implicit generative models

15:07

03/05/2021

Multiscale Score Matching for Out-of-Distribution Detection

Ahsan Mahmood, Junier Oliva, Martin A Styner

Keywords: out-of-distribution detection, deep learning, score matching, outlier detection

5:13

18/07/2021

Uniform Convergence, Adversarial Spheres and a Simple Remedy

Gregor Bachmann, Seyed Moosavi, Thomas Hofmann

Keywords: Theory, Deep learning Theory

5:52

12/07/2020

Predictive Sampling with Forecasting Autoregressive Models

Auke Wiggers, Emiel Hoogeboom

Keywords: Deep Learning - Generative Models and Autoencoders

12:23

02/02/2021

Denoising Distantly Supervised Named Entity Recognition via a Hypergeometric Probabilistic Model

Wenkai Zhang, Hongyu Lin, Xianpei Han, Le Sun, Huidan Liu, Zhicheng Wei, Nicholas Yuan

19:22

26/04/2020

Meta Dropout: Learning to Perturb Latent Features for Generalization

Hae Beom Lee, Taewook Nam, Eunho Yang, Sung Ju Hwang

4:46

26/04/2020

On Generalization Error Bounds of Noisy Gradient Methods for Non-Convex Learning

Jian Li, Xuanyuan Luo, Mingda Qiao

Keywords: learning theory, generalization, nonconvex learning, stochastic gradient descent, Langevin dynamics

4:50

18/07/2021

On the Inherent Regularization Effects of Noise Injection During Training

Oussama Dhifallah, Yue Lu

Keywords: Theory, Statistical Learning Theory

5:25

06/12/2021

A universal probabilistic spike count model reveals ongoing modulation of neural variability

David Liu, Mate Lengyel

Keywords: generative model, kernel methods

15:06

06/12/2021

Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models

Yi Sui, Ga Wu, Scott Sanner

Keywords: deep learning, optimization, machine learning, vision

10:29

18/07/2021

SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform

Yuhang He, Niki Trigoni, Andrew Markham

Keywords: Applications, Audio and Speech Processing

4:34

14/06/2020

Watch Your Up-Convolution: CNN Based Generative Deep Neural Networks Are Failing to Reproduce Spectral Distributions

Ricard Durall, Margret Keuper, Janis Keuper

Keywords: spectral regularization, gan, deepfake, up-convolution, generative models, frequency spectrum

1:00

03/08/2020

MaskAAE: Latent space optimization for Adversarial Auto-Encoders

Arnab Mondal, Sankalan Pal Chowdhury, Aravind Jayendran, Himanshu Asnani, Parag Singla, Prathosh A P

7:54

30/11/2020

GAN-based Noise Model for Denoising Real Images

Linh Duy Tran, Son Minh Nguyen, Masayuki Arai

7:13

06/12/2021

Scalable Bayesian GPFA with automatic relevance determination and discrete noise models

Kristopher Jensen, Ta-Chu Kao, Jasmine Stone, Guillaume Hennequin

Keywords: optimization, neuroscience, generative model, kernel methods

7:23

06/12/2020

Self-Adaptive Training: beyond Empirical Risk Minimization

Lang Huang, Chao Zhang, Hongyang Zhang

Keywords: Deep Learning -> Generative Models, Algorithms -> Semi-Supervised Learning

3:23

02/11/2020

Sound event localization and detection based on CRNN using rectangular filters and channel rotation data augmentation

Francesca Ronchini, Daniel Arteaga, Andrés Pérez-López

12:51

06/12/2020

Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and Robustness

Long Zhao, Ting Liu, Xi Peng, Dimitris Metaxas

3:22

04/07/2020

Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation

Bei Li, Hui Liu, Ziyang Wang, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu, Changliang Li

Keywords: Context-Aware Translation, document-level translation, document-level NMT, document-level

6:42

06/12/2020

Factorized Neural Processes for Neural Processes: K-Shot Prediction of Neural Responses

Ronald (James) Cotton, Fabian Sinz, Andreas Tolias

3:18

14/09/2020

MMCNN: A Multi-branch Multi-scale Convolutional Neural Network for Motor Imagery Classification

Ziyu Jia, Youfang Lin, Jing Wang, Kaixin Yang, Tianhang Liu, Xinwang Zhang

Keywords: motor imagery, convolutional neural network, eeg signal, brain–computer interface

12:20

06/12/2021

Learning to Generate Visual Questions with Noisy Supervision

Shen Kai, Lingfei Wu, Siliang Tang, Yueting Zhuang, Zhen He, Zhuoye Ding, Yun Xiao, Bo Long

Keywords: generative model

14:54

03/05/2021

Noise against noise: stochastic label noise helps combat inherent label noise

Pengfei Chen, Guangyong Chen, Junjie Ye, Jingwei Zhao, Pheng-Ann Heng

Keywords: Regularization, SGD noise, Robust Learning, Noisy Labels

9:42

06/12/2021

On the Frequency Bias of Generative Models

Katja Schwarz, Yiyi Liao, Andreas Geiger

Keywords: generative model

11:09

30/11/2020

Do We Need Sound for Sound Source Localization?

Takashi Oya, Shohei Iwase, Ryota Natsume, Takahiro Itazuri, Shugo Yamaguchi, Shigeo Morishima

8:43

06/12/2021

NORESQA: A Framework for Speech Quality Assessment using Non-Matching References

Pranay Manocha, Buye Xu, Anurag Kumar

Keywords: deep learning, robustness, self-supervised learning

14:30

02/02/2021

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Tejas Gokhale, Rushil Anirudh, Bhavya Kailkhura, Jayaraman J. Thiagarajan, Chitta Baral, Yezhou Yang

19:57

06/12/2021

Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples

Kanghyun Choi, Deokki Hong, Noseong Park, Youngsok Kim, Jinho Lee

Keywords: deep learning, privacy

11:56

14/06/2020

Noise Robust Generative Adversarial Networks

Takuhiro Kaneko, Tatsuya Harada

Keywords: generative adversarial networks (gans), image synthesis, noise robust models, image denoising, deep generative models, adversarial training, reparameterization trick, transformation constraint, image restoration, weakly supervised learning

1:01

19/08/2021

Rethink the Connections among Generalization, Memorization, and the Spectral Bias of DNNs

Xiao Zhang, Haoyi Xiong, Dongrui Wu

Keywords: Machine Learning, Deep Learning, Learning Theory

7:22

02/11/2020

Domain-adversarial training and trainable parallel front-end for the DCASE 2020 task 4 sound event detection challenge

Samuele Cornell, Michel Olvera, Manuel Pariente, Giovanni Pepe, Emanuele Principi, Leonardo Gabrielli, Stefano Squartini

9:59

02/11/2020

Searching for efficient network architectures for acoustic scene classification

Yuzhong Wu, Tan Lee

14:37

06/12/2021

Explaining heterogeneity in medial entorhinal cortex with task-driven neural networks

Aran Nayebi, Alexander Attinger, Malcolm Campbell, Kiah Hardcastle, Isabel Low, Caitlin S Mallory, Gabriel Mel, Ben Sorscher, Alex H Williams, Surya Ganguli, Lisa Giocomo, Dan Yamins

Keywords: deep learning

14:10

26/04/2020

Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech

David Harwath, Wei-Ning Hsu, James Glass

Keywords: visually-grounded speech, self-supervised learning, discrete representation learning, vision and language, vision and speech, hierarchical representation learning

13:42