Variational Autoencoders for Sparse and Overdispersed Discrete Data

26/08/2020

Variational Autoencoders for Sparse and Overdispersed Discrete Data

He Zhao, Piyush Rai, Lan Du, Wray Buntine, Dinh Phung, Mingyuan Zhou

Keywords:

Abstract Paper Similar Papers

Abstract: Many applications, such as text modelling, high-throughput sequencing, and recommender systems, require analysing sparse, high-dimensional, and overdispersed discrete (count or binary) data. Recent deep probabilistic models based on variational autoencoders (VAE) have shown promising results on discrete data but may have inferior modelling performance due to the insufficient capability in modelling overdispersion and model misspecification. To address these issues, we develop a VAE-based framework using the negative binomial distribution as the data distribution. We also provide an analysis of its properties vis-\`{a}-vis other models. We conduct extensive experiments on three problems from discrete data analysis: text analysis/topic modelling, collaborative filtering, and multi-label learning. Our models outperform state-of-the-art approaches on these problems, while also capturing the phenomenon of overdispersion more effectively.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AISTATS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

Neural Mixed Counting Models for Dispersed Topic Discovery

Jiemin Wu, Yanghui Rao, Zusheng Zhang and
Haoran Xie, Qing Li, Fu Lee Wang, Ziye Chen

Keywords Paper

Dispersed Discovery, mining topics, Neural Models, Mixed models

0

0

0

0

10:29

14/06/2020

Deep Generative Model for Robust Imbalance Classification

Xinyue Wang, Yilin Lyu, Liping Jing

Keywords Paper

imbalance classification, deep generative classifier, generative modelrobust classification

0

0

0

0

1:01

18/07/2021

Post-selection inference with HSIC-Lasso

Tobias Freidling, Benjamin Poignard, Héctor Climente-González, Makoto Yamada

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:03

30/11/2020

MLIFeat: Multi-level information fusion based deep local features

Yuyang Zhang Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences and
Jinge Wang, Shibiao Xu, Xiao Liu, Xiaopeng Zhang

Keywords Paper

0

0

0

0

5:28

26/04/2020

Short and Sparse Deconvolution --- A Geometric Approach

Yenson Lau, Qing Qu, Han-Wen Kuo and
Pengcheng Zhou, Yuqian Zhang, John Wright

Keywords Paper

0

0

0

0

7:18

18/07/2021

On Linear Identifiability of Learned Representations

Geoffrey Roeder, Luke Metz, Durk Kingma

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

5:11

04/07/2020

Perturbation Based Learning for Structured NLP tasks with Application to Dependency Parsing

Amichay Doitch, Ram Yazdi, Tamir Hazan, Roi Reichart

Keywords Paper

Structured tasks, Dependency Parsing, NLP, sampling

0

0

0

0

10:53

02/02/2021

Unsupervised Active Learning via Subspace Learning

Changsheng Li, Kaihang Mao, Lingyan Liang and
Dongchun Ren, Wei Zhang, Ye Yuan, Guoren Wang

Keywords Paper

0

0

0

0

16:45

16/11/2020

Short Text Topic Modeling with Topic Distribution Quantization and Negative Sampling Decoder

Xiaobao Wu, Chunping Li, Yan Zhu, Yishu Miao

Keywords Paper

decoding, short modeling, topic models, neural model

0

0

0

0

10:30

26/04/2020

Variational Autoencoders for Highly Multivariate Spatial Point Processes Intensities

Baichuan Yuan, Xiaowei Wang, Jianxin Ma and
Chang Zhou, Andrea L. Bertozzi, Hongxia Yang

Keywords Paper

VAE, collaborative filtering, recommender systems, spatial point process

0

0

0

0

4:58

06/12/2020

Probabilistic Circuits for Variational Inference in Discrete Graphical Models

Andy Shih, Stefano Ermon

Keywords Paper

0

0

0

0

3:18

06/12/2021

Overcoming the curse of dimensionality with Laplacian regularization in semi-supervised learning

Vivien Cabannes, Loucas Pillaud-Vivien, Francis Bach, Alessandro Rudi

Keywords Paper

machine learning, kernel methods, semi-supervised learning

0

0

0

0

14:24

19/08/2021

Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness

Dazhong Shen, Chuan Qin, Chao Wang and
Hengshu Zhu, Enhong Chen, Hui Xiong

Keywords Paper

Machine Learning, Bayesian Learning, Probabilistic Machine Learning, Unsupervised Learning

0

0

0

0

13:04

18/07/2021

Mandoline: Model Evaluation under Distribution Shift

Mayee Chen, Karan Goel, Nimit Sohoni and
Fait Poms, Kayvon Fatahalian, Christopher Re

Keywords Paper

Algorithms, Others

0

0

0

1

5:49

02/02/2021

Temporal Latent Auto-Encoder: A Method for Probabilistic Multivariate Time Series Forecasting

Nam Nguyen, Brian Quanz

Keywords Paper

0

0

0

0

21:03

14/09/2020

Weak approximation of transformed stochastic gradient MCMC

Soma Yokoi, Takuma Otsuka, Issei Sat

Keywords Paper

0

0

0

0

13:39

06/12/2020

Benchmarking Deep Learning Interpretability in Time Series Predictions

Aya Abdelsalam Ismail, Mohamed Gunady, Hector Corrada Bravo, Soheil Feizi

Keywords Paper

0

0

0

0

3:37

22/11/2021

SLURP: Side Learning Uncertainty for Regression Problems

Xuanlong Yu, Gianni Franchi, Emanuel Aldea

Keywords Paper

Uncertainty estimation, Confidence estimation, Auxiliary model, Monocular depth, Optical flow

0

0

0

0

3:03

06/12/2021

FINE Samples for Learning with Noisy Labels

Taehyeon Kim, Jongwoo Ko, sangwook Cho and
JinHwan Choi, Se-Young Yun

Keywords Paper

theory, deep learning, machine learning, vision, semi-supervised learning

0

1

0

0

11:09

06/12/2020

An implicit function learning approach for parametric modal regression

Yangchen Pan, Ehsan Imani, Amir-massoud Farahmand, Martha White

Keywords Paper

0

0

0

0

3:09

04/07/2020

Word-level Textual Adversarial Attacking as Combinatorial Optimization

Yuan Zang, Fanchao Qi, Chenghao Yang and
Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun

Keywords Paper

Textual attacking, Word-level attacking, combinatorial problem, Word-level Attacking

0

0

0

0

9:34

02/02/2021

Longitudinal Deep Kernel Gaussian Process Regression

Junjie Liang, Yanting Wu, Dongkuan Xu, Vasant G Honavar

Keywords Paper

0

0

0

0

16:27

18/07/2021

BasisDeVAE: Interpretable Simultaneous Dimensionality Reduction and Feature-Level Clustering with Derivative-Based Variational Autoencoders

Dominic Danks, Christopher Yau

Keywords Paper

Probabilistic Methods, Others

0

0

0

0

5:06

13/04/2021

Automatic differentiation variational inference with mixtures

Warren Morningstar, Sharad Vikram, Cusuh Ham and
Andrew Gallagher, Joshua Dillon

Keywords Paper

0

0

0

0

3:05

03/05/2021

Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

Kaidi Cao, Yining Chen, Junwei Lu and
Nikos Arechiga, Adrien Gaidon, Tengyu Ma

Keywords Paper

imbalanced learning, noise robust learning, deep learning

0

0

0

0

5:14

12/07/2020

Efficiently sampling functions from Gaussian process posteriors

James Wilson, Viacheslav Borovitskiy, Alexander Terenin and
Peter Mostowsky, Marc Deisenroth

Keywords Paper

Gaussian Processes

0

0

0

0

14:40

22/11/2021

Duplicate Latent Representation Suppression for Multi-object Variational Autoencoders

Li Nanbo, Robert B Fisher

Keywords Paper

object-centric representation learning, variational autoencoders, scene representation

0

0

0

0

2:58

05/04/2021

SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier Detection

Yue Zhao, Xiyang Hu, Cheng Cheng and
Cong Wang, Changlin Wan, Wen Wang, Jianing Yang, Haoping Bai, Zheng Li, Cao Xiao, Yunlong Wang, Zhi Qiao, Jimeng Sun, Leman Akoglu

Keywords Paper

Algorithms -> Adversarial Learning, Algorithms -> Image Segmentation; Algorithms -> Semi-Supervised Learning; Applications -> Computer Vision; Applications -> Imag

0

0

0

0

18:47

05/04/2021

SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier Detection

Yue Zhao, Xiyang Hu, Cheng Cheng and
Cong Wang, Changlin Wan, Wen Wang, Jianing Yang, Haoping Bai, Zheng Li, Cao Xiao, Yunlong Wang, Zhi Qiao, Jimeng Sun, Leman Akoglu

Keywords Paper

Algorithms -> Adversarial Learning, Algorithms -> Image Segmentation; Algorithms -> Semi-Supervised Learning; Applications -> Computer Vision; Applications -> Imag

0

0

0

0

4:53

06/12/2020

Walsh-Hadamard Variational Inference for Bayesian Deep Learning

Simone Rossi, Sebastien Marmin, Maurizio Filippone

Keywords Paper

0

0

0

0

2:59

06/12/2021

Bayesian Adaptation for Covariate Shift

Aurick Zhou, Sergey Levine

Keywords Paper

deep learning, machine learning, robustness, vision, domain adaptation

0

0

0

0

8:21

12/07/2020

A Chance-Constrained Generative Framework for Sequence Optimization

Xianggen Liu, Jian Peng, Qiang Liu, Sen Song

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

12:40

25/07/2020

A general knowledge distillation framework for counterfactual recommendation via uniform data

Dugang Liu, Pengxiang Cheng, Zhenhua Dong and
Xiuqiang He, Weike Pan, Zhong Ming

Keywords Paper

counterfactual learning, uniform data, recommender systems, knowledge distillation

0

0

0

0

14:06

13/04/2021

Comparing the value of labeled and unlabeled data in method-of-moments latent variable estimation

Mayee Chen, Benjamin Cohen-Wang, Stephen Mussmann and
Frederic Sala, Christopher Re

Keywords Paper

0

0

0

0

3:04

14/06/2020

Label Distribution Learning on Auxiliary Label Space Graphs for Facial Expression Recognition

Shikai Chen, Jianfeng Wang, Yuedong Chen and
Zhongchao Shi, Xin Geng, Yong Rui

Keywords Paper

facial expression, facial expression recognition, label distribution learning, annotation inconsistency

0

0

0

0

1:00

13/04/2021

Learning bijective feature maps for linear ICA

Alexander Camuto, Matthew Willetts, Chris Holmes and
Brooks Paige, Stephen Roberts

Keywords Paper

0

0

0

0

3:02

04/08/2021

Statistical Query Algorithms and Low Degree Tests Are Almost Equivalent

Matthew S Brennan, Guy Bresler, Sam Hopkins and
Jerry Li, Tselil Schramm

Keywords Paper

0

0

0

0

13:30

08/12/2020

Attention Transfer Network for Aspect-level Sentiment Classification

Fei Zhao, Zhen Wu, Xinyu Dai

Keywords Paper

0

0

0

0

13:34

12/07/2020

Automatic Reparameterisation of Probabilistic Programs

Maria Gorinova, Dave Moore, Matthew Hoffman

Keywords Paper

Probabilistic Inference - Models and Probabilistic Programming

0

0

0

0

15:40

02/02/2021

DeepPseudo: Pseudo Value Based Deep Learning Models for Competing Risk Analysis

Md Mahmudur Rahman, Koji Matsuo, Shinya Matsuzaki, Sanjay Purushotham

Keywords Paper

0

0

0

0

18:31