Finite mixture models do not reliably learn the number of components

Abstract: Scientists and engineers are often interested in learning the number of subpopulations (or components) present in a data set. A common suggestion is to use a finite mixture model (FMM) with a prior on the number of components. Past work has shown the resulting FMM component-count posterior is consistent; that is, the posterior concentrates on the true, generating number of components. But consistency requires the assumption that the component likelihoods are perfectly specified, which is unrealistic in practice. In this paper, we add rigor to data-analysis folk wisdom by proving that under even the slightest model misspecification, the FMM component-count posterior diverges: the posterior probability of any particular finite number of components converges to 0 in the limit of infinite data. Contrary to intuition, posterior-density consistency is not sufficient to establish this result. We develop novel sufficient conditions that are more realistic and easily checkable than those common in the asymptotics literature. We illustrate practical consequences of our theory on simulated and real data.

06/12/2020

Finite mixture models do not reliably learn the number of components

Diana Cai, Trevor Campbell, Tamara Broderick

Comments

Similar Papers

Approximate Cross-Validation for Structured Models

Soumya Ghosh, Will Stephenson, Stan Nguyen and Sameer Deshpande, Tamara Broderick

Keywords Abstract Paper

Adaptive, Distribution-Free Prediction Intervals for Deep Networks

Danijel Kivaranovic, Kory D. Johnson, Hannes Leeb

Keywords Abstract Paper

Continuous Latent Process Flows

Ruizhi Deng, Marcus Brubaker, Greg Mori, Andreas M Lehrmann

Keywords Abstract Paper

generative model

Deep reconstruction of strange attractors from time series

William Gilpin

Keywords Abstract Paper

Structured Dropout Variational Inference for Bayesian Neural Networks

Son Nguyen, Duong Nguyen, Khai Nguyen and Khoat Than, Hung Bui, Nhat Ho

Keywords Abstract Paper

deep learning, generative model

Infinite Gaussian Mixture Modeling with an Improved Estimation of the Number of Clusters

Avi Matza, Yuval Bistritz

Keywords Abstract Paper

Theoretical bounds on estimation error for meta-learning

James Lucas, Mengye Ren, Irene Raissa KAMENI KAMENI and Toniann Pitassi, Richard Zemel

Keywords Abstract Paper

meta learning, minimax risk, few-shot, lower bounds, learning theory

Stability and Generalization of Bilevel Programming in Hyperparameter Optimization

Fan Bao, Guoqiang Wu, Chongxuan LI and Jun Zhu, Bo Zhang

Keywords Abstract Paper

optimization

Repulsive Deep Ensembles are Bayesian

Francesco D'Angelo, Vincent Fortuin

Keywords Abstract Paper

deep learning, optimization

Overparameterization Improves Robustness to Covariate Shift in High Dimensions

Nilesh Tripuraneni, Ben Adlam, Jeffrey Pennington

Keywords Abstract Paper

theory, deep learning, machine learning, robustness

Adversarially robust estimate and risk analysis in linear regression

Yue Xing, Ruizhi Zhang, Guang Cheng

Keywords Abstract Paper

Backward-Compatible Prediction Updates: A Probabilistic Approach

Frederik Träuble, Julius von Kügelgen, Matthäus Kleindessner and Francesco Locatello, Bernhard Schölkopf, Peter Gehler

Keywords Abstract Paper

machine learning, vision

Conformal Bayesian Computation

Edwin Fong, Chris C Holmes

Keywords Abstract Paper

machine learning

Benign Overfitting in Multiclass Classification: All Roads Lead to Interpolation

Ke Wang, Vidya Muthukumar, Christos Thrampoulidis

Keywords Abstract Paper

machine learning

When Is Unsupervised Disentanglement Possible?

Daniella Horan, Eitan Richardson, Yair Weiss

Keywords Abstract Paper

machine learning, generative model, representation learning

Deep Recurrent Belief Propagation Network for POMDPs

Yuhui Wang, Xiaoyang Tan

Keywords Abstract Paper

Non-asymptotic Analysis for Nonparametric Testing

Yun Yang, Zuofeng Shang, Guang Cheng

Keywords Abstract Paper

Regression, Concentration inequalities

Task-Robust Model-Agnostic Meta-Learning

Liam Collins, Aryan Mokhtari, Sanjay Shakkottai

Keywords Abstract Paper

Group testing and local search: is there a computational-statistical gap?

Fotis Iliopoulos, Ilias Zadik

Keywords Abstract Paper

Fast Multi-label Learning

Xiuwen Gong, Dong Yuan, Wei Bao

Keywords Abstract Paper

Machine Learning, Multi-instance; Multi-label; Multi-view learning

Consistency of a Recurrent Language Model With Respect to Incomplete Decoding

Sean Welleck, Ilia Kulikov, Jaedeok Kim and Richard Yuanzhe Pang, Kyunghyun Cho

Keywords Abstract Paper

receiving sequences, neural models, recurrent model, common algorithms

Soumya Ghosh, Will Stephenson, Stan Nguyen and
Sameer Deshpande, Tamara Broderick

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Son Nguyen, Duong Nguyen, Khai Nguyen and
Khoat Than, Hung Bui, Nhat Ho

Keywords Paper

Keywords Paper

James Lucas, Mengye Ren, Irene Raissa KAMENI KAMENI and
Toniann Pitassi, Richard Zemel

Keywords Paper

Fan Bao, Guoqiang Wu, Chongxuan LI and
Jun Zhu, Bo Zhang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Frederik Träuble, Julius von Kügelgen, Matthäus Kleindessner and
Francesco Locatello, Bernhard Schölkopf, Peter Gehler

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Sean Welleck, Ilia Kulikov, Jaedeok Kim and
Richard Yuanzhe Pang, Kyunghyun Cho

Keywords Paper

Shengjia Zhao, Michael Kim, Roshni Sahoo and
Tengyu Ma, Stefano Ermon

Keywords Paper

Keywords Paper

Akash Kumar Dhaka, Alejandro Catalina, Michael Andersen and
Måns Magnusson, Jonathan Huggins, Aki Vehtari

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser and
Gleb Novikov, David Steurer, Stefan Tiegel

Keywords Paper

Yingbo Gao, Weiyue Wang, Christian Herold and
Zijian Yang, Hermann Ney

Keywords Paper

Keywords Paper

Hongyu Lin, Yaojie Lu, Jialong Tang and
Xianpei Han, Le Sun, Zhicheng Wei, Nicholas Jing Yuan

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper