The EM Algorithm gives Sample-Optimality for Learning Mixtures of Well-Separated Gaussians

09/07/2020

The EM Algorithm gives Sample-Optimality for Learning Mixtures of Well-Separated Gaussians

Jeongyeol Kwon, Constantine Caramanis

Keywords: Non-convex optimization, Clustering, Concentration inequalities, High-dimensional statistics, PAC learning

Abstract Paper Similar Papers

Abstract: We consider the problem of spherical Gaussian Mixture models with $k \geq 3$ components when the components are well separated. A fundamental previous result established that separation of $\Omega(\sqrt{\log k})$ is necessary and sufficient for identifiability of the parameters with \textit{polynomial} sample complexity (Regev and Vijayaraghavan, 2017). In the same context, we show that $\tilde{O} (kd/\epsilon^2)$ samples suffice for any $\epsilon \lesssim 1/k$, closing the gap from polynomial to linear, and thus giving the first optimal sample upper bound for the parameter estimation of well-separated Gaussian mixtures. We accomplish this by proving a new result for the Expectation-Maximization (EM) algorithm: we show that EM converges locally, under separation $\Omega(\sqrt{\log k})$. The previous best-known guarantee required $\Omega(\sqrt{k})$ separation (Yan, et al., 2017). Unlike prior work, our results do not assume or use prior knowledge of the (potentially different) mixing weights or variances of the Gaussian components. Furthermore, our results show that the finite-sample error of EM does not depend on non-universal quantities such as pairwise distances between means of Gaussian components.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at COLT 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

08/07/2020

Deterministic Sparse Fourier Transform with an 𝓁_{∞} Guarantee

Yi Li, Vasileios Nakos

Keywords Paper

Fourier sparse recovery, derandomization, incoherent matrices

0

0

0

0

19:52

06/12/2021

An Online Riemannian PCA for Stochastic Canonical Correlation Analysis

Zihang Meng, Rudrasis Chakraborty, Vikas Singh

Keywords Paper

optimization, fairness

0

0

0

0

14:14

09/07/2020

Logsmooth Gradient Concentration and Tighter Runtimes for Metropolized Hamiltonian Monte Carlo

Yin Tat Lee, Ruoqi Shen, Kevin Tian

Keywords Paper

Sampling algorithms, Bayesian methods

0

0

0

0

14:57

04/08/2021

The Bethe and Sinkhorn Permanents of Low Rank Matrices and Implications for Profile Maximum Likelihood

Nima Anari, Moses Charikar, Kirankumar Shiragur, Aaron Sidford

Keywords Paper

0

0

0

0

18:20

06/12/2021

A Comprehensively Tight Analysis of Gradient Descent for PCA

Zhiqiang Xu, Ping Li

Keywords Paper

optimization

0

0

0

0

4:37

06/12/2020

Truncated Linear Regression in High Dimensions

Constantinos Daskalakis, Dhruv Rohatgi, Emmanouil Zampetakis

Keywords Paper

0

0

0

0

3:17

06/12/2021

List-Decodable Mean Estimation in Nearly-PCA Time

Ilias Diakonikolas, Daniel Kane, Daniel Kongsgaard and
Jerry Li, Kevin Tian

Keywords Paper

theory, clustering

0

0

0

0

14:21

06/12/2020

The Flajolet-Martin Sketch Itself Preserves Differential Privacy: Private Counting with Minimal Space

Adam Smith, Shuang Song, Abhradeep Guha Thakurta

Keywords Paper

0

0

0

0

3:17

06/12/2021

Mixture weights optimisation for Alpha-Divergence Variational Inference

Kamélia Daudel, randal douc

Keywords Paper

generative model

0

0

0

0

15:41

04/08/2021

Approximation Algorithms for Socially Fair Clustering

Yury Makarychev, Ali Vakilian

Keywords Paper

0

0

0

0

16:31

13/04/2021

Explicit regularization of stochastic gradient methods through duality

Anant Raj, Francis Bach

Keywords Paper

0

0

0

0

2:53

06/12/2021

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser and
Gleb Novikov, David Steurer, Stefan Tiegel

Keywords Paper

optimization

0

0

0

0

10:44

12/07/2020

Inertial Block Proximal Methods for Non-Convex Non-Smooth Optimization

Hien Le, Nicolas Gillis, Panagiotis Patrinos

Keywords Paper

Optimization - General

0

0

0

0

15:31

13/04/2021

vqSGD: Vector quantized stochastic gradient descent

Venkata Gandikota, Daniel Kane, Raj Kumar Maity, Arya Mazumdar

Keywords Paper

0

0

0

0

3:11

06/12/2021

Towards Sample-Optimal Compressive Phase Retrieval with Sparse and Generative Priors

Zhaoqiang Liu, Subhroshekhar Ghosh, Jonathan Scarlett

Keywords Paper

theory, optimization, generative model

0

0

0

0

10:41

04/08/2021

Optimal dimension dependence of the Metropolis-Adjusted Langevin Algorithm

Sinho Chewi, Chen Lu, Kwangjun Ahn and
Xiang Cheng, Thibaut Le Gouic, Philippe Rigollet

Keywords Paper

0

0

0

0

16:38

12/07/2020

Accelerated Stochastic Gradient-free and Projection-free Methods

Feihu Huang, Lue Tao, Songcan Chen

Keywords Paper

Optimization - Non-convex

0

0

0

0

13:05

26/08/2020

A Unified Theory of SGD: Variance Reduction, Sampling, Quantization and Coordinate Descent

Eduard Gorbunov, Filip Hanzely, Peter Richtarik

Keywords Paper

0

0

0

0

13:13

06/12/2021

Beyond Smoothness: Incorporating Low-Rank Analysis into Nonparametric Density Estimation

Robert A Vandermeulen, Antoine Ledent

Keywords Paper

theory

0

0

0

0

12:58

06/12/2020

Revisiting Frank-Wolfe for Polytopes: Strict Complementarity and Sparsity

Dan Garber

Keywords Paper

0

0

0

0

3:22

12/07/2020

Eliminating the Invariance on the Loss Landscape of Linear Autoencoders

Reza Oftadeh, Jiayi Shen, Zhangyang Wang, Dylan Shell

Keywords Paper

Deep Learning - Theory

0

0

0

0

15:10

26/08/2020

Bayesian experimental design using regularized determinantal point processes

Michal Derezinski, Feynman Liang, Michael Mahoney

Keywords Paper

0

0

0

0

17:58

13/04/2021

Inductive mutual information estimation: A convex maximum-entropy copula approach

Yves-Laurent Kom Samo

Keywords Paper

0

0

0

0

2:57

06/12/2021

A Non-commutative Extension of Lee-Seung's Algorithm for Positive Semidefinite Factorizations

Yong Sheng Soh, Antonios Varvitsiotis

Keywords Paper

theory, optimization

0

0

0

0

13:34

06/12/2021

Approximating the Permanent with Deep Rejection Sampling

Juha Harviainen, Antti Röyskö, Mikko Koivisto

Keywords Paper

0

0

0

0

12:26

06/12/2021

Non-asymptotic convergence bounds for Wasserstein approximation using point clouds

Quentin Mérigot, Filippo Santambrogio, Clément SARRAZIN

Keywords Paper

optimization, machine learning, optimal transport

0

0

0

0

14:49

13/04/2021

Homeomorphic-invariance of EM: Non-asymptotic convergence in KL divergence for exponential families via mirror descent

Frederik Kunstner, Raunak Kumar, Mark Schmidt

Keywords Paper

0

0

0

0

2:48

18/07/2021

Consistent regression when oblivious outliers overwhelm

Tommaso d'Orsi, Gleb Novikov, David Steurer

Keywords Paper

Theory, Game Theory and Computational Economics, Theory, Theory, Computational Complexity

0

0

0

0

4:42

02/02/2021

Multi-Objective Submodular Maximization by Regret Ratio Minimization with Theoretical Guarantee

Chao Feng, Chao Qian

Keywords Paper

0

0

0

0

15:19

26/08/2020

EM Converges for a Mixture of Many Linear Regressions

Jeongyeol Kwon, Constantine Caramanis

Keywords Paper

0

0

0

0

11:26

03/08/2020

A Practical Riemannian Algorithm for Computing Dominant Generalized Eigenspace

Zhiqiang Xu, Ping Li

Keywords Paper

0

0

0

0

8:02

06/12/2020

Faster Wasserstein Distance Estimation with the Sinkhorn Divergence

Lénaïc Chizat, Pierre Roussillon, Flavien Léger and
François-Xavier Vialard, Gabriel Peyré

Keywords Paper

0

0

1

1

3:21

18/07/2021

Streaming and Distributed Algorithms for Robust Column Subset Selection

Shuli Jiang, Dongyu Li, Irene Mengze Li and
Arvind Mahankali, David Woodruff

Keywords Paper

Algorithms, Deep Learning, Generative Models, Deep Learning, Predictive Models; Deep Learning, Recurrent Networks

0

0

0

0

7:26

06/12/2020

Projection Robust Wasserstein Distance and Riemannian Optimization

Darren Lin, Chenyou Fan, Nhat Ho and
Marco Cuturi, Michael Jordan

Keywords Paper

Optimization -> Non-Convex Optimization; Optimization -> Stochastic Optimization, Deep Learning -> Optimization for Deep Networks

0

0

0

1

3:01

06/12/2020

Robust Gaussian Covariance Estimation in Nearly-Matrix Multiplication Time

Jerry Li, Guanghao Ye

Keywords Paper

0

0

0

0

3:13

22/06/2020

Positive semidefinite programming: Mixed, parallel, and width-independent

Arun Jambulapati, Yin Tat Lee, Jerry Li and
Swati Padmanabhan, Kevin Tian

Keywords Paper

semidefinite programming, approximation algorithm, mixed packing and covering, width-independent algorithm, parallel algorithm

0

0

0

0

18:12

26/08/2020

Stochastic Recursive Variance-Reduced Cubic Regularization Methods

Dongruo Zhou, Quanquan Gu

Keywords Paper

0

0

0

0

15:42

13/04/2021

Minimax estimation of laplacian constrained precision matrices

Jiaxi Ying, José Vinícius de Miranda Cardoso, Daniel Palomar

Keywords Paper

0

0

0

0

3:00

06/12/2021

Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery

Lijun Ding, Liwei Jiang, Yudong Chen and
Qing Qu, Zhihui Zhu

Keywords Paper

0

0

0

0

14:02

09/07/2020

A Fast Spectral Algorithm for Mean Estimation with Sub-Gaussian Rates

Zhixian Lei, Kyle Luh, Prayaag Venkat, Fred Zhang

Keywords Paper

High-dimensional statistics, Adversarial learning and robustness

0

0

0

0

15:00