On the minimax optimality of the EM algorithm for learning two-component mixed linear regression

13/04/2021

On the minimax optimality of the EM algorithm for learning two-component mixed linear regression

Jeongyeol Kwon, Nhat Ho, Constantine Caramanis

Keywords:

Abstract Paper Similar Papers

Abstract: We study the convergence rates of the EM algorithm for learning two-component mixed linear regression under all regimes of signal-to-noise ratio (SNR). We resolve a long-standing question that many recent results have attempted to tackle: we completely characterize the convergence behavior of EM, and show that the EM algorithm achieves minimax optimal sample complexity under all SNR regimes. In particular, when the SNR is sufficiently large, the EM updates converge to the true parameter \theta^{*} at the standard parametric convergence rate \calo((d/n)^{1/2}) after \calo(\log(n/d)) iterations. In the regime where the SNR is above \calo((d/n)^{1/4}) and below some constant, the EM iterates converge to a \calo({\rm SNR}^{-1} (d/n)^{1/2}) neighborhood of the true parameter, when the number of iterations is of the order \calo({\rm SNR}^{-2} \log(n/d)). In the low SNR regime where the SNR is below \calo((d/n)^{1/4}), we show that EM converges to a \calo((d/n)^{1/4}) neighborhood of the true parameters, after \calo((n/d)^{1/2}) iterations. Notably, these results are achieved under mild conditions of either random initialization or an efficiently computable local initialization. By providing tight convergence guarantees of the EM algorithm in middle-to-low SNR regimes, we fill the remaining gap in the literature, and significantly, reveal that in low SNR, EM changes rate, matching the n^{-1/4} rate of the MLE, a behavior that previous work had been unable to show.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AISTATS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Robustness Analysis of Non-Convex Stochastic Gradient Descent using Biased Expectations

Kevin Scaman, Cedric Malherbe

Keywords Paper

0

0

0

0

3:09

06/12/2020

On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems

Panayotis Mertikopoulos, Nadav Hallak, Ali Kavis, Volkan Cevher

Keywords Paper

0

0

0

0

3:27

13/04/2021

Homeomorphic-invariance of EM: Non-asymptotic convergence in KL divergence for exponential families via mirror descent

Frederik Kunstner, Raunak Kumar, Mark Schmidt

Keywords Paper

0

0

0

0

2:48

06/12/2020

CSER: Communication-efficient SGD with Error Reset

Cong Xie, Shuai Zheng, Sanmi Koyejo and
Indranil Gupta, Mu Li, Haibin Lin

Keywords Paper

0

0

0

0

3:12

06/12/2021

Efficient Generalization with Distributionally Robust Learning

Soumyadip Ghosh, Mark Squillante, Ebisa Wollega

Keywords Paper

optimization, machine learning

0

0

0

0

14:57

26/08/2020

Alternating Minimization Converges Super-Linearly for Mixed Linear Regression

Avishek Ghosh, Ramchandran Kannan

Keywords Paper

0

0

0

0

12:56

26/08/2020

EM Converges for a Mixture of Many Linear Regressions

Jeongyeol Kwon, Constantine Caramanis

Keywords Paper

0

0

0

0

11:26

06/12/2020

Revisiting Frank-Wolfe for Polytopes: Strict Complementarity and Sparsity

Dan Garber

Keywords Paper

0

0

0

0

3:22

12/07/2020

Optimal Estimator for Unlabeled Linear Regression

Hang Zhang, Ping Li

Keywords Paper

Learning Theory

0

0

0

0

14:58

26/08/2020

Fast and Furious Convergence: Stochastic Second Order Methods under Interpolation

Si Yi Meng, Sharan Vaswani, Issam Hadj Laradji and
Mark Schmidt, Simon Lacoste-Julien

Keywords Paper

0

0

0

0

14:20

06/12/2020

Escaping Saddle-Point Faster under Interpolation-like Conditions

Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi, Prasant Mohapatra

Keywords Paper

0

0

0

0

3:19

06/12/2021

Non-convex Distributionally Robust Optimization: Non-asymptotic Analysis

Jikai Jin, Bohang Zhang, Haiyang Wang, Liwei Wang

Keywords Paper

optimization

0

0

0

0

14:05

06/12/2020

Task-Robust Model-Agnostic Meta-Learning

Liam Collins, Aryan Mokhtari, Sanjay Shakkottai

Keywords Paper

0

0

0

0

3:17

04/08/2021

The Bethe and Sinkhorn Permanents of Low Rank Matrices and Implications for Profile Maximum Likelihood

Nima Anari, Moses Charikar, Kirankumar Shiragur, Aaron Sidford

Keywords Paper

0

0

0

0

18:20

14/09/2020

Efficiency of Coordinate Descent Methods For Structured Nonconvex Optimization

Qi Deng, Chenghao Lan

Keywords Paper

coordinate descent method, nonconvex optimization, nonsmooth optimization

0

0

0

0

3:20

13/04/2021

Explicit regularization of stochastic gradient methods through duality

Anant Raj, Francis Bach

Keywords Paper

0

0

0

0

2:53

06/12/2021

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser and
Gleb Novikov, David Steurer, Stefan Tiegel

Keywords Paper

optimization

0

0

0

0

10:44

06/12/2021

Near-optimal Offline and Streaming Algorithms for Learning Non-Linear Dynamical Systems

Suhas Kowshik, Dheeraj Nagaraj, Prateek Jain, Praneeth Netrapalli

Keywords Paper

theory

0

0

0

0

14:43

02/02/2021

On Convergence of Gradient Expected Sarsa(λ)

Long Yang, Gang Zheng, Yu Zhang and
Qian Zheng, Pengfei Li, Gang Pan

Keywords Paper

0

0

0

0

11:27

06/12/2020

Random Reshuffling: Simple Analysis with Vast Improvements

Konstantin Mishchenko, Ahmed Khaled Ragab Bayoumi, Peter Richtarik

Keywords Paper

Reinforcement Learning and Planning -> Planning; Reinforcement Learning and Planning -> Reinforcement Learning, Reinforcement Learning and Planning

0

0

0

0

3:08

02/02/2021

Infinite Gaussian Mixture Modeling with an Improved Estimation of the Number of Clusters

Avi Matza, Yuval Bistritz

Keywords Paper

0

0

0

0

20:14

12/07/2020

Data Amplification: Instance-Optimal Property Estimation

Yi Hao, Alon Orlitsky

Keywords Paper

Learning Theory

0

0

0

0

15:06

06/12/2020

Distributionally Robust Federated Averaging

Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

Keywords Paper

0

0

0

0

3:11

26/08/2020

Finite-Time Error Bounds for Biased Stochastic Approximation with Applications to Q-Learning

Gang Wang, Georgios B. Giannakis

Keywords Paper

0

0

0

0

14:03

06/12/2021

Label Noise SGD Provably Prefers Flat Global Minimizers

Alex Damian, Tengyu Ma, Jason Lee

Keywords Paper

optimization, machine learning

0

0

0

0

11:31

04/08/2021

SGD in the Large: Average-case Analysis, Asymptotics, and Stepsize Criticality

Courtney Paquette, Kiwon Lee, Fabian Pedregosa, Elliot Paquette

Keywords Paper

0

0

0

0

14:38

18/07/2021

A Second look at Exponential and Cosine Step Sizes: Simplicity, Adaptivity, and Performance

Xiaoyu Li, Zhenxun Zhuang, Francesco Orabona

Keywords Paper

Optimization, Non-Convex Optimization

0

0

0

0

5:07

06/12/2020

Linear-Sample Learning of Low-Rank Distributions

Ayush Jain, Alon Orlitsky

Keywords Paper

0

0

0

0

3:22

22/06/2020

Non-adaptive adaptive sampling on turnstile streams

Sepideh Mahabadi, Ilya Razenshteyn, David P. Woodruff, Samson Zhou

Keywords Paper

volume maximization, determinantal point processes, computational geometry, streaming algorithms

0

0

0

0

25:07

09/07/2020

Winnowing with Gradient Descent

Ehsan Amid, Manfred K. Warmuth

Keywords Paper

Online learning,

0

0

0

0

14:22

18/07/2021

On the Convergence of Hamiltonian Monte Carlo with Stochastic Gradients

Difan Zou, Quanquan Gu

Keywords Paper

Probabilistic Methods, Monte Carlo Methods

0

0

0

0

5:39

22/06/2020

A polynomial lower bound on adaptive complexity of submodular maximization

Wenzheng Li, Paul Liu, Jan Vondrák

Keywords Paper

submodular, optimization, symmetry gap, lower bound, adaptive model

0

0

0

0

24:00

06/12/2021

Convergence Rates of Stochastic Gradient Descent under Infinite Noise Variance

Hongjian Wang, Mert Gurbuzbalaban, Lingjiong Zhu and
Umut Simsekli, Murat Erdogdu

Keywords Paper

optimization

0

0

0

0

8:24

06/12/2021

Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations

Ayush Sekhari, Christoph Dann, Mehryar Mohri and
Yishay Mansour, Karthik Sridharan

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

11:22

06/12/2021

Fast Minimum-norm Adversarial Attacks through Adaptive Norm Constraints

Maura Pintor, Fabio Roli, Wieland Brendel, Battista Biggio

Keywords Paper

optimization, machine learning, robustness, adversarial robustness and security, vision

0

0

0

0

11:35

04/08/2021

Convergence rates and approximation results for SGD and its continuous-time counterpart

Xavier Fontaine, Valentin De Bortoli, Alain Durmus

Keywords Paper

0

0

0

0

17:35

06/12/2020

Debiasing Averaged Stochastic Gradient Descent to handle missing values

Aude Sportisse, Claire Boyer, Aymeric Dieuleveut, Julie Josse

Keywords Paper

0

0

0

0

3:23

06/12/2021

Minibatch and Momentum Model-based Methods for Stochastic Weakly Convex Optimization

Qi Deng, Wenzhi Gao

Keywords Paper

optimization, robustness

0

0

0

0

13:12

06/12/2021

PLUGIn: A simple algorithm for inverting generative models with recovery guarantees

Babhru Joshi, Xiaowei Li, Yaniv Plan, Ozgur Yilmaz

Keywords Paper

deep learning, optimization, generative model

0

0

0

0

14:58

18/07/2021

Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality

Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin LIANG

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

4:23