Learning a Latent Simplex in Input Sparsity Time

Abstract: We consider the problem of learning a latent $k$-vertex simplex $K\subset\mathbb{R}^d$, given $\mathbf{A}\in\mathbb{R}^{d\times n}$, which can be viewed as $n$ data points that are formed by randomly perturbing some latent points in $K$, possibly beyond $K$. A large class of latent variable models, such as adversarial clustering, mixed membership stochastic block models, and topic models, can be cast in this view of learning a latent simplex. Bhattacharyya and Kannan (SODA 2020) give an algorithm for learning such a $k$-vertex latent simplex in time roughly $O(k\cdot\text{nnz}(\mathbf{A}))$, where $\text{nnz}(\mathbf{A})$ is the number of non-zeros in $\mathbf{A}$. We show that the dependence on $k$ in the running time is unnecessary given a natural assumption about the mass of the top $k$ singular values of $\mathbf{A}$, which holds in many of these applications. Further, we show this assumption is necessary, as otherwise an algorithm for learning a latent simplex would imply a better low rank approximation algorithm than what is known. We obtain a spectral low-rank approximation to $\mathbf{A}$ in input-sparsity time and show that the column space thus obtained has small $\sin\Theta$ (angular) distance to the right top-$k$ singular space of $\mathbf{A}$. Our algorithm then selects $k$ points in the low-rank subspace with the largest inner product (in absolute value) with $k$ carefully chosen random vectors. By working in the low-rank subspace, we avoid reading the entire matrix in each iteration and thus circumvent the $\Theta(k\cdot\text{nnz}(\mathbf{A}))$ running time.
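
The two-stage idea in the abstract (project onto a rank-$k$ subspace, then pick the points with the largest absolute inner product against random directions) can be illustrated with a small NumPy sketch. This is a toy illustration under loudly stated assumptions, not the authors' algorithm: an exact truncated SVD stands in for the input-sparsity-time spectral low-rank approximation, the "carefully chosen" random vectors are approximated by Gaussian vectors made orthogonal to the points already selected, and no noise handling is included. The function name `select_extreme_points` and its parameters are hypothetical.

```python
import numpy as np


def select_extreme_points(A, k, seed=0):
    """Toy sketch: return indices of k columns of A (d x n) that look like
    simplex vertices, plus those columns projected onto a rank-k subspace."""
    rng = np.random.default_rng(seed)

    # Assumption: an exact SVD replaces the input-sparsity-time
    # low-rank approximation described in the abstract.
    U, _, _ = np.linalg.svd(A, full_matrices=False)
    V = U[:, :k]          # d x k orthonormal basis of the subspace
    P = V.T @ A           # k x n coordinates of every point in the subspace

    chosen = []
    for _ in range(k):
        u = rng.standard_normal(k)
        if chosen:
            # Assumption: make u orthogonal to the points picked so far,
            # so that already selected points score (numerically) zero.
            Q, _ = np.linalg.qr(P[:, chosen])
            u = u - Q @ (Q.T @ u)
        # Only the k x n projected matrix is read in each iteration.
        scores = np.abs(u @ P)
        chosen.append(int(np.argmax(scores)))

    return chosen, V @ P[:, chosen]
```

Each iteration touches only the $k\times n$ projected matrix rather than $\mathbf{A}$ itself, which is the point the abstract makes about avoiding a full pass over the input per iteration. On noise-free data whose vertices appear among the columns and are linearly independent, the selected columns coincide with the vertices, because the absolute value of a linear function over a convex hull is maximized at a vertex.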

04/08/2021

Algorithms, Meta-Learning, Algorithms, Few-Shot Learning; Algorithms, Multitask and Transfer Learning, Theory, Statistical Learning Theory

06/12/2021

Ainesh Bakshi, Chiranjib Bhattacharyya, Ravi Kannan, David Woodruff, Samson Zhou

Similar Papers

Reduced-Rank Regression with Operator Norm Error

Praneeth Kacham, David Woodruff

Near-Optimal Entrywise Sampling of Numerically Sparse Matrices

Vladimir Braverman, Robert Krauthgamer, Aditya R Krishnan, Shay Sapir

Sketching Transformed Matrices with Applications to Natural Language Processing

Yingyu Liang, Zhao Song, Mengdi Wang, Lin Yang, Xin Yang

Online k-means clustering

Vincent Cohen-Addad, Benjamin Guedj, Varun Kanade, Guy Rom

Meta Learning for Support Recovery in High-dimensional Precision Matrix Estimation

Qian Zhang, Yilin Zheng, Jean Honorio

Algorithms, Meta-Learning, Algorithms, Few-Shot Learning; Algorithms, Multitask and Transfer Learning, Theory, Statistical Learning Theory

Optimal Sketching for Trace Estimation

Shuli Jiang, Hai Pham, David Woodruff, Richard Zhang

machine learning

Dynamics of Stochastic Momentum Methods on Large-scale, Quadratic Models

Courtney Paquette, Elliot Paquette

theory, optimization

Second-Order Information in Non-Convex Stochastic Optimization: Power and Limitations

Yossi Arjevani, Yair Carmon, John Duchi, Dylan Foster, Ayush Sekhari, Karthik Sridharan

Non-convex optimization, Stochastic optimization

List-Decodable Mean Estimation in Nearly-PCA Time

Ilias Diakonikolas, Daniel Kane, Daniel Kongsgaard, Jerry Li, Kevin Tian

theory, clustering

How Good is SGD with Random Shuffling?

Itay M Safran, Ohad Shamir

Convex optimization

Efficiently learning structured distributions from untrusted batches

Sitan Chen, Jerry Li, Ankur Moitra

sum-of-squares, federated learning, VC complexity, Robust statistics

Minimax Regret of Switching-Constrained Online Convex Optimization: No Phase Transition

Lin Chen, Qian Yu, Hannah Lawrence, Amin Karbasi

Instance-Dependent Bounds for Zeroth-order Lipschitz Optimization with Error Certificates

Francois Bachoc, Tom Cesari, Sébastien Gerchinovitz

theory, optimization

Improving the Sample and Communication Complexity for Decentralized Non-Convex Optimization: Joint Gradient Estimation and Tracking

Haoran Sun, Songtao Lu, Mingyi Hong

Optimization - Non-convex

Oracle Complexity in Nonsmooth Nonconvex Optimization

Guy Kornowski, Ohad Shamir

theory, deep learning, optimization

Improved Coresets and Sublinear Algorithms for Power Means in Euclidean Spaces

Vincent Cohen-Addad, David Saulpic, Chris Schwiegelshohn

clustering

Statistical-Query Lower Bounds via Functional Gradients

Surbhi Goel, Aravind Gollakota, Adam Klivans

Algorithms for heavy-tailed statistics: Regression, covariance estimation, and beyond

Yeshwanth Cherapanamjeri, Samuel B. Hopkins, Tarun Kathuria, Prasad Raghavendra, Nilesh Tripuraneni

Sum-of-squares, Algorithms, Heavy-Tailed Estimation

Quick streaming algorithms for maximization of monotone submodular functions in linear time

Alan Kuhnle

Optimal Dynamic Regret in Exp-Concave Online Learning

Dheeraj Baby, Yu-Xiang Wang

Locally Private Hypothesis Selection

Sivakanth Gopi, Gautam Kamath, Janardhan D Kulkarni, Aleksandar Nikolov, Steven Wu, Huanyu Zhang

Privacy, fairness, Distribution learning/testing
