Dimensionality Reduction for the Sum-of-Distances Metric

18/07/2021

Dimensionality Reduction for the Sum-of-Distances Metric

Zhili Feng, Praneeth Kacham, David Woodruff

Keywords: Neuroscience and Cognitive Science, Deep Learning, Biologically Plausible Deep Networks; Neuroscience and Cognitive Science, Connectomics; Neuroscience and Cog, Algorithms, Dimensionality Reduction

Abstract Paper Similar Papers

Abstract: We give a dimensionality reduction procedure to approximate the sum of distances of a given set of $n$ points in $R^d$ to any ``shape'' that lies in a $k$-dimensional subspace. Here, by ``shape'' we mean any set of points in $R^d$. Our algorithm takes an input in the form of an $n \times d$ matrix $A$, where each row of $A$ denotes a data point, and outputs a subspace $P$ of dimension $O(k^{3}/\epsilon^6)$ such that the projections of each of the $n$ points onto the subspace $P$ and the distances of each of the points to the subspace $P$ are sufficient to obtain an $\epsilon$-approximation to the sum of distances to any arbitrary shape that lies in a $k$-dimensional subspace of $R^d$. These include important problems such as $k$-median, $k$-subspace approximation, and $(j,l)$ subspace clustering with $j \cdot l \leq k$. Dimensionality reduction reduces the data storage requirement to $(n+d)k^{3}/\epsilon^6$ from nnz$(A)$. Here nnz$(A)$ could potentially be as large as $nd$. Our algorithm runs in time nnz$(A)/\epsilon^2 + (n+d)$poly$(k/\epsilon)$, up to logarithmic factors. For dense matrices, where nnz$(A) \approx nd$, we give a faster algorithm, that runs in time $nd + (n+d)$poly$(k/\epsilon)$ up to logarithmic factors. Our dimensionality reduction algorithm can also be used to obtain poly$(k/\epsilon)$ size coresets for $k$-median and $(k,1)$-subspace approximation problems in polynomial time.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

Input-Sparsity Low Rank Approximation in Schatten Norm

Yi Li, David Woodruff

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

15:36

18/07/2021

Near-Optimal Algorithms for Explainable k-Medians and k-Means

Kostya Makarychev, Liren Shan

Keywords Paper

Algorithms, Unsupervised Learning

0

0

0

0

5:12

06/12/2021

A Faster Maximum Cardinality Matching Algorithm with Applications in Machine Learning

Nathaniel Lahn, Sharath Raghvendra, Jiacheng Ye

Keywords Paper

optimization, machine learning, graph learning

0

0

0

0

14:49

09/07/2020

How to trap a gradient flow

Dan Mikulincer, Sebastien Bubeck

Keywords Paper

Non-convex optimization,

0

0

0

0

15:01

18/07/2021

In-Database Regression in Input Sparsity Time

Rajesh Jayaram, Alireza Samadian, David Woodruff, Peng Ye

Keywords Paper

Algorithms, Dimensionality Reduction

0

0

0

0

5:08

03/05/2021

Deep Learning meets Projective Clustering

Alaa Maalouf, Harry Lang, Daniela Rus, Dan Feldman

Keywords Paper

NLP, Compressing Deep Networks, Matrix Factorization, SVD

0

0

0

0

5:26

08/07/2020

Approximate Nearest Neighbor for Curves --- Simple, Efficient, and Deterministic

Arnold Filtser, Omrit Filtser, Matthew Katz

Keywords Paper

polygonal curves, Fréchet distance, dynamic time warping, approximation algorithms, (asymmetric) approximate nearest neighbor, range counting

0

0

0

0

19:55

06/12/2020

Dynamic Submodular Maximization

Technische Monemizadeh

Keywords Paper

0

0

0

0

3:08

06/12/2021

An Online Riemannian PCA for Stochastic Canonical Correlation Analysis

Zihang Meng, Rudrasis Chakraborty, Vikas Singh

Keywords Paper

optimization, fairness

0

0

0

0

14:14

09/07/2020

An O(m/eps^3.5)-Cost Algorithm for Semidefinite Programs with Diagonal Constraints

Swati Padmanabhan, Yin Tat Lee

Keywords Paper

Convex optimization, Approximation algorithms, Combinatorial optimization

0

0

0

0

12:34

08/07/2020

Linearly Representable Submodular Functions: An Algebraic Algorithm for Minimization

Rohit Gurjar and Rajat Rathi

Keywords Paper

Submodular Minimization, Parallel Algorithms, Derandomization, Algebraic Algorithms

0

0

0

0

24:54

06/12/2021

Coresets for Decision Trees of Signals

Ibrahim Jubran, Ernesto Evgeniy Sanches Shayda, Ilan I Newman, Dan Feldman

Keywords Paper

machine learning

0

0

0

0

14:50

12/07/2020

Lower Complexity Bounds for Finite-Sum Convex-Concave Minimax Optimization Problems

Guangzeng Xie, Luo Luo, yijiang lian, Zhihua Zhang

Keywords Paper

Optimization - Convex

0

0

0

0

12:09

04/08/2021

The Bethe and Sinkhorn Permanents of Low Rank Matrices and Implications for Profile Maximum Likelihood

Nima Anari, Moses Charikar, Kirankumar Shiragur, Aaron Sidford

Keywords Paper

0

0

0

0

18:20

06/12/2021

Improved Coresets and Sublinear Algorithms for Power Means in Euclidean Spaces

Vincent Cohen-Addad, David Saulpic, Chris Schwiegelshohn

Keywords Paper

clustering

0

0

0

0

16:06

26/08/2020

Sketching Transformed Matrices with Applications to Natural Language Processing

Yingyu Liang, Zhao Song, Mengdi Wang and
Lin Yang, Xin Yang

Keywords Paper

0

0

0

0

11:17

18/07/2021

Randomized Dimensionality Reduction for Facility Location and Single-Linkage Clustering

Shyam Narayanan, Sandeep Silwal, Piotr Indyk, Or Zamir

Keywords Paper

Algorithms, Dimensionality Reduction

0

0

0

0

5:00

02/02/2021

Approximate Multiplication of Sparse Matrices with Limited Space

Yuanyu Wan, Lijun Zhang

Keywords Paper

0

0

0

0

19:26

18/07/2021

Streaming and Distributed Algorithms for Robust Column Subset Selection

Shuli Jiang, Dongyu Li, Irene Mengze Li and
Arvind Mahankali, David Woodruff

Keywords Paper

Algorithms, Deep Learning, Generative Models, Deep Learning, Predictive Models; Deep Learning, Recurrent Networks

0

0

0

0

7:26

22/06/2020

An improved cutting plane method for convex optimization, convex-concave games, and its applications

Haotian Jiang, Yin Tat Lee, Zhao Song, Sam Chiu-wai Wong

Keywords Paper

convex-concave games, cutting plane method, market equilibrium, fast rectangular matrix multiplication, convex optimization

0

0

0

0

23:51

18/07/2021

Two-way kernel matrix puncturing: towards resource-efficient PCA and spectral clustering

Romain COUILLET, Florent Chatelain, Nicolas Le Bihan

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:19

22/06/2020

Top-𝑘-convolution and the quest for near-linear output-sensitive subset sum

Karl Bringmann, Vasileios Nakos

Keywords Paper

Subset Sum, pseudopolynomial, output-sensitive, convolution, restricted sumset

0

0

0

0

25:48

06/12/2021

The Complexity of Sparse Tensor PCA

Davin Choo, Tommaso d'Orsi

Keywords Paper

0

0

0

0

15:10

06/12/2021

Escape saddle points by a simple gradient-descent based algorithm

Chenyi Zhang, Tongyang Li

Keywords Paper

optimization

0

0

0

0

14:49

06/12/2021

Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization

Tian Ye, Simon Du

Keywords Paper

theory, optimization

0

0

0

0

14:51

03/08/2020

Exponentially faster shortest paths in the congested clique

Michal Dory, Merav Parter

Keywords Paper

congested clique, shortest paths, near-additive emulator

0

0

0

0

23:50

06/12/2021

Best of Both Worlds: Practical and Theoretically Optimal Submodular Maximization in Parallel

Yixin Chen, Tonmoy Dey, Alan Kuhnle

Keywords Paper

0

0

0

0

15:04

18/07/2021

Finding k in Latent $k-$ polytope

Chiru Bhattacharyya, Ravindran Kannan, Amit Kumar

Keywords Paper

Algorithms, Components Analysis (e.g., CCA, ICA, LDA, PCA)

0

0

0

0

5:08

06/12/2021

Dimensionality Reduction for Wasserstein Barycenter

Zachary Izzo, Sandeep Silwal, Samson Zhou

Keywords Paper

machine learning

0

0

0

0

11:10

22/06/2020

Coresets for clustering in euclidean spaces: Importance sampling is nearly optimal

Lingxiao Huang, Nisheeth K. Vishnoi

Keywords Paper

Coresets, k-means, Importance sampling, Dimension reduction, Clustering, k-median

0

0

0

0

19:23

13/04/2021

vqSGD: Vector quantized stochastic gradient descent

Venkata Gandikota, Daniel Kane, Raj Kumar Maity, Arya Mazumdar

Keywords Paper

0

0

0

0

3:11

04/08/2021

Approximation Algorithms for Socially Fair Clustering

Yury Makarychev, Ali Vakilian

Keywords Paper

0

0

0

0

16:31

08/07/2020

The Online Min-Sum Set Cover Problem

Dimitris Fotakis, Loukas Kavouras, Grigorios Koumoutsos and
Stratis Skoulakis, Manolis Vardas

Keywords Paper

Online Algorithms, Competitive Analysis, Min-Sum Set Cover

0

0

0

0

25:10

06/12/2020

Fast Convergence of Langevin Dynamics on Manifold: Geodesics meet Log-Sobolev

Xiao Wang, Qi Lei, Ioannis Panageas

Keywords Paper

0

0

0

0

3:16

18/07/2021

One-sided Frank-Wolfe algorithms for saddle problems

Vladimir Kolmogorov, Thomas Pock

Keywords Paper

Optimization, Convex Optimization

0

0

0

0

5:07

04/08/2021

Reduced-Rank Regression with Operator Norm Error

Praneeth Kacham, David Woodruff

Keywords Paper

0

0

0

0

18:07

18/07/2021

Accelerated Algorithms for Smooth Convex-Concave Minimax Problems with O(1/k^2) Rate on Squared Gradient Norm

TaeHo Yoon, Ernest Ryu

Keywords Paper

Optimization, Convex Optimization

0

0

0

0

17:03

12/07/2020

Improving the Sample and Communication Complexity for Decentralized Non-Convex Optimization: Joint Gradient Estimation and Tracking

Haoran Sun, Songtao Lu, Mingyi Hong

Keywords Paper

Optimization - Non-convex

0

0

0

0

13:56

09/07/2020

Private Mean Estimation of Heavy-Tailed Distributions

Gautam Kamath, Vikrant Singhal, Jonathan Ullman

Keywords Paper

Privacy, fairness, Distribution learning/testing

0

0

0

0

13:24

06/12/2021

Tight High Probability Bounds for Linear Stochastic Approximation with Fixed Stepsize

Alain Durmus, Eric Moulines, Alexey Naumov and
Sergey Samsonov, Kevin Scaman, Hoi-To Wai

Keywords Paper

machine learning

0

0

0

0

12:53