Unconditional Coresets for Regularized Loss Minimization

26/08/2020

Unconditional Coresets for Regularized Loss Minimization

Alireza Samadian, Kirk Pruhs, Benjamin Moseley, Sungjin Im, Ryan Curtin

Keywords:

Abstract Paper Similar Papers

Abstract: We design and mathematically analyze sampling-based algorithms for regularized loss minimization problems that are implementable in popular computational models for large data, in which the access to the data is restricted in some way. Our main result is that if the regularizer's effect does not become negligible as the norm of the hypothesis scales, and as the data scales, then a uniform sample of modest size is with high probability a coreset. In the case that the loss function is either logistic regression or soft-margin support vector machines, and the regularizer is one of the common recommended choices, this result implies that a uniform sample of size $O(d \sqrt{n})$ is with high probability a coreset of $n$ points in $\Re^d$. We contrast this upper bound with two lower bounds. The first lower bound shows that our analysis of uniform sampling is tight; that is, a smaller uniform sample will likely not be a core set. The second lower bound shows that in some sense uniform sampling is close to optimal, as significantly smaller core sets do not generally exist.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AISTATS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Sample Complexity of Robust Linear Classification on Separated Data

Robi Bhattacharjee, Somesh Jha, Kamalika Chaudhuri

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:26

06/12/2020

Bayesian Pseudocoresets

Dionysis Manousakas, Zuheng Xu, Cecilia Mascolo, Trevor Campbell

Keywords Paper

0

0

0

0

3:19

06/12/2021

Dimensionality Reduction for Wasserstein Barycenter

Zachary Izzo, Sandeep Silwal, Samson Zhou

Keywords Paper

machine learning

0

0

0

0

11:10

09/07/2020

How Good is SGD with Random Shuffling?

Itay M Safran, Ohad Shamir

Keywords Paper

Convex optimization,

0

0

0

0

11:50

06/12/2020

A novel variational form of the Schatten-$p$ quasi-norm

Paris Giampouras, Rene Vidal, Athanasios Rontogiannis, Benjamin Haeffele

Keywords Paper

0

0

0

0

3:14

04/08/2021

Non-Euclidean Differentially Private Stochastic Convex Optimization

Raef Bassily, Cristobal Guzman, Anupama Nandi

Keywords Paper

0

0

0

0

16:40

18/07/2021

Meta Learning for Support Recovery in High-dimensional Precision Matrix Estimation

Qian Zhang, Yilin Zheng, Jean Honorio

Keywords Paper

Algorithms, Meta-Learning, Algorithms, Few-Shot Learning; Algorithms, Multitask and Transfer Learning, Theory, Statistical Learning Theory

0

0

0

0

5:03

06/12/2020

Coresets for Near-Convex Functions

Murad Tukan, Alaa Maalouf, Dan Feldman

Keywords Paper

0

0

0

0

3:22

03/05/2021

Sequential Density Ratio Estimation for Simultaneous Optimization of Speed and Accuracy

Akinori Ebihara, Taiki Miyagawa, Kazuyuki Sakurai, Hitoshi Imaoka

Keywords Paper

Density ratio estimation, Early classification, Sequential probability ratio test

0

0

0

0

9:55

13/04/2021

Exponential convergence rates of classification errors on learning with SGD and random features

Shingo Yashima, Atsushi Nitanda, Taiji Suzuki

Keywords Paper

0

0

0

0

2:58

06/12/2021

Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning

Gen Li, Laixi Shi, Yuxin Chen and
Yuantao Gu, Yuejie Chi

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

15:32

06/12/2021

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser and
Gleb Novikov, David Steurer, Stefan Tiegel

Keywords Paper

optimization

0

0

0

0

10:44

13/04/2021

Gradient descent in RKHS with importance labeling

Tomoya Murata, Taiji Suzuki

Keywords Paper

0

0

0

0

3:04

18/07/2021

Streaming and Distributed Algorithms for Robust Column Subset Selection

Shuli Jiang, Dongyu Li, Irene Mengze Li and
Arvind Mahankali, David Woodruff

Keywords Paper

Algorithms, Deep Learning, Generative Models, Deep Learning, Predictive Models; Deep Learning, Recurrent Networks

0

0

0

0

7:26

06/12/2020

Stability of Stochastic Gradient Descent on Nonsmooth Convex Losses

Raef Bassily, Vitaly Feldman, Cristóbal Guzmán, Kunal Talwar

Keywords Paper

0

0

0

0

3:11

18/07/2021

Robust Unsupervised Learning via L-statistic Minimization

Andreas Maurer, Daniela Angela Parletta, Andrea Paudice, Massimiliano Pontil

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:03

06/12/2021

List-Decodable Mean Estimation in Nearly-PCA Time

Ilias Diakonikolas, Daniel Kane, Daniel Kongsgaard and
Jerry Li, Kevin Tian

Keywords Paper

theory, clustering

0

0

0

0

14:21

06/12/2021

High-probability Bounds for Non-Convex Stochastic Optimization with Heavy Tails

Ashok Cutkosky, Harsh Mehta

Keywords Paper

deep learning, optimization

0

0

0

0

20:14

09/07/2020

High probability guarantees for stochastic convex optimization

Damek Davis, Dmitriy Drusvyatskiy

Keywords Paper

Stochastic optimization, Computational complexity, Convex optimization, Excess risk bounds and generalization error bounds

0

0

0

0

15:10

18/07/2021

Active Slices for Sliced Stein Discrepancy

Wenbo Gong, Kaibo Zhang, Yingzhen Li, Jose Miguel Hernandez-Lobato

Keywords Paper

, Deep Learning, Efficient Inference Methods, Algorithms, Kernel Methods

0

0

0

0

5:47

13/04/2021

Fundamental limits of ridge-regularized empirical risk minimization in high dimensions

Hossein Taheri, Ramtin Pedarsani, Christos Thrampoulidis

Keywords Paper

0

0

0

0

3:33

09/07/2020

The estimation error of general first order methods

Michael V Celentano, Andrea Montanari, Yuchen Wu

Keywords Paper

High-dimensional statistics, Computational complexity, Matrix/tensor estimation, Regression

0

0

0

0

14:10

06/12/2020

An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits

Julian Katz-Samuels, Lalit Jain, zohar karnin, Kevin Jamieson

Keywords Paper

0

0

0

0

3:20

12/07/2020

Understanding the Impact of Model Incoherence on Convergence of Incremental SGD with Random Reshuffle

Shaocong Ma, Yi Zhou

Keywords Paper

Optimization - Non-convex

0

0

0

0

13:33

13/04/2021

Principal component regression with semirandom observations via matrix completion

Aditya Bhaskara, Aravinda Kanchana Ruwanpathirana, Maheshakya Wijewardena

Keywords Paper

0

0

0

0

2:48

06/12/2020

Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization

Benjamin Aubin, Florent Krzakala, Yue Lu, Lenka Zdeborová

Keywords Paper

0

0

0

0

3:08

14/09/2020

NoisyCUR: An algorithm for two-cost budgeted matrix completion

Dong Hu, Alex Gittens, Malik Magdon-Ismail

Keywords Paper

matrix completion, low-rank approximation, nuclear norm minimization

0

0

0

0

14:05

13/04/2021

Learning-to-rank with partitioned preference: Fast estimation for the plackett-luce model

Jiaqi Ma, Xinyang Yi, Weijing Tang and
Zhe Zhao, Lichan Hong, Ed Chi, Qiaozhu Mei

Keywords Paper

0

0

0

0

3:03

06/12/2020

Robust Sub-Gaussian Principal Component Analysis and Width-Independent Schatten Packing

Arun Jambulapati, Jerry Li, Kevin Tian

Keywords Paper

0

0

0

0

3:22

09/07/2020

Data-driven confidence bands for distributed nonparametric regression

Valeriy Avanesov

Keywords Paper

Kernel methods, Excess risk bounds and generalization error bounds, Regression, Sampling algorithms, Supervised learning

0

0

0

0

14:42

09/07/2020

Locally Private Hypothesis Selection

Sivakanth Gopi, Gautam Kamath, Janardhan D Kulkarni and
Aleksandar Nikolov, Steven Wu, Huanyu Zhang

Keywords Paper

Privacy, fairness, Distribution learning/testing

0

0

0

0

14:58

26/08/2020

Integrals over Gaussians under Linear Domain Constraints

Alexandra Gessner, Oindrila Kanjilal, Philipp Hennig

Keywords Paper

0

0

0

0

13:51

06/12/2021

Surrogate Regret Bounds for Polyhedral Losses

Rafael Frongillo, Bo Waggoner

Keywords Paper

machine learning

0

0

0

0

15:05

06/12/2021

Determinantal point processes based on orthogonal polynomials for sampling minibatches in SGD

Rémi Bardenet, Subhroshekhar Ghosh, Meixia LIN

Keywords Paper

optimization, machine learning

0

0

0

0

14:51

06/12/2021

Bayesian decision-making under misspecified priors with applications to meta-learning

Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy and
Daniel Hsu, Thodoris Lykouris, Miro Dudik, Robert E Schapire

Keywords Paper

meta learning, bandits

0

0

0

0

14:58

04/08/2021

Near-Optimal Entrywise Sampling of Numerically Sparse Matrices

Vladimir Braverman, Robert Krauthgamer, Aditya R Krishnan, Shay Sapir

Keywords Paper

0

0

0

0

16:59

06/12/2020

Tight First- and Second-Order Regret Bounds for Adversarial Linear Bandits

Shinji Ito, Shuichi Hirahara, Tasuku Soma, Yuichi Yoshida

Keywords Paper

0

0

0

0

3:24

26/08/2020

On the optimality of kernels for high-dimensional clustering

Leena C Vankadara, Debarghya Ghoshdastidar

Keywords Paper

0

0

0

0

12:25

06/12/2021

A Comprehensively Tight Analysis of Gradient Descent for PCA

Zhiqiang Xu, Ping Li

Keywords Paper

optimization

0

0

0

0

4:37

16/11/2020

Best-First Beam Search

Clara Meister, Ryan Cotterell, Tim Vieira

Keywords Paper

nlp tasks, exact search, decoding, heuristic algorithm

0

0

0

0

12:19