Choosing the Sample with Lowest Loss makes SGD Robust

26/08/2020

Choosing the Sample with Lowest Loss makes SGD Robust

Vatsal Shah, Xiaoxia Wu, Sujay Sanghavi

Keywords:

Abstract Paper Similar Papers

Abstract: The presence of outliers can potentially significantly skew the parameters of machine learning models trained via stochastic gradient descent (SGD). In this paper we propose a simple variant of the simple SGD method: in each step, first choose a set of k samples, then from these choose the one with the smallest current loss, and do an SGD-like update with this chosen sample. Vanilla SGD corresponds to k=1, i.e. no choice; k>=2 represents a new algorithm that is however effectively minimizing a non-convex surrogate loss. Our main contribution is a theoretical analysis of the robustness properties of this idea for ML problems which are sums of convex losses; these are backed up with synthetic and small-scale neural network experiments.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AISTATS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

On Learning Ising Models under Huber's Contamination Model

Adarsh Prasad, Vishwak Srinivasan, Sivaraman Balakrishnan, Pradeep Ravikumar

Keywords Paper

0

0

0

0

3:16

04/08/2021

Outlier-Robust Learning of Ising Models Under Dobrushin's Condition

Ilias Diakonikolas, Daniel M. Kane, Alistair Stewart, Yuxin Sun

Keywords Paper

0

0

0

0

16:22

06/12/2021

On the Role of Optimization in Double Descent: A Least Squares Study

Ilja Kuzborskij, Csaba Szepesvari, Omar Rivasplata and
Amal Rannen-Triki, Razvan Pascanu

Keywords Paper

theory, deep learning, optimization

0

0

0

0

13:48

06/12/2020

A Novel Approach for Constrained Optimization in Graphical Models

Sara Rouhani, Tahrima Rahman, Vibhav Gogate

Keywords Paper

0

0

0

0

3:21

06/12/2020

A Contour Stochastic Gradient Langevin Dynamics Algorithm for Simulations of Multi-modal Distributions

Wei Deng, Guang Lin, Faming Liang

Keywords Paper

0

0

0

0

3:26

06/12/2020

Coresets for Near-Convex Functions

Murad Tukan, Alaa Maalouf, Dan Feldman

Keywords Paper

0

0

0

0

3:22

06/12/2020

A Single-Loop Smoothed Gradient Descent-Ascent Algorithm for Nonconvex-Concave Min-Max Problems

Jiawei Zhang, Peijun Xiao, Ruoyu Sun, Zhiquan Luo

Keywords Paper

0

0

0

0

3:12

18/07/2021

Robust Unsupervised Learning via L-statistic Minimization

Andreas Maurer, Daniela Angela Parletta, Andrea Paudice, Massimiliano Pontil

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:03

06/12/2020

Statistical-Query Lower Bounds via Functional Gradients

Surbhi Goel, Aravind Gollakota, Adam Klivans

Keywords Paper

0

0

0

0

3:24

06/12/2020

Towards Scalable Bayesian Learning of Causal DAGs

Jussi Viinikka, Antti Hyttinen, Johan Pensar, Mikko Koivisto

Keywords Paper

Theory -> Learning Theory, Theory -> Frequentist Statistics

0

0

0

0

3:25

06/12/2020

A Statistical Mechanics Framework for Task-Agnostic Sample Design in Machine Learning

Bhavya Kailkhura, Jayaraman Thiagarajan, Qunwei Li and
Jize Zhang, Yi Zhou, Timo Bremer

Keywords Paper

0

0

0

0

3:21

18/07/2021

Dissecting Supervised Constrastive Learning

Florian Graf, Christoph Hofer, Marc Niethammer, Roland Kwitt

Keywords Paper

Theory, Deep learning Theory

0

0

0

0

17:13

06/12/2020

Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation

Devavrat Shah, Dogyoon Song, Zhi Xu, Yuzhe Yang

Keywords Paper

0

0

0

0

3:22

06/12/2020

DAGs with No Fears: A Closer Look at Continuous Optimization for Learning Bayesian Networks

Dennis Wei, Tian Gao, Yue Yu

Keywords Paper

0

0

0

0

3:23

13/04/2021

Regularized ERM on random subspaces

Andrea Della Vecchia, Jaouad Mourtada, Ernesto De Vito, Lorenzo Rosasco

Keywords Paper

0

0

0

0

2:57

09/07/2020

How Good is SGD with Random Shuffling?

Itay M Safran, Ohad Shamir

Keywords Paper

Convex optimization,

0

0

0

0

11:50

26/08/2020

Structured Conditional Continuous Normalizing Flows for Efficient Amortized Inference in Graphical Models

Christian Weilbach, Boyan Beronov, Frank Wood, William Harvey

Keywords Paper

0

0

0

0

14:27

13/04/2021

Gradient descent in RKHS with importance labeling

Tomoya Murata, Taiji Suzuki

Keywords Paper

0

0

0

0

3:04

03/05/2021

Few-Shot Learning via Learning the Representation, Provably

Simon Du, Wei Hu, Sham M Kakade and
Jason Lee, Qi Lei

Keywords Paper

statistical learning theory, representation learning

0

0

0

0

6:29

12/07/2020

Streaming k-Submodular Maximization under Noise subject to Size Constraint

Lan N. Nguyen, My T. Thai

Keywords Paper

Optimization - General

0

0

1

1

14:52

14/06/2020

Select to Better Learn: Fast and Accurate Deep Learning Using Data Selection From Nonlinear Manifolds

Mohsen Joneidi, Saeed Vahidian, Ashkan Esmaeili and
Weijia Wang, Nazanin Rahnavard, Bill Lin, Mubarak Shah

Keywords Paper

data sebset selection, spectrum pursuit, open-set identification, few shot classification, generative adversarial networks and deep learning

0

0

0

0

1:00

06/12/2021

On the Power of Differentiable Learning versus PAC and SQ Learning

Emmanuel Abbe, Pritish Kamath, Eran Malach and
Colin Sandon, Nathan Srebro

Keywords Paper

theory, deep learning, optimization

0

0

0

0

14:57

22/06/2020

Efficiently learning structured distributions from untrusted batches

Sitan Chen, Jerry Li, Ankur Moitra

Keywords Paper

sum-of-squares, federated learning, VC complexity, Robust statistics

0

0

0

0

24:38

06/12/2020

On the Tightness of Semidefinite Relaxations for Certifying Robustness to Adversarial Examples

Richard Zhang

Keywords Paper

0

0

0

0

3:13

06/12/2021

Learned Robust PCA: A Scalable Deep Unfolding Approach for High-Dimensional Outlier Detection

HanQin Cai, Jialin Liu, Wotao Yin

Keywords Paper

deep learning, machine learning

0

0

0

0

8:07

12/07/2020

Minimally distorted Adversarial Examples with a Fast Adaptive Boundary Attack

Francesco Croce, Matthias Hein

Keywords Paper

Adversarial Examples

0

0

0

0

15:12

06/12/2021

Determinantal point processes based on orthogonal polynomials for sampling minibatches in SGD

Rémi Bardenet, Subhroshekhar Ghosh, Meixia LIN

Keywords Paper

optimization, machine learning

0

0

0

0

14:51

12/07/2020

Consistent Structured Prediction with Max-Min Margin Markov Networks

Alex Nowak, Francis Bach, Alessandro Rudi

Keywords Paper

Sequential, Network, and Time-Series Modeling

0

0

0

0

13:42

06/12/2020

Adaptive Discretization for Model-Based Reinforcement Learning

Sean Sinclair, Tianyu Wang, Gauri Jain and
Sid Banerjee, Christina Yu

Keywords Paper

0

0

0

0

3:12

18/07/2021

Exact Optimization of Conformal Predictors via Incremental and Decremental Learning

Giovanni Cherubin, Konstantinos Chatzikokolakis, Martin Jaggi

Keywords Paper

Probabilistic Methods

0

0

0

0

5:48

06/12/2020

Improved guarantees and a multiple-descent curve for Column Subset Selection and the Nystrom method

Michal Derezinski, Rajiv Khanna, Michael W Mahoney

Keywords Paper

0

0

0

0

3:30

06/12/2020

AI Feynman 2.0: Pareto-optimal symbolic regression exploiting graph modularity

Silviu-Marian Udrescu, Andrew Tan, Jiahai Feng and
Orisvaldo Neto, Tailin Wu, Max Tegmark

Keywords Paper

0

0

0

0

3:13

14/06/2020

Adaptive Hierarchical Down-Sampling for Point Cloud Classification

Ehsan Nezhadarya, Ehsan Taghavi, Ryan Razani and
Bingbing Liu, Jun Luo

Keywords Paper

critical points layer, pooling layer, graph neural networks, point cloud, down-sampling

0

0

0

0

1:00

12/07/2020

Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning

Qing Li, Siyuan Huang, Yining Hong and
Yixin Chen, Ying Nian Wu, Song-Chun Zhu

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:01

06/12/2021

Robustifying Algorithms of Learning Latent Trees with Vector Variables

Fengzhuo Zhang, Vincent Tan

Keywords Paper

theory, graph learning

0

0

0

0

13:21

06/12/2020

Extrapolation Towards Imaginary 0-Nearest Neighbour and Its Improved Convergence Rate

Akifumi Okuno, Hidetoshi Shimodaira

Keywords Paper

0

0

0

0

3:14

12/07/2020

Semiparametric Nonlinear Bipartite Graph Representation Learning with Provable Guarantees

Sen Na, Yuwei Luo, Zhuoran Yang and
Zhaoran Wang, Mladen Kolar

Keywords Paper

Representation Learning

0

0

0

0

13:33

26/08/2020

Conditional Linear Regression

Diego Calderon, Brendan Juba, Sirui Li and
Zongyi Li, Lisa Ruan

Keywords Paper

0

0

0

0

14:31

14/06/2020

On the Regularization Properties of Structured Dropout

Ambar Pal, Connor Lane, René Vidal, Benjamin D. Haeffele

Keywords Paper

dropout, regularization, dropblock, dropconnect, neural networks, optimization, low rank, nuclear norm, k-support norm

0

0

0

0

1:01

12/07/2020

Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime

Stéphane d'Ascoli, Maria Refinetti, Giulio Biroli, Florent Krzakala

Keywords Paper

Deep Learning - Theory

0

0

0

0

15:11