An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks

06/12/2021

An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks

Shashank Rajput, Kartik Sreenivasan, Dimitris Papailiopoulos, Amin Karbasi

Keywords: deep learning

Abstract Paper Similar Papers

Abstract: It is well known that modern deep neural networks are powerful enough to memorize datasets even when the labels have been randomized. Recently, Vershynin(2020) settled a long standing question by Baum(1988), proving that deep threshold networks can memorize $n$ points in $d$ dimensions using $\widetilde{\mathcal{O}}(e^{1/\delta^2}+\sqrt{n})$ neurons and $\widetilde{\mathcal{O}}(e^{1/\delta^2}(d+\sqrt{n})+n)$ weights, where $\delta$ is the minimum distance between the points. In this work, we improve the dependence on $\delta$ from exponential to almost linear, proving that $\widetilde{\mathcal{O}}(\frac{1}{\delta}+\sqrt{n})$ neurons and $\widetilde{\mathcal{O}}(\frac{d}{\delta}+n)$ weights are sufficient. Our construction uses Gaussian random weights only in the first layer, while all the subsequent layers use binary or integer weights. We also prove new lower bounds by connecting memorization in neural networks to the purely geometric problem of separating $n$ points on a sphere using hyperplanes.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

26/04/2020

Span Recovery for Deep Neural Networks with Applications to Input Obfuscation

Rajesh Jayaram, David P. Woodruff, Qiuyi Zhang

Keywords Paper

Span recovery, low rank neural networks, adversarial attack

0

0

0

0

5:19

03/05/2021

Deep Networks and the Multiple Manifold Problem

Sam Buchanan, Dar Gilboa, John Wright

Keywords Paper

low-dimensional structure, overparameterized neural networks, deep learning

0

0

0

0

5:14

06/12/2021

PLUGIn: A simple algorithm for inverting generative models with recovery guarantees

Babhru Joshi, Xiaowei Li, Yaniv Plan, Ozgur Yilmaz

Keywords Paper

deep learning, optimization, generative model

0

0

0

0

14:58

09/07/2020

A Corrective View of Neural Networks: Representation, Memorization and Learning

Dheeraj M Nagaraj, Guy Bresler

Keywords Paper

Neural networks/deep learning, Learning with algebraic or combinatorial structure, Supervised learning

0

0

0

0

13:38

06/12/2020

Optimal Lottery Tickets via Subset Sum: Logarithmic Over-Parameterization is Sufficient

Ankit Pensia, Shashank Rajput, Alliot Nagle and
Harit Vishwakarma, Dimitrios Papailiopoulos

Keywords Paper

Reinforcement Learning and Planning -> Model-Based RL; Reinforcement Learning and Planning -> Planning; Reinforcement Learning , Reinforcement Learning and Planning

0

0

0

0

3:30

03/05/2021

Why Are Convolutional Nets More Sample-Efficient than Fully-Connected Nets?

Zhiyuan Li, Yi Zhang, Sanjeev Arora

Keywords Paper

equivariance, fully-connected, sample complexity separation, convolutional neural networks

0

0

0

0

15:18

18/07/2021

Besov Function Approximation and Binary Classification on Low-Dimensional Manifolds Using Convolutional Residual Networks

Hao Liu, Minshuo Chen, Tuo Zhao, Wenjing Liao

Keywords Paper

Applications, Computer Vision, , Theory, Deep learning Theory

0

0

0

0

5:14

18/07/2021

Geometry of the Loss Landscape in Overparameterized Neural Networks: Symmetries and Invariances

Berfin Simsek, François Ged, Arthur Jacot and
Francesco Spadaro, Clement Hongler, Wulfram Gerstner, Johanni Brea

Keywords Paper

Theory, Algorithms, Representation Learning, Algorithms, Large Scale Learning; Applications, Natural Language Processing; Deep Learning, Efficient Inference Methods;

0

0

0

0

5:05

06/12/2021

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser and
Gleb Novikov, David Steurer, Stefan Tiegel

Keywords Paper

optimization

0

0

0

0

10:44

06/12/2020

Analytic Characterization of the Hessian in Shallow ReLU Models: A Tale of Symmetry

Yossi Arjevani, Michael Field

Keywords Paper

0

0

0

0

3:13

06/12/2020

The Flajolet-Martin Sketch Itself Preserves Differential Privacy: Private Counting with Minimal Space

Adam Smith, Shuang Song, Abhradeep Guha Thakurta

Keywords Paper

0

0

0

0

3:17

09/07/2020

How to trap a gradient flow

Dan Mikulincer, Sebastien Bubeck

Keywords Paper

Non-convex optimization,

0

0

0

0

15:01

06/12/2020

Robust Gaussian Covariance Estimation in Nearly-Matrix Multiplication Time

Jerry Li, Guanghao Ye

Keywords Paper

0

0

0

0

3:13

06/12/2020

On Adaptive Distance Estimation

Yeshwanth Cherapanamjeri, Jelani Nelson

Keywords Paper

0

0

0

0

3:16

14/06/2020

FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions

Alvin Wan, Xiaoliang Dai, Peizhao Zhang and
Zijian He, Yuandong Tian, Saining Xie, Bichen Wu, Matthew Yu, Tao Xu, Kan Chen, Peter Vajda, Joseph E. Gonzalez

Keywords Paper

nas, dnas, fbnet, state-of-the-art, imagenet, mobilenetv3, efficientnet, classification, neural architecture search, differentiable neural architecture search

0

0

0

0

1:01

06/12/2021

Analysis of one-hidden-layer neural networks via the resolvent method

Vanessa Piccolo, Dominik Schröder

Keywords Paper

theory, deep learning

0

0

0

0

11:28

06/12/2021

DominoSearch: Find layer-wise fine-grained N:M sparse schemes from dense neural networks

Wei Sun, Aojun Zhou, Sander Stuijk and
Rob Wijnhoven, Andrew Nelson, hongsheng Li, Henk Corporaal

Keywords Paper

deep learning

0

0

0

0

15:07

12/07/2020

Improving the Sample and Communication Complexity for Decentralized Non-Convex Optimization: Joint Gradient Estimation and Tracking

Haoran Sun, Songtao Lu, Mingyi Hong

Keywords Paper

Optimization - Non-convex

0

0

0

0

13:56

06/12/2021

On the Sample Complexity of Privately Learning Axis-Aligned Rectangles

Menachem Sadigurschi, Uri Stemmer

Keywords Paper

theory, privacy

0

0

0

0

14:00

06/12/2021

Improved Coresets and Sublinear Algorithms for Power Means in Euclidean Spaces

Vincent Cohen-Addad, David Saulpic, Chris Schwiegelshohn

Keywords Paper

clustering

0

0

0

0

16:06

06/12/2021

Efficient Equivariant Network

Lingshen He, Yuxuan Chen, zhengyang shen and
Yiming Dong, Yisen Wang, Zhouchen Lin

Keywords Paper

deep learning, vision

0

0

0

0

8:20

18/07/2021

PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization

Zhize Li, Hongyan Bao, Xiangliang Zhang, Peter Richtarik

Keywords Paper

Optimization

0

0

0

0

11:53

06/12/2021

Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training

Sheng Liu, Xiao Li, Yuexiang Zhai and
Chong You, Zhihui Zhu, Carlos Fernandez-Granda, Qing Qu

Keywords Paper

deep learning, machine learning, robustness, generative model

0

0

0

0

6:45

09/07/2020

Balancing Gaussian vectors in high dimension

Paxton M Turner, Raghu Meka, Philippe Rigollet

Keywords Paper

Combinatorial optimization, Approximation algorithms, Concentration inequalities, High-dimensional statistics, Stochastic optimization

0

0

0

0

13:39

06/12/2021

Tight High Probability Bounds for Linear Stochastic Approximation with Fixed Stepsize

Alain Durmus, Eric Moulines, Alexey Naumov and
Sergey Samsonov, Kevin Scaman, Hoi-To Wai

Keywords Paper

machine learning

0

0

0

0

12:53

06/12/2021

List-Decodable Mean Estimation in Nearly-PCA Time

Ilias Diakonikolas, Daniel Kane, Daniel Kongsgaard and
Jerry Li, Kevin Tian

Keywords Paper

theory, clustering

0

0

0

0

14:21

06/12/2020

Fourier Sparse Leverage Scores and Approximate Kernel Learning

Tamas Erdelyi, Cameron Musco, Christopher Musco

Keywords Paper

0

0

0

0

3:25

06/12/2021

Cardinality constrained submodular maximization for random streams

Paul Liu, Aviad Rubinstein, Jan Vondrak, Junyao Zhao

Keywords Paper

optimization

0

0

0

0

14:11

17/08/2020

Human-in-the-loop differential subspace search in high-dimensional latent space

Chia-Hsing Chiu, Yuki Koyama, Yu-Chi Lai and
Takeo Igarashi, Yonghao Yue

Keywords Paper

human-in-the-loop optimization, dimensionality reduction, generative models

0

0

0

0

19:48

06/12/2020

An Optimal Elimination Algorithm for Learning a Best Arm

Avinatan Hassidim, Ron Kupfer, Yaron Singer

Keywords Paper

0

0

0

0

3:23

18/07/2021

Towards Certifying L-infinity Robustness using Neural Networks with L-inf-dist Neurons

Bohang Zhang, Tianle Cai, Zhou Lu and
Di He, Liwei Wang

Keywords Paper

Algorithms, Adversarial Examples

0

0

0

0

5:11

03/05/2021

Learning-based Support Estimation in Sublinear Time

talyaa01 Eden, Piotr Indyk, Shyam Narayanan and
Ronitt Rubinfeld, Sandeep Silwal, Tal Wagner

Keywords Paper

chebyshev polynomial, distinct elements, learning-based, sublinear, support estimation

0

0

0

0

9:48

26/04/2020

Beyond Linearization: On Quadratic and Higher-Order Approximation of Wide Neural Networks

Yu Bai, Jason D. Lee

Keywords Paper

Neural Tangent Kernels, over-parametrized neural networks, deep learning theory

0

0

0

0

5:25

09/07/2020

Universal Approximation with Deep Narrow Networks

Patrick Kidger, Terry J Lyons

Keywords Paper

Neural networks/deep learning, Regression

0

0

0

0

13:40

06/12/2020

Linear-Sample Learning of Low-Rank Distributions

Ayush Jain, Alon Orlitsky

Keywords Paper

0

0

0

0

3:22

18/07/2021

Data-driven Prediction of General Hamiltonian Dynamics via Learning Exactly-Symplectic Maps

Renyi Chen, Molei Tao

Keywords Paper

Algorithms, Time Series and Sequences

0

0

0

0

5:21

06/12/2020

Asymptotic normality and confidence intervals for derivatives of 2-layers neural network in the random features model

Yiwei Shen, Pierre C Bellec

Keywords Paper

0

0

0

0

3:12

26/04/2020

Stochastic AUC Maximization with Deep Neural Networks

Mingrui Liu, Zhuoning Yuan, Yiming Ying, Tianbao Yang

Keywords Paper

Stochastic AUC Maximization, Deep Neural Networks

0

0

0

0

4:58

09/07/2020

Learning Polynomials in Few Relevant Dimensions

Sitan Chen, Raghu Meka

Keywords Paper

Regression, Convex optimization, High-dimensional statistics, Non-convex optimization

0

0

0

0

15:03

03/08/2020

Exponentially faster shortest paths in the congested clique

Michal Dory, Merav Parter

Keywords Paper

congested clique, shortest paths, near-additive emulator

0

0

0

0

23:50