Universal Approximation with Deep Narrow Networks

09/07/2020

Universal Approximation with Deep Narrow Networks

Patrick Kidger, Terry J Lyons

Keywords: Neural networks/deep learning, Regression

Abstract Paper Similar Papers

Abstract: The classical Universal Approximation Theorem holds for neural networks of arbitrary width and bounded depth. Here we consider the natural `dual' scenario for networks of bounded width and arbitrary depth. Precisely, let $n$ be the number of inputs neurons, $m$ be the number of output neurons, and let $\rho$ be any nonaffine continuous function, with a continuous nonzero derivative at some point. Then we show that the class of neural networks of arbitrary depth, width $n + m + 2$, and activation function $\rho$, is dense in $C(K; \mathbb{R}^m)$ for $K \subseteq \mathbb{R}^n$ with $K$ compact. This covers every activation function possible to use in practice, and also includes polynomial activation functions, which is unlike the classical version of the theorem, and provides a qualitative difference between deep narrow networks and shallow wide networks. We then consider several extensions of this result. In particular we consider nowhere differentiable activation functions, density in noncompact domains with respect to the $L^p$-norm, and how the width may be reduced to just $n + m + 1$ for `most' activation functions.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at COLT 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Geometry of the Loss Landscape in Overparameterized Neural Networks: Symmetries and Invariances

Berfin Simsek, François Ged, Arthur Jacot and
Francesco Spadaro, Clement Hongler, Wulfram Gerstner, Johanni Brea

Keywords Paper

Theory, Algorithms, Representation Learning, Algorithms, Large Scale Learning; Applications, Natural Language Processing; Deep Learning, Efficient Inference Methods;

0

0

0

0

5:05

03/05/2021

Large-width functional asymptotics for deep Gaussian neural networks

Daniele Bracale, Stefano Favaro, Sandra Fortini, Stefano Peluchetti

Keywords Paper

deep learning theory, stochastic process, Gaussian process, infinitely wide neural network

0

0

0

0

4:48

06/12/2020

A Universal Approximation Theorem of Deep Neural Networks for Expressing Probability Distributions

Yulong Lu, Jianfeng Lu

Keywords Paper

0

0

0

0

2:55

03/05/2021

Deep Neural Tangent Kernel and Laplace Kernel Have the Same RKHS

Lin Chen, Sheng Xu

Keywords Paper

Laplace kernel, Reproducing kernel Hilbert space, Neural tangent kernel, Singularity analysis

0

0

0

0

5:04

12/07/2020

Better depth-width trade-offs for neural networks through the lens of dynamical systems

Evangelos Chatziafratis, Ioannis Panageas, Sai Ganesh Nagarajan

Keywords Paper

Deep Learning - Theory

0

0

0

0

16:21

26/04/2020

A closer look at the approximation capabilities of neural networks

Kai Fong Ernest Chong

Keywords Paper

deep learning, approximation, universal approximation theorem

0

0

0

0

5:06

26/04/2020

Span Recovery for Deep Neural Networks with Applications to Input Obfuscation

Rajesh Jayaram, David P. Woodruff, Qiuyi Zhang

Keywords Paper

Span recovery, low rank neural networks, adversarial attack

0

0

0

0

5:19

06/12/2021

The Complexity of Sparse Tensor PCA

Davin Choo, Tommaso d'Orsi

Keywords Paper

0

0

0

0

15:10

09/07/2020

A Corrective View of Neural Networks: Representation, Memorization and Learning

Dheeraj M Nagaraj, Guy Bresler

Keywords Paper

Neural networks/deep learning, Learning with algebraic or combinatorial structure, Supervised learning

0

0

0

0

13:38

08/07/2020

On the Degree of Boolean Functions as Polynomials over ℤ_m

Xiaoming Sun, Yuan Sun, Jiaheng Wang and
Kewen Wu, Zhiyu Xia, Yufan Zheng

Keywords Paper

Boolean function, polynomial, modular degree, Ramsey theory

0

0

0

0

17:41

03/05/2021

Deep Networks and the Multiple Manifold Problem

Sam Buchanan, Dar Gilboa, John Wright

Keywords Paper

low-dimensional structure, overparameterized neural networks, deep learning

0

0

0

0

5:14

09/07/2020

How to trap a gradient flow

Dan Mikulincer, Sebastien Bubeck

Keywords Paper

Non-convex optimization,

0

0

0

0

15:01

06/12/2020

A Single-Loop Smoothed Gradient Descent-Ascent Algorithm for Nonconvex-Concave Min-Max Problems

Jiawei Zhang, Peijun Xiao, Ruoyu Sun, Zhiquan Luo

Keywords Paper

0

0

0

0

3:12

03/05/2021

Universal approximation power of deep residual neural networks via nonlinear control theory

Paulo Tabuada, Bahman Gharesifard

Keywords Paper

nonlinear control theory, Deep residual neural networks, universal approximation

0

0

0

0

4:48

06/12/2021

An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks

Shashank Rajput, Kartik Sreenivasan, Dimitris Papailiopoulos, Amin Karbasi

Keywords Paper

deep learning

0

0

0

0

9:58

06/12/2020

Dynamic Submodular Maximization

Technische Monemizadeh

Keywords Paper

0

0

0

0

3:08

06/12/2021

Analytic Study of Families of Spurious Minima in Two-Layer ReLU Neural Networks: A Tale of Symmetry II

Yossi Arjevani, Michael Field

Keywords Paper

theory, deep learning, optimization

0

0

0

0

8:40

06/12/2020

Neural Networks with Small Weights and Depth-Separation Barriers

Gal Vardi, Ohad Shamir

Keywords Paper

0

0

0

0

3:08

03/05/2021

Why Are Convolutional Nets More Sample-Efficient than Fully-Connected Nets?

Zhiyuan Li, Yi Zhang, Sanjeev Arora

Keywords Paper

equivariance, fully-connected, sample complexity separation, convolutional neural networks

0

0

0

0

15:18

13/04/2021

On the number of linear functions composing deep neural network: Towards a refined definition of neural networks complexity

Yuuki Takai, Akiyoshi Sannai, Matthieu Cordonnier

Keywords Paper

0

0

0

0

3:09

18/07/2021

Two-way kernel matrix puncturing: towards resource-efficient PCA and spectral clustering

Romain COUILLET, Florent Chatelain, Nicolas Le Bihan

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:19

06/12/2021

On the Equivalence between Neural Network and Support Vector Machine

Yilan Chen, Wei Huang, Lam Nguyen, Tsui-Wei Weng

Keywords Paper

theory, deep learning, optimization, robustness

0

0

0

0

15:12

09/07/2020

Private Mean Estimation of Heavy-Tailed Distributions

Gautam Kamath, Vikrant Singhal, Jonathan Ullman

Keywords Paper

Privacy, fairness, Distribution learning/testing

0

0

0

0

13:24

06/12/2020

Sparse and Continuous Attention Mechanisms

André Martins, António Farinhas, Marcos Treviso and
Vlad Niculae, Pedro Aguiar, Mario Figueiredo

Keywords Paper

0

0

0

0

3:17

06/12/2021

RED : Looking for Redundancies for Data-FreeStructured Compression of Deep Neural Networks

Edouard YVINEC, Arnaud Dapogny, Matthieu Cord, Kevin Bailly

Keywords Paper

deep learning, vision

0

0

0

0

14:59

26/04/2020

On Universal Equivariant Set Networks

Nimrod Segol, Yaron Lipman

Keywords Paper

deep learning, universality, set functions, equivariance

0

0

0

0

5:02

18/07/2021

Tight Bounds on the Smallest Eigenvalue of the Neural Tangent Kernel for Deep ReLU Networks

Quynh Nguyen, Marco Mondelli, Guido Montufar

Keywords Paper

Theory, Deep learning Theory

0

0

0

0

5:04

26/04/2020

Functional vs. parametric equivalence of ReLU networks

Mary Phuong, Christoph H. Lampert

Keywords Paper

ReLU networks, symmetry, functional equivalence, over-parameterization

0

0

0

0

5:15

18/07/2021

Dimensionality Reduction for the Sum-of-Distances Metric

Zhili Feng, Praneeth Kacham, David Woodruff

Keywords Paper

Neuroscience and Cognitive Science, Deep Learning, Biologically Plausible Deep Networks; Neuroscience and Cognitive Science, Connectomics; Neuroscience and Cog, Algorithms, Dimensionality Reduction

0

0

0

0

17:12

12/07/2020

Frequency Bias in Neural Networks for Input of Non-Uniform Density

Ronen Basri, Meirav Galun, Amnon Geifman and
David Jacobs, Yoni Kasten, Shira Kritchman

Keywords Paper

Deep Learning - Theory

0

0

0

0

11:18

20/07/2020

Neural network integral representations with the ReLU activation function

Armenak Petrosyan, Anton Dereventsov, Clayton G. Webster

Keywords Paper

0

0

0

0

12:37

06/12/2020

Deterministic Approximation for Submodular Maximization over a Matroid in Nearly Linear Time

Kai Han, zongmai Cao, Shuang Cui, Benwei Wu

Keywords Paper

0

0

0

0

3:16

06/12/2021

An Online Riemannian PCA for Stochastic Canonical Correlation Analysis

Zihang Meng, Rudrasis Chakraborty, Vikas Singh

Keywords Paper

optimization, fairness

0

0

0

0

14:14

06/12/2021

A Non-commutative Extension of Lee-Seung's Algorithm for Positive Semidefinite Factorizations

Yong Sheng Soh, Antonios Varvitsiotis

Keywords Paper

theory, optimization

0

0

0

0

13:34

06/12/2021

Efficient Algorithms for Learning Depth-2 Neural Networks with General ReLU Activations

Pranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan

Keywords Paper

theory, deep learning

0

0

0

0

14:31

18/07/2021

Besov Function Approximation and Binary Classification on Low-Dimensional Manifolds Using Convolutional Residual Networks

Hao Liu, Minshuo Chen, Tuo Zhao, Wenjing Liao

Keywords Paper

Applications, Computer Vision, , Theory, Deep learning Theory

0

0

0

0

5:14

12/07/2020

Second-Order Provable Defenses against Adversarial Attacks

Sahil Singla, Soheil Feizi

Keywords Paper

Adversarial Examples

0

0

0

0

12:45

06/12/2020

Analytic Characterization of the Hessian in Shallow ReLU Models: A Tale of Symmetry

Yossi Arjevani, Michael Field

Keywords Paper

0

0

0

0

3:13

12/07/2020

A Mean Field Analysis Of Deep ResNet And Beyond: Towards Provably Optimization Via Overparameterization From Depth

Yiping Lu, Chao Ma, Yulong Lu and
Jianfeng Lu, Lexing Ying

Keywords Paper

Deep Learning - Theory

0

0

0

0

4:37

18/07/2021

Regularized Submodular Maximization at Scale

Ehsan Kazemi, shervin minaee, Moran Feldman, Amin Karbasi

Keywords Paper

Optimization, Combinatorial Optimization

0

0

0

0

5:17