Representation Costs of Linear Neural Networks: Analysis and Design

06/12/2021

Representation Costs of Linear Neural Networks: Analysis and Design

Zhen Dai, Mina Karzand, Nathan Srebro

Keywords: deep learning

Abstract Paper Similar Papers

Abstract: For different parameterizations (mappings from parameters to predictors), we study the regularization cost in predictor space induced by $l_2$ regularization on the parameters (weights). We focus on linear neural networks as parameterizations of linear predictors. We identify the representation cost of certain sparse linear ConvNets and residual networks. In order to get a better understanding of how the architecture and parameterization affect the representation cost, we also study the reverse problem, identifying which regularizers on linear predictors (e.g., $l_p$ norms, group norms, the $k$-support-norm, elastic net) can be the representation cost induced by simple $l_2$ regularization, and designing the parameterizations that do so.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

On Linear Stability of SGD and Input-Smoothness of Neural Networks

Chao Ma, Lexing Ying

Keywords Paper

deep learning, optimization, robustness, adversarial robustness and security

0

0

0

0

10:15

13/04/2021

Influence decompositions for neural network attribution

Kyle Reing, Greg Ver Steeg, Aram Galstyan

Keywords Paper

0

0

0

0

2:52

12/07/2020

Topologically Densified Distributions

Christoph Hofer, Florian Graf, Marc Niethammer, Roland Kwitt

Keywords Paper

Deep Learning - Theory

0

0

0

0

15:07

13/04/2021

Graphical normalizing flows

Antoine Wehenkel, Gilles Louppe

Keywords Paper

0

0

0

0

3:04

03/05/2021

Wasserstein-2 Generative Networks

Alexander Korotin, Vage Egiazarian, Arip Asadulaev and
Alexander Safin, Evgeny Burnaev

Keywords Paper

input-convex neural networks, cycle-consistency regularization, non-minimax optimization, optimal transport maps, wasserstein-2 distance

0

0

0

1

5:10

06/12/2021

DNN-based Topology Optimisation: Spatial Invariance and Neural Tangent Kernel

Benjamin Dupuis, Arthur Jacot

Keywords Paper

deep learning, optimization

0

0

0

0

9:39

26/04/2020

Gradient $\ell_1$ Regularization for Quantization Robustness

Milad Alizadeh, Arash Behboodi, Mart van Baalen and
Christos Louizos, Tijmen Blankevoort, Max Welling

Keywords Paper

quantization, regularization, robustness, gradient regularization

0

0

0

0

5:01

06/12/2021

Lattice partition recovery with dyadic CART

OSCAR HERNAN MADRID PADILLA, Yi Yu, Alessandro Rinaldo

Keywords Paper

machine learning, graph learning

0

0

0

0

13:36

26/04/2020

Pure and Spurious Critical Points: a Geometric Study of Linear Networks

Matthew Trager, Kathlén Kohn, Joan Bruna

Keywords Paper

Loss landscape, linear networks, algebraic geometry

0

0

0

0

5:22

03/08/2020

Distributed computation and reconfiguration in actively dynamic networks

Othon Michail, George Skretas, Paul G. Spirakis

Keywords Paper

polylogarithmic time, distributed algorithms, edge complexity, transformation, reconfiguration, dynamic networks

0

0

0

0

24:10

06/12/2021

On Path Integration of Grid Cells: Group Representation and Isotropic Scaling

Ruiqi Gao, Jianwen Xie, Xue-Xin Wei and
Song-Chun Zhu, Ying Nian Wu

Keywords Paper

optimization, representation learning

0

0

0

0

16:11

06/12/2021

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

Keywords Paper

machine learning, online learning

0

0

0

0

14:00

03/05/2021

A unifying view on implicit bias in training linear neural networks

Chulhee (Charlie) Yun, Shankar Krishnan, Hossein Mobahi

Keywords Paper

convergence, implicit bias, gradient flow, implicit regularization, gradient descent

0

0

0

0

5:24

14/06/2020

Bayesian Adversarial Human Motion Synthesis

Rui Zhao, Hui Su, Qiang Ji

Keywords Paper

probabilistic graphical model, adversarial learning, bayesian inference, data synthesis, data restoration, motion capture, human dynamics

0

0

0

0

5:00

26/08/2020

Convex Geometry of Two-Layer ReLU Networks: Implicit Autoencoding and Interpretable Models

Tolga Ergen, Mert Pilanci

Keywords Paper

0

0

0

0

14:07

18/07/2021

Regularized Submodular Maximization at Scale

Ehsan Kazemi, shervin minaee, Moran Feldman, Amin Karbasi

Keywords Paper

Optimization, Combinatorial Optimization

0

0

0

0

5:17

06/12/2021

Score-based Generative Neural Networks for Large-Scale Optimal Transport

Grady Daniels, Tyler Maunu, Paul Hand

Keywords Paper

deep learning, optimization, generative model, optimal transport

0

0

0

0

13:48

14/06/2020

There and Back Again: Revisiting Backpropagation Saliency Methods

Sylvestre-Alvise Rebuffi, Ruth Fong, Xu Ji, Andrea Vedaldi

Keywords Paper

saliency, attribution, unifying framework, interpretability, backpropagation methods

0

0

0

0

1:01

06/12/2021

Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation

Kenneth Borup, Lars N Andersen

Keywords Paper

theory, deep learning, optimization

0

0

0

0

6:00

02/02/2021

Multi-Goal Multi-Agent Path Finding via Decoupled and Integrated Goal Vertex Ordering

Pavel Surynek

Keywords Paper

0

0

0

0

17:42

13/04/2021

Implicit regularization via neural feature alignment

Aristide Baratin, Thomas George, César Laurent and
R Devon Hjelm, Guillaume Lajoie, Pascal Vincent, Simon Lacoste-Julien

Keywords Paper

0

0

0

0

3:15

03/05/2021

Group Equivariant Conditional Neural Processes

Makoto Kawano, Wataru Kumagai, Akiyoshi Sannai and
Yusuke Iwasawa, Yutaka Matsuo

Keywords Paper

Regression, Stochastic Processes, Conditional Neural Processes, Neural Processes, Symmetry, Group Equivariance

0

0

0

0

5:12

18/07/2021

A Wasserstein Minimax Framework for Mixed Linear Regression

Theo Diamandis, Yonina Eldar, Alireza Fallah and
Farzan Farnia, Asuman Ozdaglar

Keywords Paper

Algorithms, Multimodal Learning

0

0

0

0

25:41

03/05/2021

Deep Networks and the Multiple Manifold Problem

Sam Buchanan, Dar Gilboa, John Wright

Keywords Paper

low-dimensional structure, overparameterized neural networks, deep learning

0

0

0

0

5:14

12/07/2020

Learning Similarity Metrics for Numerical Simulations

Georg Kohl, Kiwon Um, Nils Thuerey

Keywords Paper

General Machine Learning Techniques

0

0

0

0

15:16

26/04/2020

Gradient Descent Maximizes the Margin of Homogeneous Neural Networks

Kaifeng Lyu, Jian Li

Keywords Paper

margin, homogeneous, gradient descent

0

0

0

0

15:02

06/12/2020

On the training dynamics of deep networks with $L_2$ regularization

Aitor Lewkowycz, Guy Gur-Ari

Keywords Paper

0

0

0

0

3:24

02/02/2021

Implicit Kernel Attention

Kyungwoo Song, Yohan Jung, Dongjun Kim, Il-Chul Moon

Keywords Paper

0

0

0

0

17:22

18/07/2021

Low-Rank Sinkhorn Factorization

Meyer Scetbon, Marco Cuturi, Gabriel Peyré

Keywords Paper

Algorithms, Optimal Transport

0

1

1

1

5:22

06/12/2020

On the Modularity of Hypernetworks

Tomer Galanti, Lior Wolf

Keywords Paper

0

0

0

0

3:12

26/08/2020

Multi-level Gaussian Graphical Models Conditional on Covariates

Gi Bum Kim, Seyoung Kim

Keywords Paper

0

0

0

0

12:45

18/07/2021

Principled Simplicial Neural Networks for Trajectory Prediction

Mitch Roddenberry, Nicholas Glaze, Santiago Segarra

Keywords Paper

Deep Learning, Others

0

0

0

0

16:54

14/09/2020

Flexible Recurrent Neural Networks

Anne Lambert, Francoise Le Bolzer, Francois Schnitzler

Keywords Paper

recurrent neural networks, flexibility, edge computing

0

0

0

0

13:29

18/07/2021

On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent

Shahar Azulay, Edward Moroshko, Mor Shpigel Nacson and
Blake Woodworth, Nati Srebro, Amir Globerson, Daniel Soudry

Keywords Paper

, Probabilistic Methods, MCMC, Theory, Deep learning Theory

0

0

0

0

15:38

26/08/2020

Private Protocols for U-Statistics in the Local Model and Beyond

James Bell, Aurélien Bellet, Adria Gascon, Tejas Kulkarni

Keywords Paper

0

0

0

0

13:20

12/07/2020

Fiedler Regularization: Learning Neural Networks with Graph Sparsity

Edric Tam, David Dunson

Keywords Paper

Supervised Learning

0

0

0

0

15:31

02/02/2021

Partial-Label and Structure-constrained Deep Coupled Factorization Network

Yan Zhang, Zhao Zhang, Yang Wang and
Zheng Zhang, Li Zhang, Shuicheng Yan, Meng Wang

Keywords Paper

0

0

0

0

13:39

03/05/2021

Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability

Suraj Srinivas, François Fleuret

Keywords Paper

Interpretability, saliency maps, score-matching

0

0

0

0

15:08

03/05/2021

Universal approximation power of deep residual neural networks via nonlinear control theory

Paulo Tabuada, Bahman Gharesifard

Keywords Paper

nonlinear control theory, Deep residual neural networks, universal approximation

0

0

0

0

4:48

06/12/2020

Multi-task Causal Learning with Gaussian Processes

Virginia Aglietti, Theo Damoulas, Mauricio Álvarez, Javier González

Keywords Paper

0

0

0

0

3:14