Separation and Concentration in Deep Networks

03/05/2021

John Zarka, Florentin Guth, Stéphane Mallat

Keywords: concentration, mean separation, neural collapse, fisher ratio, image classification, variance reduction, deep learning

Abstract: Numerical experiments demonstrate that deep neural network classifiers progressively separate class distributions around their means, achieving linear separability on the training set and increasing the Fisher discriminant ratio. We explain this mechanism with two types of operators. We prove that a rectifier without biases applied to sign-invariant tight frames can separate class means and increase Fisher ratios. Conversely, a soft-thresholding on tight frames can reduce within-class variabilities while preserving class means. Variance reduction bounds are proved for Gaussian mixture models. For image classification, we show that separation of class means can be achieved with rectified wavelet tight frames that are not learned. This defines a scattering transform. Learning $1 \times 1$ convolutional tight frames along scattering channels and applying a soft-thresholding reduces within-class variabilities. The resulting scattering network reaches the classification accuracy of ResNet-18 on CIFAR-10 and ImageNet, with fewer layers and no learned biases.
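The two operators named in the abstract can be illustrated on toy data. The following is a minimal NumPy sketch, not the authors' implementation: the synthetic datasets, the frame `[I; -I] / sqrt(2)`, the threshold value, and the helper names (`scatter_traces`, `trace_ratio`) are all assumptions chosen for illustration. As a simplification, the Fisher ratio is replaced by the scalar ratio of the traces of the between-class and within-class covariances.

```python
import numpy as np

def relu(x):
    # Rectifier without bias.
    return np.maximum(x, 0.0)

def soft_threshold(x, lam):
    # Shrinks every coefficient toward zero by lam; zeroes those below lam.
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)

def scatter_traces(feats, labels):
    # Traces of the between-class and within-class covariances
    # (classes weighted equally; a simplified stand-in for the Fisher ratio).
    classes = np.unique(labels)
    mu = np.mean([feats[labels == c].mean(axis=0) for c in classes], axis=0)
    between = within = 0.0
    for c in classes:
        fc = feats[labels == c]
        mu_c = fc.mean(axis=0)
        between += np.sum((mu_c - mu) ** 2) / len(classes)
        within += np.mean(np.sum((fc - mu_c) ** 2, axis=1)) / len(classes)
    return between, within

def trace_ratio(feats, labels):
    between, within = scatter_traces(feats, labels)
    return between / within

rng = np.random.default_rng(0)
n = 1000
labels = np.repeat([0, 1], n)

# Toy data 1 (assumed): both classes have (nearly) zero mean, but class 0
# lives on +/- e1 and class 1 on +/- e2, so class means are not separated.
signs = rng.choice([-1.0, 1.0], size=(2 * n, 1))
x = signs * np.eye(2)[labels] + 0.1 * rng.standard_normal((2 * n, 2))

# Sign-invariant tight frame: contains each row together with its negative,
# and satisfies frame.T @ frame = I.
frame = np.vstack([np.eye(2), -np.eye(2)]) / np.sqrt(2.0)

ratio_raw = trace_ratio(x, labels)
ratio_relu = trace_ratio(relu(x @ frame.T), labels)
print("trace ratio, raw vs. rectified frame:", ratio_raw, ratio_relu)

# Toy data 2 (assumed): two Gaussians whose means are sparse in the
# canonical basis (an orthonormal basis is a special case of a tight frame).
d = 16
means = np.zeros((2, d))
means[0, 0], means[1, 0] = 3.0, -3.0
y = means[labels] + 0.5 * rng.standard_normal((2 * n, d))

_, within_raw = scatter_traces(y, labels)
_, within_soft = scatter_traces(soft_threshold(y, 1.0), labels)
print("within-class trace, raw vs. soft-thresholded:", within_raw, within_soft)
```

In the first toy dataset the rectified frame coefficients have distinct class means even though the inputs do not. In the second, thresholding suppresses the noise coordinates; the class means here are shrunk by roughly the threshold but stay well separated.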

The talk and the corresponding paper were published at the ICLR 2021 virtual conference.

Similar Papers

12/07/2020

Don't Waste Your Bits! Squeeze Activations and Gradients for Deep Neural Networks via TinyScript

Fangcheng Fu, Yuzheng Hu, Yihan He, Jiawei Jiang, Yingxia Shao, Ce Zhang, Bin Cui

Optimization - Large Scale, Parallel and Distributed

9:59

18/07/2021

Understanding self-supervised learning dynamics without contrastive pairs

Yuandong Tian, Xinlei Chen, Surya Ganguli

Deep Learning, Optimization for Deep Networks

18:16

12/07/2020

Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime

Stéphane d'Ascoli, Maria Refinetti, Giulio Biroli, Florent Krzakala

Deep Learning - Theory

15:11

26/08/2020

Structured Conditional Continuous Normalizing Flows for Efficient Amortized Inference in Graphical Models

Christian Weilbach, Boyan Beronov, Frank Wood, William Harvey

14:27

06/12/2020

Robust Federated Learning: The Case of Affine Distribution Shifts

Amirhossein Reisizadeh, Farzan Farnia, Ramtin Pedarsani, Ali Jadbabaie

3:16

18/07/2021

A Wasserstein Minimax Framework for Mixed Linear Regression

Theo Diamandis, Yonina Eldar, Alireza Fallah, Farzan Farnia, Asuman Ozdaglar

Algorithms, Multimodal Learning

25:41

12/07/2020

Scalable Differential Privacy with Certified Robustness in Adversarial Learning

Hai Phan, My T. Thai, Han Hu, Ruoming Jin, Tong Sun, Dejing Dou

Privacy-preserving Statistics and Machine Learning

14:47

12/07/2020

Training Neural Networks for and by Interpolation

Leonard Berrada, M. Pawan Kumar, Andrew Zisserman

Deep Learning - General

16:12

06/12/2021

Diffusion Normalizing Flow

Qinsheng Zhang, Yongxin Chen

generative model

9:09

06/12/2020

Tight Nonparametric Convergence Rates for Stochastic Gradient Descent under the Noiseless Linear Model

Raphaël Berthier, Francis Bach, Pierre Gaillard

Optimization -> Non-Convex Optimization, Deep Learning -> Optimization for Deep Networks

3:05

06/12/2021

The Implicit Bias of Minima Stability: A View from Function Space

Rotem Mulayoff, Tomer Michaeli, Daniel Soudry

deep learning, optimization

13:51

06/12/2020

Understanding Double Descent Requires A Fine-Grained Bias-Variance Decomposition

Ben Adlam, Jeffrey Pennington

3:30

12/07/2020

Distance Metric Learning with Joint Representation Diversification

Xu Chu, Yang Lin, Xiting Wang, Xin Gao, Qi Tong, Hailong Yu, Yasha Wang

Applications - Computer Vision

14:32

13/04/2021

Fractional moment-preserving initialization schemes for training deep neural networks

Mert Gurbuzbalaban, Yuanhan Hu

3:05

14/06/2020

Controllable Orthogonalization in Training DNNs

Lei Huang, Li Liu, Fan Zhu, Diwen Wan, Zehuan Yuan, Bo Li, Ling Shao

orthogonalization, weight normalization, newtons iteration, dynamic isometry, lipschitz continuity, regularization, orthogonality, deep learning, gans, small batch size

5:00

06/12/2021

Stochastic Solutions for Linear Inverse Problems using the Prior Implicit in a Denoiser

Zahra Kadkhodaie, Eero P Simoncelli

deep learning, self-supervised learning

14:45

06/12/2020

A Statistical Mechanics Framework for Task-Agnostic Sample Design in Machine Learning

Bhavya Kailkhura, Jayaraman Thiagarajan, Qunwei Li, Jize Zhang, Yi Zhou, Timo Bremer

3:21

04/08/2021

Nonparametric Regression with Shallow Overparametrized Neural Networks Trained by GD with Early Stopping

Ilja Kuzborskij, Csaba Szepesvari

15:14

06/12/2021

Revisiting Hilbert-Schmidt Information Bottleneck for Adversarial Robustness

Zifeng Wang, Tong Jian, Aria Masoomi, Stratis Ioannidis, Jennifer Dy

deep learning, robustness, adversarial robustness and security

13:49

26/08/2020

Deterministic Decoding for Discrete Data in Variational Autoencoders

Daniil Polykovskiy, Dmitry Vetrov

9:00

06/12/2021

Self-Supervised Learning with Kernel Dependence Maximization

Yazhe Li, Roman Pogodin, Danica J. Sutherland, Arthur Gretton

machine learning, self-supervised learning, vision, representation learning, kernel methods, semi-supervised learning

11:48

18/07/2021

Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization

Zeke Xie, Li Yuan, Zhanxing Zhu, Masashi Sugiyama

Optimization, Stochastic Optimization

5:17

18/07/2021

Optimal Complexity in Decentralized Training

Yucheng Lu, Christopher De Sa

Optimization, Distributed and Parallel Optimization

19:59

02/02/2021

Norm-Based Generalisation Bounds for Deep Multi-Class Convolutional Neural Networks

Antoine Ledent, Waleed Mustafa, Yunwen Lei, Marius Kloft

19:23

18/07/2021

Tensor Programs IV: Feature Learning in Infinite-Width Neural Networks

Greg Yang, Edward Hu

Theory, Deep learning Theory

5:22

26/04/2020

DivideMix: Learning with Noisy Labels as Semi-supervised Learning

Junnan Li, Richard Socher, Steven C.H. Hoi

label noise, semi-supervised learning

5:00

06/12/2020

Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation

Devavrat Shah, Dogyoon Song, Zhi Xu, Yuzhe Yang

3:22

12/07/2020

Convolutional dictionary learning based auto-encoders for natural exponential-family distributions

Bahareh Tolooshams, Andrew Song, Simona Temereanca, Demba Ba

Deep Learning - Generative Models and Autoencoders

14:49

06/12/2021

On the interplay between data structure and loss function in classification problems

Stéphane d'Ascoli, Marylou Gabrié, Levent Sagun, Giulio Biroli

deep learning, machine learning

8:59

07/09/2020

On the Exploration of Incremental Learning for Fine-grained Image Retrieval

Wei Chen, Yu Liu, Weiping Wang, Tinne Tuytelaars, Erwin M. Bakker, Michael Lew

Incremental learning, Fine-grained image retrieval, Catastrophic forgetting, Maximum Mean Discrepancy

8:32

14/06/2020

A Characteristic Function Approach to Deep Implicit Generative Modeling

Abdul Fatir Ansari, Jonathan Scarlett, Harold Soh

generative adversarial networks, generative models, probability metrics, characteristic functions, unsupervised learning

4:56

06/12/2021

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

machine learning, online learning

14:00

03/05/2021

Initialization and Regularization of Factorized Neural Layers

Misha Khodak, Neil Tenenholtz, Lester Mackey, Nicolo Fusi

matrix factorization, knowledge distillation, multi-head attention, model compression

4:25

26/04/2020

Regularizing activations in neural networks via distribution matching with the Wasserstein metric

Taejong Joo, Donggu Kang, Byunghoon Kim

regularization, Wasserstein metric, deep learning

5:26

14/06/2020

Forward and Backward Information Retention for Accurate Binary Neural Networks

Haotong Qin, Ruihao Gong, Xianglong Liu, Mingzhu Shen, Ziran Wei, Fengwei Yu, Jingkuan Song

model compression, binary neural networks, deep learning, quantization, computer vision

1:00

03/05/2021

A Discriminative Gaussian Mixture Model with Sparsity

Hideaki Hayashi, Seiichi Uchida

classification, Gaussian mixture model, sparse Bayesian learning

4:19

12/07/2020

Analyzing the effect of neural network architecture on training performance

Karthik Abinav Sankararaman, Soham De, Zheng Xu, W. Ronny Huang, Tom Goldstein

Deep Learning - Theory

14:03

13/04/2021

Spectral tensor train parameterization of deep learning layers

Anton Obukhov, Maxim Rakhuba, Alexander Liniger, Zhiwu Huang, Stamatios Georgoulis, Dengxin Dai, Luc Van Gool

3:09

03/05/2021

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Kanil Patel, William H Beluch, Bin Yang, Michael Pfeiffer, Dan Zhang

deep neural networks, histogram binning, post-hoc calibration, uncertainty calibration, mutual information

5:13

06/12/2021

Deeply Shared Filter Bases for Parameter-Efficient Convolutional Neural Networks

Woochul Kang, Daeyeon Kim

deep learning, machine learning, vision

13:17