Abstract:
We propose layer saturation, a simple, online-computable method for analyzing information processing in neural networks. First, we show that a layer's output can be restricted to an eigenspace of its covariance matrix without performance loss. We propose a computationally lightweight method that approximates the covariance matrix during training. From the dimension of this relevant eigenspace we derive layer saturation: the ratio between the eigenspace dimension and the layer width. We show evidence that saturation indicates which layers contribute to network performance. We demonstrate how to alter layer saturation in a neural network by changing network depth, filter sizes and input resolution. Finally, we show that pathological patterns of saturation are indicative of parameter inefficiencies caused by a mismatch between input resolution and network architecture.
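To make the definition concrete, the following is a minimal NumPy sketch of the idea, not the paper's implementation: the covariance of a layer's outputs is accumulated online from running sums, and saturation is the fraction of eigendirections needed to explain a fixed share of the output variance. The variance threshold `delta` and all function names here are illustrative assumptions.

```python
import numpy as np

def running_covariance(batches):
    """Accumulate sums online so the covariance can be updated batch by
    batch during training, without storing all layer outputs."""
    n, s, ss = 0, None, None
    for x in batches:                 # x: (batch_size, layer_width)
        if s is None:
            s = np.zeros(x.shape[1])
            ss = np.zeros((x.shape[1], x.shape[1]))
        n += x.shape[0]
        s += x.sum(axis=0)            # running sum of outputs
        ss += x.T @ x                 # running sum of outer products
    mean = s / n
    return ss / n - np.outer(mean, mean)

def layer_saturation(cov, delta=0.99):
    """Saturation = (number of eigendirections needed to explain a
    delta fraction of the output variance) / layer width."""
    eigvals = np.linalg.eigvalsh(cov)[::-1]          # descending order
    eigvals = np.clip(eigvals, 0.0, None)            # guard numerical noise
    explained = np.cumsum(eigvals) / eigvals.sum()
    k = int(np.searchsorted(explained, delta)) + 1   # relevant eigenspace dim
    return k / cov.shape[0]

# Example: a 64-unit layer whose outputs vary in only ~8 directions
rng = np.random.default_rng(0)
latent = rng.normal(size=(1000, 8)) @ rng.normal(size=(8, 64))
cov = running_covariance(latent[i:i + 100] for i in range(0, 1000, 100))
print(layer_saturation(cov))  # close to 8/64, i.e. a low-saturation layer
```

Because only the running sums are stored, the covariance (and hence saturation) can be updated cheaply at any point during training, matching the online-computability claimed above.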