Robust Pruning at Initialization

03/05/2021

Robust Pruning at Initialization

Soufiane Hayou, Jean-Francois Ton, Arnaud Doucet, Yee Whye Teh

Keywords: Pruning, Compression, Initialization

Abstract Paper Similar Papers

Abstract: Overparameterized Neural Networks (NN) display state-of-the-art performance. However, there is a growing need for smaller, energy-efficient, neural networks to be able to use machine learning applications on devices with limited computational resources. A popular approach consists of using pruning techniques. While these techniques have traditionally focused on pruning pre-trained NN (LeCun et al.,1990; Hassibi et al., 1993), recent work by Lee et al. (2018) has shown promising results when pruning at initialization. However, for Deep NNs, such procedures remain unsatisfactory as the resulting pruned networks can be difficult to train and, for instance, they do not prevent one layer from being fully pruned. In this paper, we provide a comprehensive theoretical analysis of Magnitude and Gradient based pruning at initialization and training of sparse architectures. This allows us to propose novel principled approaches which we validate experimentally on a variety of NN architectures.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

26/04/2020

Towards Fast Adaptation of Neural Architectures with Meta Learning

Dongze Lian, Yin Zheng, Yintao Xu and
Yanxiong Lu, Leyu Lin, Peilin Zhao, Junzhou Huang, Shenghua Gao

Keywords Paper

Fast adaptation, Meta learning, NAS

0

0

0

0

4:55

06/12/2020

The Surprising Simplicity of the Early-Time Learning Dynamics of Neural Networks

Wei Hu, Lechao Xiao, Ben Adlam, Jeffrey Pennington

Keywords Paper

0

0

0

0

3:20

12/07/2020

Training Linear Neural Networks: Non-Local Convergence and Complexity Results

Armin Eftekhari

Keywords Paper

Deep Learning - General

0

0

0

0

14:35

07/09/2020

N2NSkip: Learning Highly Sparse Networks using Neuron-to-Neuron Skip Connections

Arvind Subramaniam, Avinash Sharma

Keywords Paper

model compression, pruning, heat diffusion, Convolutional Neural Networks (CNN), undirected graphs, heat diffusion, skip connections, N2NSkip, scree diagram, connection sensitivity

0

0

0

0

8:47

06/12/2021

GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training

Chen Zhu, Renkun Ni, Zheng Xu and
Kezhi Kong, W. Ronny Huang, Tom Goldstein

Keywords Paper

deep learning, transformers, vision

0

0

0

0

13:17

03/05/2021

Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch

Aojun Zhou, Yukun Ma, Junnan Zhu and
Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li

Keywords Paper

sparsity, efficient training and inference.

0

0

0

0

5:09

14/06/2020

Computing the Testing Error Without a Testing Set

Ciprian A. Corneanu, Sergio Escalera, Aleix M. Martinez

Keywords Paper

deep learning, algebraic topology, generalization, object recognition, facial analysis, semantic segmentation

0

0

0

0

4:43

06/12/2021

NAS-Bench-x11 and the Power of Learning Curves

Shen Yan, Colin White, Yash Savani, Frank Hutter

Keywords Paper

deep learning

0

0

0

0

14:03

12/07/2020

Training Binary Neural Networks using the Bayesian Learning Rule

Xiangming Meng, Roman Bachmann, Mohammad Emtiyaz Khan

Keywords Paper

Deep Learning - General

0

0

0

0

10:27

06/12/2021

Validating the Lottery Ticket Hypothesis with Inertial Manifold Theory

Zeru Zhang, Jiayin Jin, Zijie Zhang and
Yang Zhou, Xin Zhao, Jiaxiang Ren, Ji Liu, Lingfei Wu, Ruoming Jin, Dejing Dou

Keywords Paper

theory, deep learning, optimization

0

0

0

0

14:20

13/04/2021

Benchmarking simulation-based inference

Jan-Matthis Lueckmann, Jan Boelts, David Greenberg and
Pedro Goncalves, Jakob Macke

Keywords Paper

0

0

0

0

3:04

18/07/2021

Dense for the Price of Sparse: Improved Performance of Sparsely Initialized Networks via a Subspace Offset

Ilan Price, Jared Tanner

Keywords Paper

Deep Learning

0

0

0

0

5:42

06/12/2020

Neural Architecture Generator Optimization

Robin Ru, Pedro Esperança, Fabio Maria Carlucci

Keywords Paper

0

0

0

0

3:14

06/12/2021

Learning Transferable Adversarial Perturbations

Krishna kanth Nakka, Mathieu Salzmann

Keywords Paper

deep learning, optimization, adversarial robustness and security

0

0

0

0

12:00

26/04/2020

Effect of Activation Functions on the Training of Overparametrized Neural Nets

Abhishek Panigrahi, Abhishek Shetty, Navin Goyal

Keywords Paper

activation functions, deep learning theory, neural networks

0

0

0

0

5:13

02/02/2021

Large Batch Optimization for Deep Learning Using New Complete Layer-Wise Adaptive Rate Scaling

Zhouyuan Huo, Bin Gu, Heng Huang

Keywords Paper

0

0

0

0

15:17

18/07/2021

Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling

Ozan Özdenizci, Robert Legenstein

Keywords Paper

Algorithms, Adversarial Examples

0

0

0

1

6:27

14/06/2020

Meta-Learning of Neural Architectures for Few-Shot Learning

Thomas Elsken, Benedikt Staffler, Jan Hendrik Metzen, Frank Hutter

Keywords Paper

neural architecture search, meta-learning, automl, few-shot learning, autodl, deep learning

0

0

0

0

5:01

06/12/2020

Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality

Yi Zhang, Orestis Plevrakis, Simon Du and
Xingguo Li, Zhao Song, Sanjeev Arora

Keywords Paper

0

0

0

0

2:56

12/07/2020

Confidence-Aware Learning for Deep Neural Networks

Sangheum Hwang, Jooyoung Moon, Jihyo Kim, Younghak Shin

Keywords Paper

Deep Learning - Algorithms

0

0

0

1

14:05

02/02/2021

Scalable Verification of Quantized Neural Networks

Thomas A. Henzinger, Mathias Lechner, Đorđe Žikelić

Keywords Paper

0

0

0

0

16:58

02/02/2021

Efficient On-Chip Learning for Optical Neural Networks Through Power-Aware Sparse Zeroth-Order Optimization

Jiaqi Gu, Chenghao Feng, Zheng Zhao and
Zhoufeng Ying, Ray T. Chen, David Z. Pan

Keywords Paper

0

0

0

0

15:32

06/12/2021

Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks

Tolga Birdal, Aaron Lou, Leonidas Guibas, Umut Simsekli

Keywords Paper

theory, deep learning, optimization

0

0

0

0

14:38

18/07/2021

HardCoRe-NAS: Hard Constrained diffeRentiable Neural Architecture Search

Niv Nayman, Yonathan Aflalo, Asaf Noy, Lihi Zelnik

Keywords Paper

Deep Learning, Deep Learning, Biologically Plausible Deep Networks; Deep Learning, CNN Architectures, Algorithms, AutoML

0

0

0

0

5:20

26/04/2020

Neural Arithmetic Units

Andreas Madsen, Alexander Rosenberg Johansen

Keywords Paper

0

0

0

0

4:42

18/07/2021

Exponentially Many Local Minima in Quantum Neural Networks

Xuchen You, Xiaodi Wu

Keywords Paper

Deep Learning, Optimization for Deep Networks

0

0

0

0

4:46

03/05/2021

Net-DNF: Effective Deep Modeling of Tabular Data

Liran Katzir, Gal Elidan, Ran El-Yaniv

Keywords Paper

Neural Networks, Predictive Modeling, Tabular Data, Architectures

0

0

0

0

5:10

22/11/2021

FFNB: Forgetting-Free Neural Blocks for Deep Continual Learning

Hichem Sahbi, Haoming Zhan

Keywords Paper

Continual and incremental learning, lifelong learning, catastrophic interference, catastrophic forgetting, dynamic neural networks, visual recognition

0

0

0

0

3:05

06/12/2021

Speedy Performance Estimation for Neural Architecture Search

Robin Ru, Clare Lyle, Lisa Schut and
Miroslav Fil, Mark van der Wilk, Yarin Gal

Keywords Paper

deep learning

0

0

0

0

13:22

03/05/2021

Go with the flow: Adaptive control for Neural ODEs

Mathieu Chalvidal, Matthew Ricci, Rufin VanRullen, Thomas Serre

Keywords Paper

Neural ODEs, Normalizing flows, Hypernetworks, Optimal Control Theory

0

0

0

0

5:03

06/12/2020

Hypersolvers: Toward Fast Continuous-Depth Models

Michael Poli, Stefano Massaroli, Atsushi Yamashita and
Hajime Asama, Jinkyoo Park

Keywords Paper

0

0

0

0

3:16

06/12/2021

Asymptotics of representation learning in finite Bayesian neural networks

Jacob Zavatone-Veth, Abdulkadir Canatar, Ben Ruben, Cengiz Pehlevan

Keywords Paper

deep learning, representation learning

0

0

0

0

14:09

03/05/2021

Training independent subnetworks for robust prediction

Marton Havasi, Rodolphe Jenatton, Stanislav Fort and
Jeremiah Zhe Liu, Jasper Snoek, Balaji Lakshminarayanan, Andrew Dai, Dustin Tran

Keywords Paper

robustness, Efficient ensembles

0

0

0

0

4:10

06/12/2021

Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning

Hayeon Lee, Sewoong Lee, Song Chong, Sung Ju Hwang

Keywords Paper

deep learning, meta learning

0

0

0

0

11:31

18/07/2021

Neural Architecture Search without Training

Joe Mellor, Jack Turner, Amos Storkey, Elliot Crowley

Keywords Paper

Deep Learning, Architectures

0

0

0

1

20:37

03/05/2021

Neural Pruning via Growing Regularization

Huan Wang, Can Qin, Yulun Zhang, Yun Fu

Keywords Paper

deep neural network pruning, regularization, Hessian matrix, model compression

0

0

0

0

6:15

06/12/2020

Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and Robustness

Long Zhao, Ting Liu, Xi Peng, Dimitris Metaxas

Keywords Paper

0

0

0

0

3:22

06/12/2020

Learning Parities with Neural Networks

Amit Daniely, Eran Malach

Keywords Paper

0

0

0

0

3:21

05/04/2021

Lost in Pruning: The Effects of Pruning Neural Networks beyond Test Accuracy

Lucas Liebenwein, Cenk Baykal, Brandon Carter and
David Gifford, Daniela Rus

Keywords Paper

0

0

0

0

20:21

05/04/2021

Lost in Pruning: The Effects of Pruning Neural Networks beyond Test Accuracy

Lucas Liebenwein, Cenk Baykal, Brandon Carter and
David Gifford, Daniela Rus

Keywords Paper

0

0

0

0

5:51