Dirichlet pruning for convolutional neural networks

13/04/2021

Kamil Adamczewski, Mijung Park

Abstract: We introduce Dirichlet pruning, a novel post-processing technique that transforms a large neural network model into a compressed one. Dirichlet pruning is a form of structured pruning that places a Dirichlet distribution over each layer's channels in convolutional layers (or neurons in fully connected layers) and learns the parameters of this distribution using variational inference. The learnt parameters allow us to informatively and intuitively remove unimportant units, resulting in a compact architecture that contains only the features crucial for the task at hand. The method has a low GPU footprint, as the number of parameters is linear in the number of channels (or neurons), and training requires as little as one epoch to converge. We perform extensive experiments, in particular on larger architectures such as VGG and WideResNet (94
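The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the per-channel Dirichlet concentration parameters have already been learnt (the variational inference step is omitted), and it assumes that channels are ranked by the Dirichlet mean, alpha_k / sum(alpha). The function names `dirichlet_mean` and `select_channels`, the `keep` budget, and the `alpha` values are all hypothetical.

```python
def dirichlet_mean(alpha):
    """Mean of a Dirichlet(alpha) distribution: alpha_k / sum(alpha)."""
    total = sum(alpha)
    return [a / total for a in alpha]


def select_channels(alpha, keep):
    """Return the sorted indices of the `keep` channels with the largest mean.

    Assumption: the Dirichlet mean is used as the importance score.
    """
    importance = dirichlet_mean(alpha)
    ranked = sorted(range(len(alpha)), key=lambda k: importance[k], reverse=True)
    return sorted(ranked[:keep])


# Hypothetical learnt concentration parameters for a layer with 6 channels.
# One parameter per channel, so the count is linear in the number of channels.
alpha = [0.2, 3.0, 0.1, 1.5, 0.4, 2.8]

kept = select_channels(alpha, keep=3)
print(kept)  # indices of the channels to retain
```

With the indices in `kept`, a pruned layer would be built by copying only the corresponding filters (and the matching input slices of the next layer) into a smaller architecture.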

The talk and the corresponding paper were published at the AISTATS 2021 virtual conference.

Similar Papers

12/07/2020

Operation-Aware Soft Channel Pruning using Differentiable Masks

Minsoo Kang, Bohyung Han

Keywords: Applications - Computer Vision

14:56

13/04/2021

LassoNet: Neural networks with feature sparsity

Ismael Lemhadri, Feng Ruan, Rob Tibshirani

3:13

02/02/2021

Post-training Quantization with Multiple Points: Mixed Precision without Mixed Precision

Xingchao Liu, Mao Ye, Dengyong Zhou, Qiang Liu

15:18

06/12/2021

Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM)

Jie Bu, Arka Daw, M. Maruf, Anuj Karpatne

Keywords: deep learning, machine learning, vision, graph learning, representation learning

13:59

06/12/2020

A meta-learning approach to (re)discover plasticity rules that carve a desired function into a neural network

Basile Confavreux, Friedemann Zenke, Everton Agnes, Timothy Lillicrap, Tim Vogels

3:25

13/04/2021

Neural function modules with sparse arguments: A dynamic approach to integrating information across layers

Alex Lamb, Anirudh Goyal, Agnieszka Słowik, Michael Mozer, Philippe Beaudoin, Yoshua Bengio

3:01

03/05/2021

Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch

Aojun Zhou, Yukun Ma, Junnan Zhu, Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li

Keywords: sparsity, efficient training and inference

5:09

03/05/2021

BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction

Yuhang Li, Ruihao Gong, Xu Tan, Yang Yang, Peng Hu, Qi Zhang, Fengwei Yu, Wei Wang, Shi Gu

Keywords: Second-order analysis, Mixed Precision, Post Training Quantization

4:36

06/12/2021

Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization

Yusuke Iwasawa, Yutaka Matsuo

Keywords: deep learning, optimization, transformers, domain adaptation

13:50

30/11/2020

Regularizing Meta-Learning via Gradient Dropout

Hung-Yu Tseng, Yi-Wen Chen, Yi-Hsuan Tsai, Sifei Liu, Yen-Yu Lin, Ming-Hsuan Yang

3:21

06/12/2021

Training Feedback Spiking Neural Networks by Implicit Differentiation on the Equilibrium State

Mingqing Xiao, Qingyan Meng, Zongpeng Zhang, Yisen Wang, Zhouchen Lin

Keywords: deep learning

12:22

18/07/2021

Sparsifying Networks via Subdifferential Inclusion

Sagar Verma, Jean-Christophe Pesquet

Keywords: Optimization, Convex Optimization

5:10

06/12/2021

Meta-Learning Sparse Implicit Neural Representations

Jaeho Lee, Jihoon Tack, Namhoon Lee, Jinwoo Shin

Keywords: deep learning, optimization, meta learning, representation learning

8:41

06/12/2021

Self-Supervised Learning of Event-Based Optical Flow with Spiking Neural Networks

Jesse Hagenaars, Federico Paredes-Valles, Guido de Croon

Keywords: deep learning, optimization, self-supervised learning

13:28

03/05/2021

Net-DNF: Effective Deep Modeling of Tabular Data

Liran Katzir, Gal Elidan, Ran El-Yaniv

Keywords: Neural Networks, Predictive Modeling, Tabular Data, Architectures

5:10

03/05/2021

Go with the flow: Adaptive control for Neural ODEs

Mathieu Chalvidal, Matthew Ricci, Rufin VanRullen, Thomas Serre

Keywords: Neural ODEs, Normalizing flows, Hypernetworks, Optimal Control Theory

5:03

18/07/2021

Selfish Sparse RNN Training

Shiwei Liu, Decebal Constantin Mocanu, Yulong Pei, Mykola Pechenizkiy

Keywords: Deep Learning, Optimization for Deep Networks

4:58

05/01/2021

Group Softmax Loss With Discriminative Feature Grouping

Takumi Kobayashi

4:49

06/12/2021

Only Train Once: A One-Shot Neural Network Training And Pruning Framework

Tianyi Chen, Bo Ji, Tianyu Ding, Biyi Fang, Guanyi Wang, Zhihui Zhu, Luming Liang, Yixin Shi, Sheng Yi, Xiao Tu

Keywords: deep learning, optimization, reinforcement learning and planning

12:53

06/12/2020

ISTA-NAS: Efficient and Consistent Neural Architecture Search by Sparse Coding

Yibo Yang, Hongyang Li, Shan You, Fei Wang, Chen Qian, Zhouchen Lin

3:19

14/06/2020

Orthogonal Convolutional Neural Networks

Jiayun Wang, Yubei Chen, Rudrasis Chakraborty, Stella X. Yu

Keywords: orthogonal convolution, orthogonality, regularization, filter redundancy, robustness, classification, retrieval, semi-supervised, gans, inpainting

1:00

14/09/2020

Finding the Optimal Network Depth in Classification Tasks

Bartosz Wójcik, Maciej Wołczyk, Klaudia Bałazy, Jacek Tabor

Keywords: model compression and acceleration, multi-head networks

8:13

12/07/2020

Communication-Efficient Distributed Stochastic AUC Maximization with Deep Neural Networks

Zhishuai Guo, Mingrui Liu, Zhuoning Yuan, Li Shen, Wei Liu, Tianbao Yang

Keywords: Optimization - Large Scale, Parallel and Distributed

14:42

06/12/2020

Kernelized information bottleneck leads to biologically plausible 3-factor Hebbian learning in deep networks

Roman Pogodin, Peter E Latham

Keywords: Deep Learning -> Adversarial Networks, Algorithms -> Semi-Supervised Learning

2:30

09/07/2020

Implicit Bias of Gradient Descent for Wide Two-layer Neural Networks Trained with the Logistic Loss

Lénaïc Chizat, Francis Bach

Keywords: Neural networks/deep learning, Non-convex optimization

14:41

06/12/2020

Model Fusion via Optimal Transport

Sidak Pal Singh, Martin Jaggi

3:10

26/04/2020

Data-Independent Neural Pruning via Coresets

Ben Mussay, Margarita Osadchy, Vladimir Braverman, Samson Zhou, Dan Feldman

Keywords: coresets, neural pruning, network compression

4:23

30/11/2020

Channel Pruning for Accelerating Convolutional Neural Networks via Wasserstein Metric

Haoran Duan, Hui Li

5:23

03/05/2021

How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks

Keyulu Xu, Mozhi Zhang, Jingling Li, Simon Du, Ken-Ichi Kawarabayashi, Stefanie Jegelka

Keywords: graph neural networks, out-of-distribution, deep learning, extrapolation, deep learning theory

17:06

26/04/2020

Stochastic Conditional Generative Networks with Basis Decomposition

Ze Wang, Xiuyuan Cheng, Guillermo Sapiro, Qiang Qiu

4:00

06/12/2020

Towards Understanding Hierarchical Learning: Benefits of Neural Representations

Minshuo Chen, Yu Bai, Jason Lee, Tuo Zhao, Huan Wang, Caiming Xiong, Richard Socher

3:07

06/12/2021

Dynamic Resolution Network

Mingjian Zhu, Kai Han, Enhua Wu, Qiulin Zhang, Ying Nie, Zhenzhong Lan, Yunhe Wang

Keywords: deep learning

9:38

12/07/2020

Finding trainable sparse networks through Neural Tangent Transfer

Tianlin Liu, Friedemann Zenke

Keywords: Deep Learning - Algorithms

13:43

06/12/2020

The Surprising Simplicity of the Early-Time Learning Dynamics of Neural Networks

Wei Hu, Lechao Xiao, Ben Adlam, Jeffrey Pennington

3:20

06/12/2021

Sequence-to-Sequence Learning with Latent Neural Grammars

Yoon Kim

Keywords: deep learning

14:31

02/02/2021

Longitudinal Deep Kernel Gaussian Process Regression

Junjie Liang, Yanting Wu, Dongkuan Xu, Vasant G Honavar

16:27

12/07/2020

MetaFun: Meta-Learning with Iterative Functional Updates

Jin Xu, Jean-Francois Ton, Hyunjik Kim, Adam Kosiorek, Yee Whye Teh

Keywords: Transfer, Multitask and Meta-learning

13:51

18/07/2021

Deep Learning for Functional Data Analysis with Adaptive Basis Layers

Junwen Yao, Jonas Mueller, Jane-Ling Wang

Keywords: Deep Learning

5:11

06/12/2021

Localization with Sampling-Argmax

Jiefeng Li, Tong Chen, Ruiqi Shi, Yujing Lou, Yong-Lu Li, Cewu Lu

Keywords: deep learning

9:29

03/05/2021

On the geometry of generalization and memorization in deep neural networks

Cory Stephenson, Suchi Padhy, Abhinav Ganesh, Yue Hui, Hanlin Tang, SueYeon Chung

Keywords: deep learning theory, representation learning, double descent, statistical physics methods

5:14