NeuralScale: Efficient Scaling of Neurons for Resource-Constrained Deep Neural Networks

14/06/2020

Eugene Lee, Chen-Yi Lee

Keywords: neural architecture search, pruning, inductive bias, neural network, filters, neurons, optimization, hyperparameter selection, resource constraint, hardware

Abstract: Deciding the number of neurons during the design of a deep neural network to maximize performance is not intuitive. In this work, we search for the neuron (filter) configuration of a fixed network architecture that maximizes accuracy. Using iterative pruning methods as a proxy, we parametrize the change in the number of neurons (filters) of each layer with respect to the change in the number of parameters, allowing us to efficiently scale an architecture to arbitrary sizes. We also introduce architecture descent, which iteratively refines the parametrized function used for model scaling. We call the combination of the two methods NeuralScale. To demonstrate the parameter efficiency of NeuralScale, we report experiments on VGG11, MobileNetV2 and ResNet18 using CIFAR10, CIFAR100 and TinyImageNet as benchmark datasets. Our results show accuracy increases of 3.04%, 8.56% and 3.41% for VGG11, MobileNetV2 and ResNet18 on CIFAR10, CIFAR100 and TinyImageNet, respectively, under a parameter-constrained setting (output neurons (filters) of the default configuration with a scaling factor of 0.25).
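The parametrization step described in the abstract can be illustrated with a small sketch. The abstract does not state the functional form, so the code below assumes a per-layer power law (filters ≈ alpha * tau^beta, with tau the total parameter count) fitted by least squares in log space. The pruning snapshots are made-up numbers, the toy network is a plain stack of 3x3 convolutions, and every function name is hypothetical. It is not the authors' implementation.

```python
import numpy as np

KERNEL = 3       # assumption: 3x3 convolutions throughout the toy network
IN_CHANNELS = 3  # assumption: RGB input


def conv_params(widths):
    """Weight count of a plain stack of KERNEL x KERNEL convs (biases ignored)."""
    chans = [IN_CHANNELS] + [int(w) for w in widths]
    return sum(KERNEL * KERNEL * cin * cout
               for cin, cout in zip(chans[:-1], chans[1:]))


def fit_power_law(param_counts, filter_counts):
    """Fit filters_l ~ alpha_l * params**beta_l for each layer l.

    A straight-line fit in log-log space gives slope beta_l and
    intercept log(alpha_l).
    """
    log_p = np.log(np.asarray(param_counts, dtype=float))
    alphas, betas = [], []
    for layer in np.asarray(filter_counts, dtype=float).T:
        beta, log_alpha = np.polyfit(log_p, np.log(layer), 1)
        alphas.append(np.exp(log_alpha))
        betas.append(beta)
    return np.array(alphas), np.array(betas)


def widths_at(tau, alphas, betas):
    """Per-layer filter counts predicted at scale tau (at least 1 per layer)."""
    return np.maximum(1, np.round(alphas * tau ** betas)).astype(int)


def scale_to_budget(target, alphas, betas, lo=1.0, hi=1e9, iters=60):
    """Bisect on tau (in log space) so the parameter count stays <= target.

    Assumes all betas are positive, so the parameter count does not
    decrease as tau grows, and that tau = lo already fits the budget.
    """
    for _ in range(iters):
        mid = np.sqrt(lo * hi)
        if conv_params(widths_at(mid, alphas, betas)) > target:
            hi = mid
        else:
            lo = mid
    return widths_at(lo, alphas, betas)


# Hypothetical filter counts per layer after successive pruning rounds.
snapshots = [[64, 128, 256], [48, 112, 240], [32, 90, 200], [20, 60, 150]]
param_counts = [conv_params(s) for s in snapshots]

alphas, betas = fit_power_law(param_counts, snapshots)
budget = 100_000
widths = scale_to_budget(budget, alphas, betas)
print("widths:", widths.tolist(), "params:", conv_params(widths))
```

Unlike a uniform scaling factor such as the 0.25 baseline mentioned in the abstract, which shrinks every layer by the same ratio, a fitted curve of this kind lets each layer shrink at its own rate under the same parameter budget.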

The talk and the accompanying paper were published at the CVPR 2020 virtual conference.

Similar Papers

06/12/2020

Adapting Neural Architectures Between Domains

Yanxi Li, Zhaohui Yang, Yunhe Wang, Chang Xu

3:20

06/12/2020

BRP-NAS: Prediction-based NAS using GCNs

Lukasz Dudziak, Thomas Chau, Mohamed Abdelfattah, Royson Lee, Hyeji Kim, Nicholas Lane

3:20

02/02/2021

Liquid Time-constant Networks

Ramin Hasani, Mathias Lechner, Alexander Amini, Daniela Rus, Radu Grosu

16:02

18/07/2021

Scaling Properties of Deep Residual Networks

Alain-Sam Cohen, Rama Cont, Alain Rossier, Renyuan Xu

Keywords: Theory, Deep learning Theory

5:20

03/05/2021

Estimating informativeness of samples with Smooth Unique Information

Hrayr Harutyunyan, Alessandro Achille, Giovanni Paolini, Orchid Majumder, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

Keywords: dataset summarization, ntk, stability theory, sample information, information theory

6:05

06/12/2020

Revisiting Parameter Sharing for Automatic Neural Channel Number Search

Jiaxing Wang, Haoli Bai, Jiaxiang Wu, Xupeng Shi, Junzhou Huang, Irwin King, Michael Lyu, Jian Cheng

3:17

13/04/2021

Fast adaptation with linearized neural networks

Wesley Maddox, Shuai Tang, Pablo Moreno, Andrew Gordon Wilson, Andreas Damianou

3:13

02/02/2021

Learning Interpretable Models for Coupled Networks Under Domain Constraints

Hongyuan You, Sikun Lin, Ambuj Singh

16:47

06/12/2020

Rational neural networks

Nicolas Boulle, Yuji Nakatsukasa, Alex J Townsend

3:17

14/06/2020

GP-NAS: Gaussian Process Based Neural Architecture Search

Zhihang Li, Teng Xi, Jiankang Deng, Gang Zhang, Shengzhao Wen, Ran He

Keywords: neural architecture search, gaussian process, image classification, face recognition

0:59

03/05/2021

The Recurrent Neural Tangent Kernel

Sina Alemohammad, Jack Wang, Randall Balestriero, Richard Baraniuk

Keywords: Gaussian Process, Recurrent Neural Network, Neural Tangent Kernel, Overparameterization

4:44

06/12/2021

HNPE: Leveraging Global Parameters for Neural Posterior Estimation

Pedro Rodrigues, Thomas Moreau, Gilles Louppe, Alexandre Gramfort

Keywords: neuroscience, generative model

14:37

06/12/2020

Benchmarking Deep Inverse Models over time, and the Neural-Adjoint method

Ben Ren, Willie Padilla, Jordan Malof

3:17

13/04/2021

DebiNet: Debiasing linear models with nonlinear overparameterized neural networks

Shiyun Xu, Zhiqi Bu

2:56

06/12/2020

Factorized Neural Processes for Neural Processes: K-Shot Prediction of Neural Responses

Ronald (James) Cotton, Fabian Sinz, Andreas Tolias

3:18

02/02/2021

High Dimensional Level Set Estimation with Bayesian Neural Network

Huong Ha, Sunil Gupta, Santu Rana, Svetha Venkatesh

19:14

02/02/2021

Anytime Inference with Distilled Hierarchical Neural Ensembles

Adria Ruiz, Jakob Verbeek

15:09

02/02/2021

Training Spiking Neural Networks with Accumulated Spiking Flow

Hao Wu, Yueyi Zhang, Wenming Weng, Yongting Zhang, Zhiwei Xiong, Zheng-Jun Zha, Xiaoyan Sun, Feng Wu

16:45

05/01/2021

Holistic Filter Pruning for Efficient Deep Neural Networks

Lukas Enderich, Fabian Timm, Wolfram Burgard

5:00

06/12/2021

Directed Spectrum Measures Improve Latent Network Models Of Neural Populations

Neil Gallagher, Kafui Dzirasa, David Carlson

Keywords: neuroscience

11:43

13/04/2021

Approximating Lipschitz continuous functions with GroupSort neural networks

Ugo Tanielian, Gerard Biau

3:02

03/05/2021

Optimal Conversion of Conventional Artificial Neural Networks to Spiking Neural Networks

Shikuang Deng, Shi Gu

Keywords: second-order approximation, weight balance, spiking neural network

5:24

06/12/2020

Network Diffusions via Neural Mean-Field Dynamics

Shushan He, Hongyuan Zha, Xiaojing Ye

3:21

06/12/2021

Neural Tangent Kernel Maximum Mean Discrepancy

Xiuyuan Cheng, Yao Xie

Keywords: theory, deep learning, kernel methods

13:11

14/06/2020

Network Adjustment: Channel Search Guided by FLOPs Utilization Ratio

Zhengsu Chen, Jianwei Niu, Lingxi Xie, Xuefeng Liu, Longhui Wei, Qi Tian

Keywords: channel search, flops utilization ratio, network architecture search, network pruning, channel number, convolutional neural networks, network architecture, computer vision, deep learning

1:01

30/11/2020

Channel Pruning for Accelerating Convolutional Neural Networks via Wasserstein Metric

Haoran Duan, Hui Li

5:23

06/12/2020

Collegial Ensembles

Etai Littwin, Ben Myara, Sima Sabah, Joshua Susskind, Shuangfei Zhai, Oren Golan

3:17

06/12/2021

Deep inference of latent dynamics with spatio-temporal super-resolution using selective backpropagation through time

Feng Zhu, Andrew Sedler, Harrison A Grier, Nauman Ahad, Mark Davenport, Matthew Kaufman, Andrea Giovannucci, Chethan Pandarinath

Keywords: deep learning, neuroscience, generative model

7:16

14/06/2020

AdaBits: Neural Network Quantization With Adaptive Bit-Widths

Qing Jin, Linjie Yang, Zhenyu Liao

Keywords: neural network quantization, adaptive model, model compression

1:01

06/12/2021

Parametric Complexity Bounds for Approximating PDEs with Neural Networks

Tanya Marwah, Zachary Lipton, Andrej Risteski

Keywords: theory, deep learning, optimization

12:32

18/07/2021

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Jianfei Chen, Lianmin Zheng, Zhewei Yao, Dequan Wang, Ion Stoica, Michael Mahoney, Joseph E Gonzalez

Keywords: Algorithms, Large Scale Learning

18:54

06/12/2020

Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach

Luofeng Liao, You-Lin Chen, Zhuoran Yang, Bo Dai, Mladen Kolar, Zhaoran Wang

Keywords: Theory -> Information Theory, Algorithms -> Stochastic Methods

3:23

06/12/2020

Delta-STN: Efficient Bilevel Optimization for Neural Networks using Structured Response Jacobians

Juhan Bae, Roger Grosse

3:20

19/08/2021

Neural Architecture Search of SPD Manifold Networks

Rhea Sanjay Sukthanker, Zhiwu Huang, Suryansh Kumar, Erik Goron Endsjo, Yan Wu, Luc Van Gool

Keywords: Machine Learning, Classification, Deep Learning, Networks

14:16

06/12/2021

Precise characterization of the prior predictive distribution of deep ReLU networks

Lorenzo Noci, Gregor Bachmann, Kevin Roth, Sebastian Nowozin, Thomas Hofmann

Keywords: deep learning

14:26

20/07/2020

Gating creates slow modes and controls phase-space complexity in GRUs and LSTMs

Tankut Can, Kamesh Krishnamurthy, David J. Schwab

21:00

06/12/2020

Online Neural Connectivity Estimation with Noisy Group Testing

Anne Draelos, John Pearson

3:19

06/12/2020

Model Fusion via Optimal Transport

Sidak Pal Singh, Martin Jaggi

3:10

06/12/2021

Distributional Gradient Matching for Learning Uncertain Neural Dynamics Models

Lenart Treven, Philippe Wenk, Florian Dorfler, Andreas Krause

Keywords: deep learning, reinforcement learning and planning, kernel methods, active learning

14:46

03/05/2021

Prediction and generalisation over directed actions by grid cells

Changmin Yu, Timothy Behrens, Neil Burgess

Keywords: grid cells, Computational neuroscience, normative models

5:42