22/11/2021

UWC: Unit-wise Calibration Towards Rapid Network Compression

Chen Lin, Zheyang Li, Bo Peng, Wenming Tan, Ye Ren, Shiliang Pu

Keywords: post-training quantization

Abstract: This paper introduces a post-training quantization (PTQ) method that achieves highly efficient Convolutional Neural Network (CNN) quantization with high performance. Previous PTQ methods usually reduce compression error by performing layer-by-layer parameter calibration. However, given the limited representational ability of extremely compressed parameters (e.g., when the bit-width drops below 4), it is hard to eliminate all of the layer-wise errors. This work addresses the issue by proposing a unit-wise feature reconstruction algorithm based on an observation from a second-order Taylor series expansion of the unit-wise error: leveraging the interaction between adjacent layers' parameters can compensate for layer-wise errors more effectively. We define several adjacent layers as a Basic-Unit and present a unit-wise post-training algorithm that minimizes the quantization error of each unit. The method achieves near-original accuracy on ImageNet and COCO when quantizing FP32 models to INT4 and INT3.
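As a rough illustration of the unit-wise reconstruction idea described in the abstract, the sketch below groups adjacent convolution layers into a Basic-Unit and jointly tunes their fake-quantized weights so that the unit's output matches the full-precision unit's output. This is a minimal sketch under stated assumptions, not the authors' algorithm: the uniform symmetric quantizer, fixed per-tensor scales, straight-through estimator, Adam optimizer, and MSE loss are all illustrative choices, and the names FakeQuantConv2d and calibrate_unit are hypothetical.

import copy
import torch
import torch.nn as nn

class FakeQuantConv2d(nn.Module):
    """Conv2d whose weights are uniformly fake-quantized on the fly (illustrative)."""
    def __init__(self, conv, num_bits=4):
        super().__init__()
        self.conv = conv
        self.num_bits = num_bits
        qmax = 2 ** (num_bits - 1) - 1
        # fixed per-tensor scale from the weight range (a simplifying assumption)
        self.register_buffer("scale", conv.weight.detach().abs().max() / qmax)

    def forward(self, x):
        qmax = 2 ** (self.num_bits - 1) - 1
        w = self.conv.weight
        q = torch.clamp(torch.round(w / self.scale), -qmax - 1, qmax) * self.scale
        # straight-through estimator: quantized forward pass, identity backward pass
        w_q = w + (q - w).detach()
        return nn.functional.conv2d(x, w_q, self.conv.bias,
                                    self.conv.stride, self.conv.padding)

def calibrate_unit(fp_unit, calib_batches, num_bits=4, steps=100, lr=1e-3):
    """Jointly tune all conv layers of one Basic-Unit so the quantized unit's
    output reconstructs the full-precision unit's output; optimizing the unit
    as a whole lets later layers compensate quantization errors of earlier ones."""
    q_unit = nn.Sequential(*[FakeQuantConv2d(copy.deepcopy(m), num_bits)
                             for m in fp_unit])
    opt = torch.optim.Adam(q_unit.parameters(), lr=lr)
    for _ in range(steps):
        for x in calib_batches:  # small unlabeled calibration set
            with torch.no_grad():
                target = fp_unit(x)  # full-precision unit output
            loss = nn.functional.mse_loss(q_unit(x), target)
            opt.zero_grad()
            loss.backward()
            opt.step()
    return q_unit

# Toy usage: calibrate a two-conv Basic-Unit on random data.
unit = nn.Sequential(nn.Conv2d(8, 8, 3, padding=1), nn.Conv2d(8, 8, 3, padding=1))
data = [torch.randn(4, 8, 16, 16) for _ in range(4)]
q_unit = calibrate_unit(unit, data, num_bits=4)

The key difference from layer-by-layer calibration is the reconstruction target: the loss compares outputs at the unit boundary rather than after each individual layer, which is what allows cross-layer error compensation within the unit.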

The talk and paper were published at the BMVC 2021 virtual conference.
