02/02/2021

Optimizing Information Theory Based Bitwise Bottlenecks for Efficient Mixed-Precision Activation Quantization

Xichuan Zhou, Kui Liu, Cong Shi, Haijun Liu, Ji Liu

Keywords:

Abstract: Recent research on information theory sheds new light on continuing attempts to open the black box of neural signal encoding. Inspired by the problem of lossy signal compression for wireless communication, this paper presents a Bitwise Bottleneck approach for quantizing and encoding neural network activations. Based on rate-distortion theory, the Bitwise Bottleneck attempts to determine the most significant bits in the activation representation by assigning and approximating the sparse coefficients associated with different bits. Given the constraint of a limited average code rate, the bottleneck minimizes the distortion for optimal activation quantization in a flexible layer-by-layer manner. Experiments on ImageNet and other datasets show that, by minimizing the quantization distortion of each layer, the neural network with bottlenecks achieves state-of-the-art accuracy with low-precision activations. Meanwhile, by reducing the code rate, the proposed method improves memory and computational efficiency more than sixfold compared with a deep neural network using standard single-precision representation. The source code is available on GitHub: https://github.com/CQUlearningsystemgroup/BitwiseBottleneck.
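To make the idea in the abstract concrete, below is a minimal NumPy sketch of a rate-constrained bitwise bottleneck: activations are uniformly quantized, decomposed into bit planes, and a sparse per-bit coefficient vector is fit by least squares, with only the planes allowed by the rate budget retained. The function name, the uniform quantizer, and the least-squares fit are illustrative assumptions, not the authors' implementation; see the linked GitHub repository for the actual code.

```python
import numpy as np

def bitwise_bottleneck_sketch(x, num_bits=8, rate_budget=4):
    """Illustrative sketch only (not the paper's implementation).

    1. Uniformly quantize activations to `num_bits` bits.
    2. Decompose the quantized codes into bit planes.
    3. Fit one coefficient per bit plane by least squares and keep only the
       `rate_budget` planes that contribute most, approximating the
       rate-constrained distortion minimization described in the abstract.
    """
    x = np.asarray(x, dtype=np.float64)
    scale = x.max() / (2 ** num_bits - 1) + 1e-12
    q = np.clip(np.round(x / scale), 0, 2 ** num_bits - 1).astype(np.int64)

    # Bit-plane decomposition: column b holds bit b of every activation.
    planes = np.stack([(q >> b) & 1 for b in range(num_bits)], axis=-1)
    A = planes.reshape(-1, num_bits).astype(np.float64)

    # Least-squares coefficients, one per bit plane.
    coeffs, *_ = np.linalg.lstsq(A, x.reshape(-1), rcond=None)

    # Enforce the rate budget: zero out the least significant contributions.
    contribution = np.abs(coeffs) * A.mean(axis=0)
    keep = np.argsort(contribution)[::-1][:rate_budget]
    sparse = np.zeros_like(coeffs)
    sparse[keep] = coeffs[keep]

    # Reconstruct activations and measure the quantization distortion.
    x_hat = A @ sparse
    distortion = np.mean((x.reshape(-1) - x_hat) ** 2)
    return x_hat.reshape(x.shape), sparse, distortion

if __name__ == "__main__":
    acts = np.random.rand(1, 16, 8, 8) * 6.0  # ReLU-like activations
    x_hat, coeffs, mse = bitwise_bottleneck_sketch(acts, num_bits=8, rate_budget=4)
    print("kept bit planes:", np.nonzero(coeffs)[0], "MSE:", mse)
```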

The video of this talk cannot be embedded here. You can watch it at:
https://slideslive.com/38947769
The talk and the corresponding paper were presented at the AAAI 2021 virtual conference.
