Bit Error Robustness for Energy-Efficient DNN Accelerators

05/04/2021

Bit Error Robustness for Energy-Efficient DNN Accelerators

David Stutz, Nandhini Chandramoorthy, Matthias Hein, Bernt Schiele

Keywords:

Abstract Paper Similar Papers

Abstract: Deep neural network (DNN) accelerators received considerable attention in past years due to saved energy compared to mainstream hardware. Low-voltage operation of DNN accelerators allows to further reduce energy consumption significantly, however, causes bit-level failures in the memory storing the quantized DNN weights. In this paper, we show that a combination of robust fixed-point quantization, weight clipping, and random bit error training (RandBET) improves robustness against random bit errors in (quantized) DNN weights significantly. This leads to high energy savings from both low-voltage operation as well as low-precision quantization. Our approach generalizes across operating voltages and accelerators, as demonstrated on bit errors from profiled SRAM arrays. We also discuss why weight clipping alone is already a quite effective way to achieve robustness against bit errors. Moreover, we specifically discuss the involved trade-offs regarding accuracy, robustness and precision: Without losing more than 1% in accuracy compared to a normally trained 8-bit DNN, we can reduce energy consumption on CIFAR-10 by 20%. Higher energy savings of, e.g., 30%, are possible at the cost of 2.5% accuracy, even for 4-bit DNNs.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38952761

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at MLSYS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Sparse Spiking Gradient Descent

Nicolas Perez-Nieves, Dan Goodman

Keywords Paper

deep learning, optimization

0

0

0

0

14:54

02/02/2021

Fast and Compact Bilinear Pooling by Shifted Random Maclaurin

Tan Yu, Xiaoyun Li, Ping Li

Keywords Paper

0

0

0

0

14:24

12/07/2020

Boosting Deep Neural Network Efficiency with Dual-Module Inference

Liu Liu, Lei Deng, Zhaodong Chen and
yuke wang, Shuangchen Li, Jingwei Zhang, Yihua Yang, Zhenyu Gu, Yufei Ding, Yuan Xie

Keywords Paper

Deep Learning - General

0

0

0

0

8:04

06/12/2021

Boost Neural Networks by Checkpoints

Feng Wang, Guoyizhe Wei, Qiao Liu and
Jinxiang Ou, xian wei, Hairong Lv

Keywords Paper

deep learning

1

0

0

0

4:45

12/07/2020

Efficient and Scalable Bayesian Neural Nets with Rank-1 Factors

Mike Dusenberry, Ghassen Jerfel, Yeming Wen and
Yian Ma, Jasper Snoek, Katherine Heller, Balaji Lakshminarayanan, Dustin Tran

Keywords Paper

Deep Learning - General

0

0

0

1

14:29

03/05/2021

A Block Minifloat Representation for Training Deep Neural Networks

Sean Fox, Seyedramin Rasoulinezhad, Julian Faraone and
david boland, Philip Leong

Keywords Paper

0

0

0

0

5:15

30/11/2020

An Efficient Group Feature Fusion Residual Network for Image Super-Resolution

Pengcheng Lei, Cong Liu

Keywords Paper

0

0

0

0

4:05

12/07/2020

Maximum-and-Concatenation Networks

Xingyu Xie, Hao Kong, Jianlong Wu and
Wayne Zhang, Guangcan Liu, Zhouchen Lin

Keywords Paper

Deep Learning - Theory

0

0

0

0

14:05

06/12/2020

Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification

Yulin Wang, Kangchen Lv, Rui Huang and
Shiji Song, Le Yang, Gao Huang

Keywords Paper

0

0

0

0

3:23

05/04/2021

VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference

Steve Dai, Rangha Venkatesan, Mark Ren and
Brian Zimmer, William Dally, Brucek Khailany

Keywords Paper

Deep Learning -> Generative Models, Algorithms -> Similarity and Distance Learning

0

0

0

0

5:01

05/04/2021

VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference

Steve Dai, Rangha Venkatesan, Mark Ren and
Brian Zimmer, William Dally, Brucek Khailany

Keywords Paper

Deep Learning -> Generative Models, Algorithms -> Similarity and Distance Learning

0

0

0

0

19:08

26/04/2020

Linear Symmetric Quantization of Neural Networks for Low-precision Integer Hardware

Xiandong Zhao, Ying Wang, Xuyi Cai and
Cheng Liu, Lei Zhang

Keywords Paper

quantization, integer-arithmetic-only DNN accelerator, acceleration

0

0

0

0

4:43

02/02/2021

Linearly Replaceable Filters for Deep Network Channel Pruning

Donggyu Joo, Eojindl Yi, Sunghyun Baek, Junmo Kim

Keywords Paper

0

0

0

0

15:47

05/01/2021

TResNet: High Performance GPU-Dedicated Architecture

Tal Ridnik, Hussam Lawen, Asaf Noy and
Emanuel Ben Baruch, Gilad Sharir, Itamar Friedman

Keywords Paper

0

0

0

0

4:19

03/05/2021

Training independent subnetworks for robust prediction

Marton Havasi, Rodolphe Jenatton, Stanislav Fort and
Jeremiah Zhe Liu, Jasper Snoek, Balaji Lakshminarayanan, Andrew Dai, Dustin Tran

Keywords Paper

robustness, Efficient ensembles

0

0

0

0

4:10

05/01/2021

Spike-Thrift: Towards Energy-Efficient Deep Spiking Neural Networks by Limiting Spiking Activity via Attention-Guided Compression

Souvik Kundu, Gourav Datta, Massoud Pedram, Peter A. Beerel

Keywords Paper

0

0

0

0

5:22

15/06/2020

Effectively Prefetching Remote Memory with Leap

Hasan Al Maruf, Mosharaf Chowdhury

Keywords Paper

0

0

0

0

21:56

12/07/2020

Inducing and Exploiting Activation Sparsity for Fast Inference on Deep Neural Networks

Mark Kurtz, Justin Kopinsky, Rati Gelashvili and
Alexander Matveev, John Carr, Michael Goin, William Leiserson, Sage Moore, Nir Shavit, Dan Alistarh

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

14:41

06/12/2020

TinyTL: Reduce Memory, Not Parameters for Efficient On-Device Learning

Han Cai, Chuang Gan, Ligeng Zhu, Song Han

Keywords Paper

0

0

0

0

3:20

26/04/2020

Dynamic Model Pruning with Feedback

Tao Lin, Sebastian U. Stich, Luis Barba and
Daniil Dmitriev, Martin Jaggi

Keywords Paper

network pruning, dynamic reparameterization, model compression

0

0

0

0

4:30

19/08/2021

EventDrop: Data Augmentation for Event-based Learning

Fuqiang Gu, Weicong Sng, Xuke Hu, Fangwen Yu

Keywords Paper

Computer Vision, Recognition, Classification

0

0

0

0

8:48

06/12/2021

Powerpropagation: A sparsity inducing weight reparameterisation

Jonathan Schwarz, Siddhant M Jayakumar, Razvan Pascanu and
Peter E Latham, Yee Teh

Keywords Paper

deep learning, optimization, continual learning

0

0

0

1

9:08

16/11/2020

Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications

Matthew Khoury, Rumen Dangovski, Longwu Ou and
Preslav Nakov, Yichen Shen, Li Jing

Keywords Paper

natural applications, neural translation, neural nmt, neural

0

0

0

0

11:54

06/12/2021

Neural Architecture Dilation for Adversarial Robustness

Yanxi Li, Zhaohui Yang, Yunhe Wang, Chang Xu

Keywords Paper

deep learning, robustness, adversarial robustness and security

0

0

0

0

6:40

15/06/2020

Optimizing Memory-mapped I/O for Fast Storage Devices

Anastasios Papagiannis, Giorgos Xanthakis, Giorgos Saloustros and
Manolis Marazakis, Angelos Bilas

Keywords Paper

0

0

0

0

20:23

14/06/2020

Structured Multi-Hashing for Model Compression

Elad Eban, Yair Movshovitz-Attias, Hao Wu and
Mark Sandler, Andrew Poon, Yerlan Idelbayev, Miguel Á. Carreira-Perpiñán

Keywords Paper

compression, weight hashing, on device

0

0

0

0

1:01

06/12/2021

Post-Training Sparsity-Aware Quantization

Gil Shomron, Freddy Gabbay, Samer Kurzum, Uri Weiser

Keywords Paper

deep learning

0

0

0

0

8:54

06/12/2021

Memory-efficient Patch-based Inference for Tiny Deep Learning

Ji Lin, Wei-Ming Chen, Han Cai and
Chuang Gan, Song Han

Keywords Paper

deep learning, machine learning, vision

0

0

0

0

11:14

06/12/2021

Heavy Ball Neural Ordinary Differential Equations

Hedi Xia, Vai Suliafu, Hangjie Ji and
Tan Nguyen, Andrea Bertozzi, Stanley Osher, Bao Wang

Keywords Paper

deep learning, optimization, machine learning, vision

0

0

0

0

4:08

12/07/2020

Generalization Guarantees for Sparse Kernel Approximation with Entropic Optimal Features

Liang Ding, Rui Tuo, Shahin Shahrampour

Keywords Paper

General Machine Learning Techniques

0

0

0

0

15:03

26/04/2020

EMPIR: Ensembles of Mixed Precision Deep Networks for Increased Robustness Against Adversarial Attacks

Sanchari Sen, Balaraman Ravindran, Anand Raghunathan

Keywords Paper

ensembles, mixed precision, robustness, adversarial attacks

0

0

0

0

4:25

02/02/2021

Near Lossless Transfer Learning for Spiking Neural Networks

Zhanglu Yan, Jun Zhou, Weng-Fai Wong

Keywords Paper

0

0

0

0

16:34

02/02/2021

Memory and Computation-Efficient Kernel SVM via Binary Embedding and Ternary Model Coefficients

Zijian Lei, Liang Lan

Keywords Paper

0

0

0

0

12:29

06/12/2021

Scatterbrain: Unifying Sparse and Low-rank Attention

Beidi Chen, Tri Dao, Eric Winsor and
Zhao Song, Atri Rudra, Christopher Ré

Keywords Paper

transformers, generative model

0

0

0

0

13:15

06/12/2020

DisARM: An Antithetic Gradient Estimator for Binary Latent Variables

Zhe Dong, Andriy Mnih, George Tucker

Keywords Paper

0

0

0

0

3:37

02/02/2021

Amata: An Annealing Mechanism for Adversarial Training Acceleration

Nanyang Ye, Qianxiao Li, Xiao-Yun Zhou, Zhanxing Zhu

Keywords Paper

0

0

0

0

14:30

14/09/2020

Squeezing Correlated Neurons for Resource-Efficient Deep Neural Networks

Elbruz Ozen, Alex Orailoglu

Keywords Paper

deep learning, information redundancy, pruning

0

0

0

0

14:48

23/06/2021

Snapshot-Free, Transparent, and Robust Memory Reclamation for Lock-Free Data Structures

Ruslan Nikolaev, Binoy Ravindran

Keywords Paper

lock-free, non-blocking, memory reclamation, hazard pointers, epoch-based reclamation

0

0

0

0

23:33

05/04/2021

PipeMare: Asynchronous Pipeline Parallel DNN Training

Bowen Yang, Jian Zhang, Jonathan Li and
Christopher Re, Christopher Aberger, Christopher De Sa

Keywords Paper

0

0

0

0

16:57