sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data

05/04/2021

sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data

Guanhua Wang, Zhuang Liu, Brandon Hsieh, Siyuan Zhuang, Joseph Gonzalez, Trevor Darrell, Ion Stoica

Keywords:

Abstract Paper Similar Papers

Abstract: Convolutional Neural Networks (ConvNets) enable computers to excel on vision learning tasks such as image classification, object detection. Recently, real-time inference on live data is becoming more and more important. From a system perspective, it requires fast inference on each single, incoming data item (e.g. 1 image). Two main-stream distributed model serving paradigms – data parallelism and model parallelism – are not necessarily desirable here, because we cannot further split a single input data piece via data parallelism, and model parallelism introduces huge communication overhead. To achieve live data inference with low latency, we propose sensAI, a novel and generic approach that decouples a CNN model into disconnected subnets, each is responsible for predicting certain class(es). We call this new model distribution paradigm as class parallelism. Experimental results show that, sensAI achieves up to 18x faster inference on single input data item with no or negligible accuracy loss on CIFAR-10, CIFAR-100 and ImageNet-1K datasets.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38952769

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at MLSYS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

05/04/2021

sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data

Guanhua Wang, Zhuang Liu, Brandon Hsieh and
Siyuan Zhuang, Joseph Gonzalez, Trevor Darrell, Ion Stoica

Keywords Paper

0

0

0

0

5:23

06/12/2021

Dynamic Normalization and Relay for Video Action Recognition

Dongqi Cai, Anbang Yao, Yurong Chen

Keywords Paper

deep learning, representation learning

0

0

0

0

10:42

05/04/2021

IOS: Inter-Operator Scheduler for CNN Acceleration

Yaoyao Ding, Ligeng Zhu, Zhihao Jia and
Gennady Pekhimenko, Song Han

Keywords Paper

0

0

0

0

4:44

05/04/2021

IOS: Inter-Operator Scheduler for CNN Acceleration

Yaoyao Ding, Ligeng Zhu, Zhihao Jia and
Gennady Pekhimenko, Song Han

Keywords Paper

0

0

0

0

18:27

14/06/2020

Deep Optics for Single-Shot High-Dynamic-Range Imaging

Christopher A. Metzler, Hayato Ikoma, Yifan Peng, Gordon Wetzstein

Keywords Paper

high-dynamic-range imaging, point-spread-function engineering, end-to-end learning, computational imaging, deep learning, optics, photography

0

0

0

0

5:01

26/04/2020

AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures

Michael S. Ryoo, AJ Piergiovanni, Mingxing Tan, Anelia Angelova

Keywords Paper

video representation learning, video understanding, activity recognition, neural architecture search

0

0

0

0

5:02

02/02/2021

Explaining Convolutional Neural Networks through Attribution-Based Input Sampling and Block-Wise Feature Aggregation

Sam Sattarzadeh, Mahesh Sudhakar, Anthony Lem and
Shervin Mehryar, Konstantinos N Plataniotis, Jongseong Jang, Hyunwoo Kim, Yeonjeong Jeong, Sangmin Lee, Kyunghoon Bae

Keywords Paper

0

0

0

0

19:59

14/06/2020

AOWS: Adaptive and Optimal Network Width Search With Latency Constraints

Maxim Berman, Leonid Pishchulin, Ning Xu and
Matthew B. Blaschko, Gérard Medioni

Keywords Paper

neural architecture search, mobilenet, tensorrt, latency, classification, imagenet, viterbi, network width search, ows, aows

0

0

0

0

4:55

05/01/2021

VideoSSL: Semi-Supervised Learning for Video Classification

Longlong Jing, Toufiq Parag, Zhe Wu and
Yingli Tian, Hongcheng Wang

Keywords Paper

0

0

0

0

4:56

14/06/2020

Context Prior for Scene Segmentation

Changqian Yu, Jingbo Wang, Changxin Gao and
Gang Yu, Chunhua Shen, Nong Sang

Keywords Paper

semantic segmentation, scene segmentation, context prior, context aggregation, affinity loss, affinity matrix

0

0

0

0

1:01

30/11/2020

Video-Based Crowd Counting Using a Multi-Scale Optical Flow Pyramid Network

Mohammad Asiful Hossain, Kevin Cannons, Daesik Jang and
Fabio Cuzzolin, Zhan Xu

Keywords Paper

0

0

0

0

9:54

03/05/2021

Attentional Constellation Nets for Few-Shot Learning

Weijian Xu, Yifan Xu, Huaijin Wang, Zhuowen Tu

Keywords Paper

few-shot learning, constellation models

0

0

0

0

5:10

14/06/2020

EventSR: From Asynchronous Events to Image Reconstruction, Restoration, and Super-Resolution via End-to-End Adversarial Learning

Lin Wang, Tae-Kyun Kim, Kuk-Jin Yoon

Keywords Paper

event-based vision, image super-resolution, image restoration, image reconstruction, unsupervised and adversarial learning

0

0

0

0

1:03

14/06/2020

Cascaded Deep Video Deblurring Using Temporal Sharpness Prior

Jinshan Pan, Haoran Bai, Jinhui Tang

Keywords Paper

video deblurring, deep convolutional neural network, motion blur estimation, optical flow, temporal sharpness prior, image restoration

0

0

0

0

0:53

02/02/2021

Classifying Sequences of Extreme Length with Constant Memory Applied to Malware Detection

Edward Raff, William Fleshman, Richard Zak and
Hyrum S. Anderson, Bobby Filar, Mark McLean

Keywords Paper

0

0

0

0

19:55

14/06/2020

Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection

Sara Beery, Guanhang Wu, Vivek Rathod and
Ronny Votel, Jonathan Huang

Keywords Paper

object detection, attention, video object detection, domain adaptation, generalization, static cameras, camera traps, low-quality data, conservation, climate change

0

0

0

0

1:01

14/06/2020

Unsupervised Learning From Video With Deep Neural Embeddings

Chengxu Zhuang, Tianwei She, Alex Andonian and
Max Sobol Mark, Daniel Yamins

Keywords Paper

unsupervised learning, self-supervised learning, video learning, contrastive learning, deep neural networks, action recognition, object recognition, two-pathway models

0

0

0

0

1:01

02/02/2021

Learnable Dynamic Temporal Pooling for Time Series Classification

Dongha Lee, Seonghyeon Lee, Hwanjo Yu

Keywords Paper

0

0

0

0

18:05

06/12/2021

Container: Context Aggregation Networks

peng gao, Jiasen Lu, hongsheng Li and
Roozbeh Mottaghi, Aniruddha Kembhavi

Keywords Paper

deep learning, self-supervised learning, transformers, vision, language

0

0

0

0

8:50

06/12/2020

Sparse Graphical Memory for Robust Planning

Scott Emmons, Ajay Jain, Misha Laskin and
Thanard Kurutach, Pieter Abbeel, Deepak Pathak

Keywords Paper

0

0

0

0

3:23

07/09/2020

High-speed Light-weight CNN Inference via Strided Convolutions on a Pixel Processor Array

Yanan Liu, Laurie Bose, Jianing Chen and
Stephen Carey, Piotr Dudek, Walterio Mayol-Cuevas

Keywords Paper

Binary CNN, CNN on embedded system, Pixel Processor Array, SCAMP, high-speed CNN, Light-weight CNN

0

0

0

0

8:06

19/10/2020

Deep adaptive feature aggregation in multi-task convolutional neural networks

Zhen Shen, Chaoran Cui, Jin Huang and
Jian Zong, Meng Chen, Yilong Yin

Keywords Paper

convolutional neural networks, multi-task learning, adaptive feature aggregation

0

0

0

0

6:36

14/06/2020

GAN Compression: Efficient Architectures for Interactive Conditional GANs

Muyang Li, Ji Lin, Yaoyao Ding and
Zhijian Liu, Jun-Yan Zhu, Song Han

Keywords Paper

generative adversarial networks, model compression, distillation, neural architecture search, image and video synthesis

0

0

0

0

1:00

02/02/2021

Improving Adversarial Robustness via Probabilistically Compact Loss with Logit Constraints

Xin Li, Xiangrui Li, Deng Pan, Dongxiao Zhu

Keywords Paper

0

0

0

0

16:06

14/07/2020

Communication lower bounds of convolutions in CNNs

Xiaoyang Zhang, Junmin Xiao, Guangming Tan

Keywords Paper

near communication-optimal strategy, red-blue pebble game, communication lower bound, convolutional neural network

0

0

0

0

7:30

14/06/2020

Erasing Integrated Learning: A Simple Yet Effective Approach for Weakly Supervised Object Localization

Jinjie Mai, Meng Yang, Wenfeng Luo

Keywords Paper

weakly supervised, object localization, adversarial erasing

0

0

0

0

5:00

14/06/2020

Meta-Transfer Learning for Zero-Shot Super-Resolution

Jae Woong Soh, Sunwoo Cho, Nam Ik Cho

Keywords Paper

zero-shot super-resolution, meta learning, transfer learning

0

0

0

0

0:59

06/12/2020

Adaptive Shrinkage Estimation for Streaming Graphs

Nesreen K. Ahmed, Nick Duffield

Keywords Paper

0

0

0

0

3:23

14/06/2020

Regularizing CNN Transfer Learning With Randomised Regression

Yang Zhong, Atsuto Maki

Keywords Paper

transfer learning, network regularization, randomised regression, pseudo task regularization, limited samples

0

0

0

0

0:58

03/05/2021

VA-RED$^2$: Video Adaptive Redundancy Reduction

Bowen Pan, Rameswar Panda, Camilo L Fosco and
Chung-Ching Lin, Alex J Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogerio Feris

Keywords Paper

0

0

0

0

5:02

30/11/2020

Compact and Fast Underwater Segmentation Network for Autonomous Underwater Vehicles

Jiangtao Wang, Baihua Li, Yang Zhou and
Emanuele Rocco, Qinggang Meng

Keywords Paper

0

0

0

0

4:01

26/04/2020

Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning

Ruqi Zhang, Chunyuan Li, Jianyi Zhang and
Changyou Chen, Andrew Gordon Wilson

Keywords Paper

0

0

0

0

14:59

06/12/2021

BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer

Haoping Bai, Meng Cao, Ping Huang, Jiulong Shan

Keywords Paper

deep learning, optimization

0

0

0

0

4:12

07/09/2020

Integrating Long-Short Term Network for Efficient Video Object Segmentation

Jingjing Wang, Zhu Teng, Baopeng Zhang, Jianping Fan

Keywords Paper

Video Object Segmentation, Long-Short Term Network, Multiple-object segmentation

0

0

0

0

8:30

05/01/2021

OverNet: Lightweight Multi-Scale Super-Resolution With Overscaling Network

Parichehr Behjati, Pau Rodriguez, Armin Mehri and
Isabelle Hupont, Carles Fernandez Tena, Jordi Gonzalez

Keywords Paper

0

0

0

0

4:24

14/06/2020

Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition

Linchao Zhu, Yi Yang

Keywords Paper

long-tailed visual recognition, region self-attention, inflated episodic memory, long-tailed video classification

0

0

0

0

1:00

06/12/2021

MLP-Mixer: An all-MLP Architecture for Vision

Ilya Tolstikhin, Neil Houlsby, Alexander Kolesnikov and
Lucas Beyer, Xiaohua Zhai, Thomas Unterthiner, Jessica Yung, Andreas Steiner, Daniel Keysers, Jakob Uszkoreit, Mario Lucic, Alexey Dosovitskiy

Keywords Paper

deep learning, machine learning, transformers, vision, transfer learning

0

0

0

0

11:18

06/12/2021

Improving Contrastive Learning on Imbalanced Data via Open-World Sampling

Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangyang Wang

Keywords Paper

contrastive learning

0

0

0

0

10:12

14/06/2020

Temporally Distributed Networks for Fast Video Semantic Segmentation

Ping Hu, Fabian Caba, Oliver Wang and
Zhe Lin, Stan Sclaroff, Federico Perazzi

Keywords Paper

video semantic segmentation, semantic segmentation, low-latency video processing, temporally distributed computation, attention propagation, grouped knowledge distillation

0

0

0

0

1:00

14/06/2020

ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks

Qilong Wang, Banggu Wu, Pengfei Zhu and
Peihua Li, Wangmeng Zuo, Qinghua Hu

Keywords Paper

channel attention, efficient, adaptive 1d convolution, deep cnns, image classifcation, object detection, instance segmentation

0

0

0

0

0:57