Multi-Exit Vision Transformer for Dynamic Inference

22/11/2021

Arian Bakhtiarnia, Qi Zhang, Alexandros Iosifidis

Keywords: early exiting, vision transformer, dynamic inference, transformer models, attention models, efficient inference, early exits, multi-exit architectures, edge computing

Abstract: Deep neural networks can be converted to multi-exit architectures by inserting early exit branches after some of their intermediate layers. This allows their inference process to become dynamic, which is useful for time-critical IoT applications with stringent latency requirements but time-variant communication and computation resources, in particular in edge computing systems and IoT networks where the exact computation time budget is variable and not known beforehand. Vision Transformer is a recently proposed architecture which has since found many applications across various domains of computer vision. In this work, we propose seven different architectures for early exit branches that can be used for dynamic inference in Vision Transformer backbones. Through extensive experiments involving both classification and regression problems, we show that each one of our proposed architectures could prove useful in the trade-off between accuracy and speed.
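The dynamic inference idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and does not reproduce any of the seven proposed branch architectures. It assumes a confidence-threshold exit rule (a common choice in early-exit work, not stated in the abstract), and the names `dynamic_inference`, `toy_stages`, and `toy_heads` are hypothetical stand-ins for backbone blocks and exit branches.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def softmax(logits: Sequence[float]) -> Vector:
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_inference(
    x: Vector,
    stages: Sequence[Callable[[Vector], Vector]],
    exit_heads: Sequence[Callable[[Vector], Vector]],
    threshold: float,
) -> Tuple[int, int, float]:
    """Run stages in order and stop at the first sufficiently confident exit.

    Returns (predicted_class, exit_index, confidence). The last exit always
    answers, so a prediction is returned even if no exit reaches the threshold.
    """
    if not stages or len(stages) != len(exit_heads):
        raise ValueError("need one exit head per stage and at least one stage")
    features = x
    last = len(stages) - 1
    for i, (stage, head) in enumerate(zip(stages, exit_heads)):
        features = stage(features)       # backbone computation up to exit i
        probs = softmax(head(features))  # early exit branch i
        confidence = max(probs)
        if confidence >= threshold or i == last:
            return probs.index(confidence), i, confidence
    raise AssertionError("unreachable: the last exit always returns")


# Toy stand-ins (hypothetical): each stage doubles the features, and each
# exit head reads the features directly as two-class logits.
toy_stages = [lambda f: [2.0 * v for v in f]] * 3
toy_heads = [lambda f: list(f)] * 3

easy = dynamic_inference([2.0, -2.0], toy_stages, toy_heads, threshold=0.9)
hard = dynamic_inference([0.1, -0.1], toy_stages, toy_heads, threshold=0.9)
print(easy[1], hard[1])  # exit index used for each input
```

In this toy setup the clearly separated input leaves at the first exit, while the ambiguous input runs through every stage. Lowering `threshold` trades accuracy for speed, which is the trade-off the abstract refers to.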

The talk and the respective paper were published at the BMVC 2021 virtual conference.

Similar Papers

26/04/2020

Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning

Ruqi Zhang, Chunyuan Li, Jianyi Zhang, Changyou Chen, Andrew Gordon Wilson

14:59

06/12/2021

Model, sample, and epoch-wise descents: exact solution of gradient flow in the random feature model

Antoine Bodin, Nicolas Macris

Keywords: deep learning, optimization

15:00

06/12/2021

Differentiable Multiple Shooting Layers

Stefano Massaroli, Michael Poli, Sho Sonoda, Taiji Suzuki, Jinkyoo Park, Atsushi Yamashita, Hajime Asama

Keywords: deep learning, machine learning

13:08

02/02/2021

Synergetic Learning of Heterogeneous Temporal Sequences for Multi-Horizon Probabilistic Forecasting

Longyuan Li, Jihai Zhang, Junchi Yan, Yaohui Jin, Yunhao Zhang, Yanjie Duan, Guangjian Tian

10:36

19/08/2021

Neural Temporal Point Processes: A Review

Oleksandr Shchur, Ali Caner Türkmen, Tim Januschowski, Stephan Günnemann

Keywords: Machine learning, General

14:26

06/12/2020

User-Dependent Neural Sequence Models for Continuous-Time Event Data

Alex Boyd, Robert Bamler, Stephan Mandt, Padhraic Smyth


3:23

06/12/2021

Learning Transferable Adversarial Perturbations

Krishna kanth Nakka, Mathieu Salzmann

Keywords: deep learning, optimization, adversarial robustness and security

12:00

14/06/2020

GP-NAS: Gaussian Process Based Neural Architecture Search

Zhihang Li, Teng Xi, Jiankang Deng, Gang Zhang, Shengzhao Wen, Ran He

Keywords: neural architecture search, gaussian process, image classification, face recognition

0:59

14/06/2020

MemNAS: Memory-Efficient Neural Architecture Search With Grow-Trim Learning

Peiye Liu, Bo Wu, Huadong Ma, Mingoo Seok

Keywords: neural architecture search, recurrent neural network, memory optimization

0:59

12/07/2020

Forecasting sequential data using Consistent Koopman Autoencoders

Omri Azencot, N. Benjamin Erichson, Vanessa Lin, Michael Mahoney

Keywords: Sequential, Network, and Time-Series Modeling

14:06

12/07/2020

Representing Unordered Data Using Multiset Automata and Complex Numbers

Justin DeBenedetto, David Chiang

Keywords: General Machine Learning Techniques

15:10

18/07/2021

AdaXpert: Adapting Neural Architecture for Growing Data

Shuaicheng Niu, Jiaxiang Wu, Guanghui Xu, Yifan Zhang, Yong Guo, Peilin Zhao, Peng Wang, Mingkui Tan

Keywords: Deep Learning, Reinforcement Learning and Planning, Reinforcement Learning, Algorithms, AutoML

5:14

06/12/2020

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte, Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

3:20

16/11/2020

AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network

Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu

Keywords: parallelization, faster prediction, linear-chain model, neural approaches

7:05

13/04/2021

DebiNet: Debiasing linear models with nonlinear overparameterized neural networks

Shiyun Xu, Zhiqi Bu


2:56

03/05/2021

A Design Space Study for LISTA and Beyond

Tianjian Meng, Xiaohan Chen, Yifan Jiang, Zhangyang Wang


5:50

26/04/2020

Intensity-Free Learning of Temporal Point Processes

Oleksandr Shchur, Marin Biloš, Stephan Günnemann

Keywords: Temporal point process, neural density estimation

4:32

06/12/2020

A Study on Encodings for Neural Architecture Search

Colin White, Willie Neiswanger, Sam Nolen, Yash Savani

Keywords: Optimization -> Convex Optimization; Optimization -> Non-Convex Optimization; Optimization -> Stochastic Optimization, Algorithms -> Large Scale Learning

3:04

06/12/2020

The phase diagram of approximation rates for deep neural networks

Dmitry Yarotsky, Anton Zhevnerchuk


3:07

22/11/2021

Make Baseline Model Stronger: Embedded Knowledge Distillation in Weight-Sharing Based Ensemble Network

Shuchang LYU, Qi Zhao, Yujing Ma, Lijiang Chen

Keywords: knowledge distillation, ensemble learning, high-efficiency network

3:03

22/06/2020

Graph Hawkes Neural Network for Forecasting on Temporal Knowledge Graphs

Zhen Han, Yunpu Ma, Yuyi Wang, Stephan Günnemann, Volker Tresp

Keywords: Hawkes process, dynamic graphs, temporal knowledge graphs, point processes

8:59

26/04/2020

Fast Neural Network Adaptation via Parameter Remapping and Architecture Search

Jiemin Fang, Yuzhu Sun, Kangjian Peng, Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang

4:39

02/02/2021

Anytime Inference with Distilled Hierarchical Neural Ensembles

Adria Ruiz, Jakob Verbeek


15:09

06/12/2021

DeepSITH: Efficient Learning via Decomposition of What and When Across Time Scales

Brandon Jacques, Zoran Tiganj, Marc Howard, Per B Sederberg

Keywords: deep learning, machine learning

14:33

02/02/2021

Attentive Neural Point Processes for Event Forecasting

Yulong Gu


17:36

06/12/2020

EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning

Jiachen Li, Fan Yang, Masayoshi Tomizuka, Chiho Choi


3:21

12/07/2020

Learning with Feature and Distribution Evolvable Streams

Zhen-Yu Zhang, Peng Zhao, Yuan Jiang, Zhi-Hua Zhou

Keywords: Supervised Learning

15:01

02/02/2021

End-to-end Semantic Role Labeling with Neural Transition-based Model

Hao Fei, Meishan Zhang, Bobo Li, Donghong Ji


18:47

12/07/2020

Neural Datalog Through Time: Informed Temporal Modeling via Logical Specification

Hongyuan Mei, Guanghui Qin, Minjie Xu, Jason Eisner

Keywords: Sequential, Network, and Time-Series Modeling

15:03

06/12/2021

Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions

Mathias Niepert, Pasquale Minervini, Luca Franceschi

Keywords: deep learning, optimization

15:02

22/11/2021

One-Step Pixel-Level Perturbation-Based Saliency Detector

Vinnam Kim, Hyunsouk Cho, Sehee Chung

Keywords: explainable ai, saliency map

3:30

06/12/2021

How Powerful are Performance Predictors in Neural Architecture Search?

Colin White, Arber Zela, Robin Ru, Yang Liu, Frank Hutter

Keywords: deep learning, optimization

15:02

06/12/2021

Bubblewrap: Online tiling and real-time flow prediction on neural manifolds

Anne Draelos, Pranjal Gupta, Na Young Jun, Chaichontat Sriworarat, John Pearson

Keywords: neuroscience

12:58

06/12/2021

Neural Flows: Efficient Alternative to Neural ODEs

Marin Biloš, Johanna Sommer, Syama Sundar Rangapuram, Tim Januschowski, Stephan Günnemann

Keywords: deep learning, generative model

12:09

18/07/2021

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Jianfei Chen, Lianmin Zheng, Zhewei Yao, Dequan Wang, Ion Stoica, Michael Mahoney, Joseph E Gonzalez

Keywords: Algorithms, Large Scale Learning

18:54

06/12/2020

MomentumRNN: Integrating Momentum into Recurrent Neural Networks

Tan Nguyen, Richard Baraniuk, Andrea Bertozzi, Stanley Osher, Bao Wang

3:09

03/05/2021

Prediction and generalisation over directed actions by grid cells

Changmin Yu, Timothy Behrens, Neil Burgess

Keywords: grid cells, Computational neuroscience, normative models

5:42

12/07/2020

Sub-Goal Trees -- a Framework for Goal-Based Reinforcement Learning

Tom Jurgenson, Or Avner, Edward Groshev, Aviv Tamar

Keywords: Reinforcement Learning - General

15:04

03/05/2021

More or Less: When and How to Build Convolutional Neural Network Ensembles

Abdul Wasay, Stratos Idreos

Keywords: empirical study, ensemble learning, computer vision, machine learning systems

4:39

06/12/2020

Evolving Normalization-Activation Layers

Hanxiao Liu, Andy Brock, Karen Simonyan, Quoc V Le


2:32