Temporally Distributed Networks for Fast Video Semantic Segmentation

14/06/2020

Ping Hu, Fabian Caba, Oliver Wang, Zhe Lin, Stan Sclaroff, Federico Perazzi

Keywords: video semantic segmentation, semantic segmentation, low-latency video processing, temporally distributed computation, attention propagation, grouped knowledge distillation

Abstract: We present TDNet, a temporally distributed network designed for fast and accurate video semantic segmentation. We observe that features extracted from a certain high-level layer of a deep CNN can be approximated by composing features extracted from several shallower sub-networks. Leveraging the inherent temporal continuity in videos, we distribute these sub-networks over sequential frames. Therefore, at each time step, we only need to perform a lightweight computation to extract a sub-features group from a single sub-network. The full features used for segmentation are then recomposed by application of a novel attention propagation module that compensates for geometry deformation between frames. A grouped knowledge distillation loss is also introduced to further improve the representation power at both full and sub-feature levels. Experiments on Cityscapes, CamVid, and NYUD-v2 demonstrate that our method achieves state-of-the-art accuracy with significantly faster speed and lower latency.
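
The scheduling idea in the abstract (one lightweight sub-network per frame, with the full feature recomposed from the most recent sub-feature groups) can be sketched roughly as follows. This is a toy illustration, not the authors' implementation: `make_subnet` and `TemporallyDistributedExtractor` are hypothetical names, the "sub-networks" are trivial functions on lists, and the recomposition is a plain element-wise sum standing in for the attention propagation module described in the paper.

```python
from collections import deque


def make_subnet(k):
    """Hypothetical stand-in for the k-th shallow sub-network."""
    def subnet(frame):
        # Toy "feature": scale every value of the frame by (k + 1).
        return [(k + 1) * x for x in frame]
    return subnet


class TemporallyDistributedExtractor:
    """Runs one sub-network per frame and recomposes cached sub-features."""

    def __init__(self, subnets):
        self.subnets = subnets
        # Keep only the most recent len(subnets) sub-feature groups.
        self.window = deque(maxlen=len(subnets))
        self.step = 0

    def process(self, frame):
        # Round-robin: only one sub-network is evaluated at each time step.
        k = self.step % len(self.subnets)
        self.window.append(self.subnets[k](frame))
        self.step += 1
        # Assumption: element-wise sum as a placeholder for recomposition.
        # The paper uses attention propagation to compensate for motion.
        return [sum(values) for values in zip(*self.window)]


extractor = TemporallyDistributedExtractor([make_subnet(0), make_subnet(1)])
out1 = extractor.process([1, 2])
out2 = extractor.process([1, 2])
out3 = extractor.process([0, 0])
```

In this sketch the per-frame cost is that of a single sub-network, while the returned feature still draws on every sub-network through the cached window; a plain sum ignores motion between frames, which is the gap the attention propagation module is meant to address.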

The talk and the corresponding paper were published at the CVPR 2020 virtual conference.

Similar Papers

06/12/2021

Dynamic Normalization and Relay for Video Action Recognition

Dongqi Cai, Anbang Yao, Yurong Chen

Keywords: deep learning, representation learning

14/06/2020

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

Ruixu Liu, Ju Shen, He Wang, Chen Chen, Sen-ching Cheung, Vijayan Asari

Keywords: 3d human pose, attention mechanism, multi-scale dilation convolution, monocular motion reconstruction

14/06/2020

X3D: Expanding Architectures for Efficient Video Recognition

Christoph Feichtenhofer

Keywords: video classification, action recognition, video detection, video understanding, deep learning, neural networks

07/09/2020

Making a Case for 3D Convolutions for Object Segmentation in Videos

Sabarinath Mahadevan, Ali Athar, Aljosa Osep, Laura Leal-Taixé, Bastian Leibe, Sebastian Hennen

Keywords: object tracking, video segmentation, video object segmentation, video scene understanding, object segmentation

18/07/2021

Is Space-Time Attention All You Need for Video Understanding?

Gedas Bertasius, Heng Wang, Lorenzo Torresani

Keywords: Algorithms, AutoML, Deep Learning, Architectures

13/04/2021

Adaptive wavelet pooling for convolutional neural networks

Moritz Wolter, Jochen Garcke

14/06/2020

Gate-Shift Networks for Video Action Recognition

Swathikiran Sudhakaran, Sergio Escalera, Oswald Lanz

Keywords: action recognition, video representation learning, spatio-temporal interactions, video classification

14/06/2020

Learning Event-Based Motion Deblurring

Zhe Jiang, Yu Zhang, Dongqing Zou, Jimmy Ren, Jiancheng Lv, Yebin Liu

Keywords: deblur, event camera, video reconstruction, image restoration, low-level vision, neural networks, adversarial training, adaptive sampling, supervised learning, dynamic vision sensor

22/11/2021

Spatial-Temporal Residual Aggregation for High Resolution Video Inpainting

Vishnu Sanjay Ramiya Srinivasan, Rui Ma, Qiang Tang, Zili Yi, Zhan Xu

Keywords: high resolution video inpainting, spatial-temporal aggregation, residual aggregation, spatial-temporal attention, image alignment

05/01/2021

OverNet: Lightweight Multi-Scale Super-Resolution With Overscaling Network

Parichehr Behjati, Pau Rodriguez, Armin Mehri, Isabelle Hupont, Carles Fernandez Tena, Jordi Gonzalez

05/04/2021

Accelerate Inference of CNNs for Video Analysis While Preserving Exactness Exploiting Activation Sparsity

Toshiaki Wakatsuki, Sekitoshi Kanai, Yasuhiro Fujiwara

14/06/2020

Cascaded Deep Video Deblurring Using Temporal Sharpness Prior

Jinshan Pan, Haoran Bai, Jinhui Tang

Keywords: video deblurring, deep convolutional neural network, motion blur estimation, optical flow, temporal sharpness prior, image restoration

14/06/2020

Improving Convolutional Networks With Self-Calibrated Convolutions

Jiang-Jiang Liu, Qibin Hou, Ming-Ming Cheng, Changhu Wang, Jiashi Feng

Keywords: self-calibrated, feature transformation, image classification, network architecture, convolutional neural networks

02/02/2021

MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection

Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

05/01/2021

Exploiting the Redundancy in Convolutional Filters for Parameter Reduction

Kumara Kahatapitiya, Ranga Rodrigo

14/06/2020

Context Prior for Scene Segmentation

Changqian Yu, Jingbo Wang, Changxin Gao, Gang Yu, Chunhua Shen, Nong Sang

Keywords: semantic segmentation, scene segmentation, context prior, context aggregation, affinity loss, affinity matrix

03/05/2021

VA-RED$^2$: Video Adaptive Redundancy Reduction

Bowen Pan, Rameswar Panda, Camilo L Fosco, Chung-Ching Lin, Alex J Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogerio Feris

03/05/2021

CT-Net: Channel Tensorization Network for Video Classification

Kunchang Li, Xianhang Li, Yali Wang, Jun Wang, Yu Qiao

Keywords: 3D Convolution, Video Classification, Channel Tensorization

30/11/2020

Dense Dual-Path Network for Real-time Semantic Segmentation

Xinneng Yang, Yan Wu, Junqiao Zhao, Feilin Liu

22/11/2021

GhostShiftAddNet: More Features from Energy-Efficient Operations

Jia Bi, Jonathon Hare, Geoff V Merrett

Keywords: Efficient convolutional neural network, embedded platform, feature redundancy, image classifier

14/06/2020

SmallBigNet: Integrating Core and Contextual Views for Video Classification

Xianhang Li, Yali Wang, Zhipeng Zhou, Yu Qiao

Keywords: video classification, action recognition, temporal convolution, 3d maxpooling, shared convolution

06/12/2021

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Bowen Zhang, Yifan Liu, Zhi Tian, Chunhua Shen

Keywords: deep learning, vision, representation learning

06/12/2020

Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

Mathilde Caron, Ishan Misra, Julien Mairal, Priya Goyal, Piotr Bojanowski, Armand Joulin

03/05/2021

Attentional Constellation Nets for Few-Shot Learning

Weijian Xu, Yifan Xu, Huaijin Wang, Zhuowen Tu

Keywords: few-shot learning, constellation models

02/02/2021

MVFNet: Multi-View Fusion Network for Efficient Video Recognition

Wenhao Wu, Dongliang He, Tianwei Lin, Fu Li, Chuang Gan, Errui Ding

06/12/2021

Temporal-attentive Covariance Pooling Networks for Video Recognition

Zilin Gao, Qilong Wang, Bingbing Zhang, Qinghua Hu, Peihua Li

26/04/2020

Batch-shaping for learning conditional channel gated networks

Babak Ehteshami Bejnordi, Tijmen Blankevoort, Max Welling

Keywords: Conditional computation, channel gated networks, gating, Batch-shaping, distribution matching, image classification, semantic segmentation

14/06/2020

Classifying, Segmenting, and Tracking Object Instances in Video with Mask Propagation

Gedas Bertasius, Lorenzo Torresani

Keywords: instance segmentation, object detection, object tracking, video analysis

06/12/2020

Structured Convolutions for Efficient Neural Network Design

Yash Bhalgat, Yizhe Zhang, Jamie Menjay Lin, Fatih Porikli

14/06/2020

Time Flies: Animating a Still Image With Time-Lapse Video As Reference

Chia-Chi Cheng, Hung-Yu Chen, Wei-Chen Chiu

Keywords: time-lapse video animation, self-supervised learning, style transfer, temporal consistency

14/06/2020

A2dele: Adaptive and Attentive Depth Distiller for Efficient RGB-D Salient Object Detection

Yongri Piao, Zhengkun Rong, Miao Zhang, Weisong Ren, Huchuan Lu

Keywords: rgb-d, salient object detection, knowledge distillation, attention, computer vision, cnn

06/12/2021

Relational Self-Attention: What's Missing in Attention for Video Understanding

Manjin Kim, Heeseung Kwon, Chunyu Wang, Suha Kwak, Minsu Cho

Keywords: deep learning, transformers

02/02/2021

Learning Comprehensive Motion Representation for Action Recognition

Mingyu Wu, Boyuan Jiang, Donghao Luo, Junchi Yan, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xiaokang Yang

12/07/2020

Informative Dropout for Robust Representation Learning: A Shape-bias Perspective

Baifeng Shi, Dinghuai Zhang, Qi Dai, Jingdong Wang, Zhanxing Zhu, Yadong Mu

Keywords: Accountability, Transparency and Interpretability

18/07/2021

Group Fisher Pruning for Practical Network Compression

Liyang Liu, Shilong Zhang, Zhanghui Kuang, Aojun Zhou, Jing-Hao Xue, Xinjiang Wang, Yimin Chen, Wenming Yang, Qingmin Liao, Wayne Zhang

Keywords: Applications, Computer Vision

30/11/2020

CS-MCNet: A Video Compressive Sensing Reconstruction Network with Interpretable Motion Compensation

Bowen Huang, Jinjia Zhou, Xiao Yan, Ming'e Jing, Rentao Wan, Yibo Fan

22/11/2021

Temporal Meta-Adaptor for Video Object Detection

Chi Wang, Yang Hua, Zheng Lu, Jian Gao, Neil Robertson

Keywords: video object detection, temporal aggregation, meta-learning, ImageNet VID

02/02/2021

SMART Frame Selection for Action Recognition

Shreyank N Gowda, Marcus Rohrbach, Laura Sevilla-Lara

05/01/2021

Adaptive Streaming of 360-Degree Videos With Reinforcement Learning

Sohee Park, Minh Hoai, Arani Bhattacharya, Samir R. Das
