A Transductive Approach for Video Object Segmentation

Abstract: Semi-supervised video object segmentation aims to separate a target object from a video sequence, given the mask in the first frame. Most of current prevailing methods utilize information from additional modules trained in other domains like optical flow and instance segmentation, and as a result they do not compete with other methods on common ground. To address this issue, we propose a simple yet strong transductive method, in which additional modules, datasets, and dedicated architectural designs are not needed. Our method takes a label propagation approach where pixel labels are passed forward based on feature similarity in an embedding space. Different from other propagation methods, ours diffuses temporal information in a holistic manner which take accounts of long-term object appearance. In addition, our method requires few additional computational overhead, and runs at a fast ~37 fps speed. Our single model with a vanilla ResNet50 backbone achieves an overall score of 72.3% on the DAVIS 2017 validation set and 63.1% on the test set. This simple yet high performing and efficient method can serve as a solid baseline that facilitates future research. Code and models are available at https://github.com/ microsoft/transductive-vos.pytorch.

A Transductive Approach for Video Object Segmentation

Yizhuo Zhang, Zhirong Wu, Houwen Peng, Stephen Lin

Comments

Similar Papers

SESS: Self-Ensembling Semi-Supervised 3D Object Detection

Na Zhao, Tat-Seng Chua, Gim Hee Lee

Keywords Abstract Paper

3d object detection, semi-supervised learning, self-ensembling technique, point cloud analysis

Making a Case for 3D Convolutions for Object Segmentation in Videos

Sabarinath Mahadevan, Ali Athar, Aljosa Osep and Laura Leal-Taixé, Bastian Leibe, Sebastian Hennen

Keywords Abstract Paper

object tracking, video segmentation, video object segmentation, video scene understanding, object segmentation

FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation

Julia Gong, F. Christopher Holsinger, Serena Yeung

Keywords Abstract Paper

video object segmentation, single shot video object segmentation, segmentation, object tracking, optical flow, motion tracking, visual warping, weak supervision, video analysis, object segmentation

BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation

Hao Chen, Kunyang Sun, Zhi Tian and Chunhua Shen, Yongming Huang, Youliang Yan

Keywords Abstract Paper

instance segmentation, fully-convolutional, object detection, real-time

Characterizing signal propagation to close the performance gap in unnormalized ResNets

Andrew Brock, Soham De, Samuel Smith

Keywords Abstract Paper

neural networks, ConvNets, deep learning, CNNs, EfficientNets, ResNets, signal propagation, normalizers, ImageNet

UWSOD: Toward Fully-Supervised-Level Capacity Weakly Supervised Object Detection

Yunhang Shen, Rongrong Ji, Zhiwei Chen and Yongjian Wu, Feiyue Huang

Keywords Abstract Paper

CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement

Ho Kei Cheng, Jihoon Chung, Yu-Wing Tai, Chi-Keung Tang

Keywords Abstract Paper

segmentation refinement, high-resolution, 4k, semantic segmentation, scene parsing

Diverse Image Generation via Self-Conditioned GANs

Steven Liu, Tongzhou Wang, David Bau and Jun-Yan Zhu, Antonio Torralba

Keywords Abstract Paper

generative adversarial networks, image synthesis, mode collapse, clustering, unsupervised learning

Context Prior for Scene Segmentation

Changqian Yu, Jingbo Wang, Changxin Gao and Gang Yu, Chunhua Shen, Nong Sang

Keywords Abstract Paper

semantic segmentation, scene segmentation, context prior, context aggregation, affinity loss, affinity matrix

Structure-Consistent Weakly Supervised Salient Object Detection with Local Saliency Coherence

Siyue Yu, Bingfeng Zhang, Jimin Xiao, Eng Gee Lim

Keywords Abstract Paper

JA-POLS: A Moving-Camera Background Model via Joint Alignment and Partially-Overlapping Local Subspaces

Irit Chelly, Vlad Winter, Dor Litvak and David Rosen, Oren Freifeld

Keywords Abstract Paper

background subtraction, video analysis, computer vision, machine learning, robust pca, deep learning, moving camera, transfer learning, video surveillance, lie groups

Mask Encoding for Single Shot Instance Segmentation

Rufeng Zhang, Zhi Tian, Chunhua Shen and Mingyu You, Youliang Yan

Keywords Abstract Paper

mask encoding, instance segmentation, single shot

Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection

Nadine Chang, Zhiding Yu, Yu-Xiong Wang and Anima Anandkumar, Sanja Fidler, Jose Alvarez

Keywords Abstract Paper

Applications, Computer Vision

Feature Fusion Vision Transformer for Fine-Grained Visual Categorization

Jun Wang, Xiaohan Yu, Yongsheng Gao

Keywords Abstract Paper

Fine-grained visual categorization, Vision transformer, Self-attention, Feature Fusion

Siamese Box Adaptive Network for Visual Tracking

Zedu Chen, Bineng Zhong, Guorong Li and Shengping Zhang, Rongrong Ji

Keywords Abstract Paper

visual tracking, siamese network, anchor-free, fully convolutional network, box adaptive, no-prior box

Towards Better Generalization: Joint Depth-Pose Learning Without PoseNet

Wang Zhao, Shaohui Liu, Yezhi Shu, Yong-Jin Liu

Keywords Abstract Paper

monocular depth estimation, self-supervised learning, deep visual odometry, 3d deep learning, multi-task learning

Explicitly Modeled Attention Maps for Image Classification

Andong Tan, Duc Tam Nguyen, Maximilian Dax and Matthias Nießner, Thomas Brox

Keywords Abstract Paper

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

Ruixu Liu, Ju Shen, He Wang and Chen Chen, Sen-ching Cheung, Vijayan Asari

Keywords Abstract Paper

3d human pose, attention mechanism, multi-scale dilation convolution, monocular motion reconstruction

Learning Fast and Robust Target Models for Video Object Segmentation

Andreas Robinson, Felix Järemo Lawin, Martin Danelljan and Fahad Shahbaz Khan, Michael Felsberg

Keywords Abstract Paper

video object segmentation, semi-supervised

Large Scale Image Completion via Co-Modulated Generative Adversarial Networks

Shengyu Zhao, Jonathan Cui, Yilun Sheng and Yue Dong, Xiao Liang, Eric Chang, Yan Xu

Keywords Abstract Paper

Keywords Paper

Sabarinath Mahadevan, Ali Athar, Aljosa Osep and
Laura Leal-Taixé, Bastian Leibe, Sebastian Hennen

Keywords Paper

Keywords Paper

Hao Chen, Kunyang Sun, Zhi Tian and
Chunhua Shen, Yongming Huang, Youliang Yan

Keywords Paper

Keywords Paper

Yunhang Shen, Rongrong Ji, Zhiwei Chen and
Yongjian Wu, Feiyue Huang

Keywords Paper

Keywords Paper

Steven Liu, Tongzhou Wang, David Bau and
Jun-Yan Zhu, Antonio Torralba

Keywords Paper

Changqian Yu, Jingbo Wang, Changxin Gao and
Gang Yu, Chunhua Shen, Nong Sang

Keywords Paper

Keywords Paper

Irit Chelly, Vlad Winter, Dor Litvak and
David Rosen, Oren Freifeld

Keywords Paper

Rufeng Zhang, Zhi Tian, Chunhua Shen and
Mingyu You, Youliang Yan

Keywords Paper

Nadine Chang, Zhiding Yu, Yu-Xiong Wang and
Anima Anandkumar, Sanja Fidler, Jose Alvarez

Keywords Paper

Keywords Paper

Zedu Chen, Bineng Zhong, Guorong Li and
Shengping Zhang, Rongrong Ji

Keywords Paper

Keywords Paper

Andong Tan, Duc Tam Nguyen, Maximilian Dax and
Matthias Nießner, Thomas Brox

Keywords Paper

Ruixu Liu, Ju Shen, He Wang and
Chen Chen, Sen-ching Cheung, Vijayan Asari

Keywords Paper

Andreas Robinson, Felix Järemo Lawin, Martin Danelljan and
Fahad Shahbaz Khan, Michael Felsberg

Keywords Paper

Shengyu Zhao, Jonathan Cui, Yilun Sheng and
Yue Dong, Xiao Liang, Eric Chang, Yan Xu

Keywords Paper

Keywords Paper

Keywords Paper

Chi Wang, Yang Hua, ZHENG LU and
Jian Gao, Neil Robertson

Keywords Paper

Keywords Paper

Dongfang Liu, Yiming Cui, Liqi Yan and
Christos Mousas, Baijian Yang, Yingjie Chen

Keywords Paper

Wang Shen, Wenbo Bao, Guangtao Zhai and
Li Chen, Xiongkuo Min, Zhiyong Gao

Keywords Paper

Keywords Paper

Keywords Paper

Xiaofeng Ruan, Yufan Liu, Bing Li and
Chunfeng Yuan, Weiming Hu

Keywords Paper

Keywords Paper

Keywords Paper

Yuyang Zhang Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences and
Jinge Wang, Shibiao Xu, Xiao Liu, Xiaopeng Zhang

Keywords Paper

Jiehong Lin, Hongyang Li, Ke Chen and
Jiangbo Lu, Kui Jia

Keywords Paper

Vishnu Sanjay Ramiya Srinivasan, Rui Ma, Qiang Tang and
Zili Yi, Zhan Xu

Keywords Paper

Yude Wang, Jie Zhang, Meina Kan and
Shiguang Shan, Xilin Chen

Keywords Paper

Yuliang Zou, Zizhao Zhang, Han Zhang and
Chun-Liang Li, Xiao Bian, Jia-Bin Huang, Tomas Pfister

Keywords Paper