Video Region Annotation with Sparse Bounding Boxes

07/09/2020

Video Region Annotation with Sparse Bounding Boxes

Yuzheng Xu, Yang Wu, Nur Sabrina binti Zuraimi, Shohei Nobuhara, Ko Nishino

Keywords: video annotation, semi-automatic annotation, graph convolutional network, region boundaries, sparse bounding boxes, automatic boundary finding

Abstract Paper Similar Papers

Abstract: Video analysis has been moving towards more detailed interpretation (e.g. segmentation) with encouraging progresses. These tasks, however, increasingly rely on densely annotated training data both in space and time. Since such annotation is labour-intensive, few densely annotated video data with detailed region boundaries exist. This work aims to resolve this dilemma by learning to automatically generate region boundaries for all frames of a video from sparsely annotated bounding boxes of target regions. We achieve this with a Volumetric Graph Convolutional Network (VGCN), which learns to iteratively find keypoints on the region boundaries using the spatio-temporal volume of surrounding appearance and motion. The global optimization of VGCN makes it significantly stronger and generalize better than existing solutions. Experimental results using two latest datasets (one real and one synthetic), including ablation studies, demonstrate the effectiveness and superiority of our method.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at BMVC 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

22/11/2021

Knowing What, Where and When to Look: Video Action modelling with Attention

Juan-Manuel Perez-Rua, Brais Martinez, Xiatian Zhu and
Antoine S Toisoul, Victor A Escorcia, Tao Xiang

Keywords Paper

Action recognition, Fine-grained action, video attention, Spatial attention, Channel attention, Temporal attention, Spatio-temporal attention, Feature refinement

0

0

0

0

2:46

03/05/2021

A Good Image Generator Is What You Need for High-Resolution Video Synthesis

Yu Tian, Jian Ren, Menglei Chai and
Kyle Olszewski, Xi Peng, Dimitris Metaxas, Sergey Tulyakov

Keywords Paper

contrastive learning, cross-domain video generation, high-resolution video generation

0

0

0

0

10:03

17/08/2020

Neural supersampling for real-time rendering

Lei Xiao, Salah Nouri, Matt Chapman and
Alexander Fix, Douglas Lanman, Anton Kaplanyan

Keywords Paper

virtual reality, rendering, deep learning, superresolution, upsampling

0

0

0

0

6:03

14/06/2020

AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation

Hyeongmin Lee, Taeoh Kim, Tae-young Chung and
Daehyun Pak, Yuseok Ban, Sangyoun Lee

Keywords Paper

video frame interpolation, video temporal super-resolution, frame rate up conversion, frame synthesis, motion estimation, motion compensation, frame warping

0

0

0

0

1:01

07/09/2020

Align-and-Attend Network for Globally and Locally Coherent Video Inpainting

Sanghyun Woo, Dahun Kim, KwanYong Park and
Joon-Young Lee, In So Kweon

Keywords Paper

Video Inpainting, Video Processing, Spatio-Temporal Alignment, Spatio-Temporal Non-local Attention

0

0

0

0

5:17

14/06/2020

Deep Homography Estimation for Dynamic Scenes

Hoang Le, Feng Liu, Shu Zhang, Aseem Agarwala

Keywords Paper

homography estimation, dynamic scenes, motion estimation, multi-task learning, deep learning

0

0

0

0

1:01

05/01/2021

Autonomous Tracking for Volumetric Video Sequences

Matthew Moynihan, Susana Ruano, Rafael Pages, Aljosa Smolic

Keywords Paper

0

0

0

0

5:01

14/06/2020

MAST: A Memory-Augmented Self-Supervised Tracker

Zihang Lai, Erika Lu, Weidi Xie

Keywords Paper

self-supervised learning, video segmentation, memory-augmented model, video understanding, tracking, unsupervised learning, generalization, attention, representation learning, metric learning

0

0

0

0

1:01

05/01/2021

DynaVSR: Dynamic Adaptive Blind Video Super-Resolution

Suyoung Lee, Myungsub Choi, Kyoung Mu Lee

Keywords Paper

0

0

0

0

4:56

05/01/2021

Embedded Dense Camera Trajectories in Multi-Video Image Mosaics by Geodesic Interpolation-Based Reintegration

Lars Haalck, Benjamin Risse

Keywords Paper

0

0

0

0

4:42

26/04/2020

CATER: A diagnostic dataset for Compositional Actions & TEmporal Reasoning

Rohit Girdhar, Deva Ramanan

Keywords Paper

Video Understanding, Temporal Reasoning

0

0

0

0

14:56

06/12/2021

Shifted Chunk Transformer for Spatio-Temporal Representational Learning

Xuefan Zha, Wentao Zhu, Lv Xun and
Sen Yang, Ji Liu

Keywords Paper

machine learning, transformers, vision, language

0

0

0

0

6:14

14/06/2020

Deep Non-Line-of-Sight Reconstruction

Javier Grau Chopite, Matthias B. Hullin, Michael Wand, Julian Iseringhausen

Keywords Paper

non-line-of-sight, time-of-flight, transient imaging, deep learning, geometry reconstruction, synthetic training

0

0

0

0

1:00

17/08/2020

Consistent video depth estimation

Xuan Luo, Jia-Bin Huang, Richard Szeliski and
Kevin Matzen, Johannes Kopf

Keywords Paper

video, depth estimation

0

0

0

1

12:43

14/06/2020

Self-Supervised Monocular Trained Depth Estimation Using Self-Attention and Discrete Disparity Volume

Adrian Johnston, Gustavo Carneiro

Keywords Paper

self-supervised depth estimation, self-supervised learning, self-attention, depth estimation, uncertainty

0

0

0

0

1:01

14/06/2020

X3D: Expanding Architectures for Efficient Video Recognition

Christoph Feichtenhofer

Keywords Paper

video classification, action recognition, video detection, video understanding, deep learning, neural networks

0

0

0

0

4:56

06/12/2021

A Continuous Mapping For Augmentation Design

Keyu Tian, Chen Lin, Ser Nam Lim and
Wanli Ouyang, Puneet Dokania, Philip Torr

Keywords Paper

optimization

0

0

0

0

9:23

14/06/2020

Temporal Pyramid Network for Action Recognition

Ceyuan Yang, Yinghao Xu, Jianping Shi and
Bo Dai, Bolei Zhou

Keywords Paper

video understanding, action recognition, visual tempo, temporal pyramid

0

0

0

0

1:01

14/06/2020

Scene-Adaptive Video Frame Interpolation via Meta-Learning

Myungsub Choi, Janghoon Choi, Sungyong Baik and
Tae Hyun Kim, Kyoung Mu Lee

Keywords Paper

video frame interpolation, test-time adaptation, meta-learning, self-supervision, image synthesis, slow motion, motion estimation, error correction, maml, input-adaptive neural network

0

0

0

0

0:55

07/09/2020

Integrating Long-Short Term Network for Efficient Video Object Segmentation

Jingjing Wang, Zhu Teng, Baopeng Zhang, Jianping Fan

Keywords Paper

Video Object Segmentation, Long-Short Term Network, Multiple-object segmentation

0

0

0

0

8:30

05/01/2021

Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition

Theo Ayral, Marco Pedersoli, Simon Bacon, Eric Granger

Keywords Paper

0

0

0

0

5:00

14/06/2020

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

Ruixu Liu, Ju Shen, He Wang and
Chen Chen, Sen-ching Cheung, Vijayan Asari

Keywords Paper

3d human pose, attention mechanism, multi-scale dilation convolution, monocular motion reconstruction

0

0

0

0

5:01

14/06/2020

Video Modeling With Correlation Networks

Heng Wang, Du Tran, Lorenzo Torresani, Matt Feiszli

Keywords Paper

action recognition, video classification, motion, correlation, temporal information, kinetics, something-something.

0

0

0

0

1:05

06/12/2020

Convolutional Tensor-Train LSTM for Spatio-Temporal Learning

Jiahao Su, Wonmin Byeon, Jean Kossaifi and
Furong Huang, Jan Kautz, Anima Anandkumar

Keywords Paper

0

0

0

0

3:29

02/02/2021

Patch-Wise Attention Network for Monocular Depth Estimation

Sihaeng Lee, Janghyeon Lee, Byungju Kim and
Eojindl Yi, Junmo Kim

Keywords Paper

0

0

0

0

14:15

06/12/2021

Clockwork Variational Autoencoders

Vaibhav Saxena, Jimmy Ba, Danijar Hafner

Keywords Paper

deep learning, vision, generative model

0

0

0

0

10:23

07/09/2020

Refinement of Boundary Regression Using Uncertainty in Temporal Action Localization

Yunze Chen, Mengjuan Chen, Rui Wu and
Jiagang Zhu, Zheng Zhu, Qingyi Gu

Keywords Paper

Temporal Action Localization, Temporal Action Detection, Activity recognition and understanding

0

0

0

0

5:09

14/06/2020

gDLS*: Generalized Pose-and-Scale Estimation Given Scale and Gravity Priors

Victor Fragoso, Joseph DeGol, Gang Hua

Keywords Paper

scale and pose estimation, generalized camera model, multi-camera system, scale prior, gravity prior, gdls, inertial sensors, pose estimation, ransac

0

0

0

0

0:56

13/04/2021

Learning bijective feature maps for linear ICA

Alexander Camuto, Matthew Willetts, Chris Holmes and
Brooks Paige, Stephen Roberts

Keywords Paper

0

0

0

0

3:02

05/01/2021

Self-Supervised 4D Spatio-Temporal Feature Learning via Order Prediction of Sequential Point Cloud Clips

Haiyan Wang, Liang Yang, Xuejian Rong and
Jinglun Feng, Yingli Tian

Keywords Paper

0

0

0

0

4:52

03/05/2021

VA-RED$^2$: Video Adaptive Redundancy Reduction

Bowen Pan, Rameswar Panda, Camilo L Fosco and
Chung-Ching Lin, Alex J Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogerio Feris

Keywords Paper

0

0

0

0

5:02

14/06/2020

Plug-and-Play Algorithms for Large-Scale Snapshot Compressive Imaging

Xin Yuan, Yang Liu, Jinli Suo, Qionghai Dai

Keywords Paper

snapshot compressive image, plug-and-play, large-scale, video compressive sensing, convergence, coded aperture compresive temporal imaging (cacti), gap, admm, real data

0

0

0

0

5:01

14/06/2020

Unsupervised Learning From Video With Deep Neural Embeddings

Chengxu Zhuang, Tianwei She, Alex Andonian and
Max Sobol Mark, Daniel Yamins

Keywords Paper

unsupervised learning, self-supervised learning, video learning, contrastive learning, deep neural networks, action recognition, object recognition, two-pathway models

0

0

0

0

1:01

02/02/2021

CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation

Yang Fu, Linjie Yang, Ding Liu and
Thomas S. Huang, Humphrey Shi

Keywords Paper

0

0

0

0

16:24

12/07/2020

Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?

Kei Ota, Tomoaki Oiki, Devesh Jha and
Toshisada Mariyama, Daniel Nikovski

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

14:55

05/01/2021

Dual-Stream Fusion Network for Spatiotemporal Video Super-Resolution

Min-Yuan Tseng, Yen-Chung Chen, Yi-Lun Lee and
Wei-Sheng Lai, Yi-Hsuan Tsai, Wei-Chen Chiu

Keywords Paper

0

0

0

0

4:58

18/07/2021

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Jianfei Chen, Lianmin Zheng, Zhewei Yao and
Dequan Wang, Ion Stoica, Michael Mahoney, Joseph E Gonzalez

Keywords Paper

Algorithms, Large Scale Learning

0

0

0

0

18:54

05/01/2021

High-Quality Frame Interpolation via Tridirectional Inference

Jinsoo Choi, Jaesik Park, In So Kweon

Keywords Paper

0

0

0

0

4:08

05/01/2021

Adaptive Streaming of 360-Degree Videos With Reinforcement Learning

Sohee Park, Minh Hoai, Arani Bhattacharya, Samir R. Das

Keywords Paper

0

0

0

0

4:51

30/11/2020

Project to Adapt: Domain Adaptation for Depth Completion from Noisy and Sparse Sensor Data

Adrian Lopez-Rodriguez, Benjamin Busam, Krystian Mikolajczyk

Keywords Paper

0

0

0

0

10:00