Spatial-Temporal Residual Aggregation for High Resolution Video Inpainting

22/11/2021

Spatial-Temporal Residual Aggregation for High Resolution Video Inpainting

Vishnu Sanjay Ramiya Srinivasan, Rui Ma, Qiang Tang, Zili Yi, Zhan Xu

Keywords: high resolution video inpainting, spatial-temporal aggregation, residual aggregation, spatial-temporal attention, image alignment

Abstract Paper Code Similar Papers

Abstract: Recent learning-based inpainting algorithms have achieved compelling results for completing missing regions after removing undesired objects in videos. To maintain the temporal consistency among the frames, 3D spatial and temporal operations are often heavily used in the deep networks. However, these methods usually suffer from memory constraints and can only handle low resolution videos. We propose STRA-Net, a novel spatial-temporal residual aggregation framework for high resolution video inpainting. The key idea is to first learn and apply a spatial and temporal inpainting network on the downsampled low resolution videos. Then, we refine the low resolution results by aggregating the learned spatial and temporal image residuals (details) to the upsampled inpainted frames. Both the quantitative and qualitative evaluations show that we can produce more temporal-coherent and visually appealing results than the state-of-the-art methods on inpainting high resolution videos.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at BMVC 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

05/01/2021

DynaVSR: Dynamic Adaptive Blind Video Super-Resolution

Suyoung Lee, Myungsub Choi, Kyoung Mu Lee

Keywords Paper

0

0

0

0

4:56

22/11/2021

Temporal Meta-Adaptor for Video Object Detection

Chi Wang, Yang Hua, ZHENG LU and
Jian Gao, Neil Robertson

Keywords Paper

video object detection, temporal aggregation, meta-learning, ImageNet VID

0

0

0

0

6:58

17/08/2020

Learning temporal coherence via self-supervision for GAN-based video generation

Mengyu Chu, You Xie, Jonas Mayer and
Laura Leal-Taixé, Nils Thuerey

Keywords Paper

self-supervision, temporal cycle-consistency, video super-resolution, generative adversarial network, unpaired video translation

0

0

0

0

16:59

06/12/2021

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations

Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc V Gool

Keywords Paper

self-supervised learning, vision, contrastive learning, representation learning

0

0

0

0

13:32

30/11/2020

Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection

Erli Ouyang, Li Zhang, Mohan Chen and
Anurag Arnab, Yanwei Fu

Keywords Paper

0

0

0

0

6:30

02/02/2021

HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation

Xiaoyang Lyu, Liang Liu, Mengmeng Wang and
Xin Kong, Lina Liu, Yong Liu, Xinxin Chen, Yi Yuan

Keywords Paper

0

0

0

0

12:10

14/06/2020

SmallBigNet: Integrating Core and Contextual Views for Video Classification

Xianhang Li, Yali Wang, Zhipeng Zhou, Yu Qiao

Keywords Paper

video classification, action recognition, temporal convolution, 3d maxpooling, shared convolution

0

0

0

0

1:00

14/06/2020

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

Ruixu Liu, Ju Shen, He Wang and
Chen Chen, Sen-ching Cheung, Vijayan Asari

Keywords Paper

3d human pose, attention mechanism, multi-scale dilation convolution, monocular motion reconstruction

0

0

0

0

5:01

14/06/2020

Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Analysis and a New Strategy

Jaejun Yoo, Namhyuk Ahn, Kyung-Ah Sohn

Keywords Paper

data augmentation, low-level vision, image restoration, super-resolution, realsr, denoising, artifact removal, cutblur, mixture of augmentation, generalization

0

0

0

0

1:01

06/12/2021

Compressive Visual Representations

Kuang-Huei Lee, Anurag Arnab, Sergio Guadarrama and
John Canny, Ian Fischer

Keywords Paper

theory, machine learning, robustness, self-supervised learning, contrastive learning

0

0

0

0

6:30

03/05/2021

Self-Supervised Learning of Compressed Video Representations

Youngjae Yu, Sangho Lee, Gunhee Kim, Yale Song

Keywords Paper

self-supervised learning, Compressed videos

0

0

0

0

4:34

05/01/2021

Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition

Theo Ayral, Marco Pedersoli, Simon Bacon, Eric Granger

Keywords Paper

0

0

0

0

5:00

14/06/2020

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking

Jin Gao, Weiming Hu, Yan Lu

Keywords Paper

online learning, visual tracking, continual learning, recursive least-squares estimation, deep learning, memory retention, recursive learning, mini-batch sgd, normal equation, mlp layer

0

0

0

0

5:01

26/04/2020

Efficient and Information-Preserving Future Frame Prediction and Beyond

Wei Yu, Yichao Lu, Steve Easterbrook, Sanja Fidler

Keywords Paper

self-supervised learning, generative pre-training, video prediction, reversible architecture

0

0

0

0

4:18

22/11/2021

Fine-grained Multi-Modal Self-Supervised Learning

Duo Wang, Salah Karout

Keywords Paper

self-supervised learning, multi-modal learning

0

0

0

0

2:46

07/09/2020

Boosting Image and Video Compression via Learning Latent Residual Patterns

Yen-Chung Chen, Keng-Jui Chang, Yi-Hsuan Tsai, Wei-Chen Chiu

Keywords Paper

compression artifacts, image compression, video compression, latent residual

0

0

0

0

7:48

14/06/2020

Semi-Supervised Learning for Few-Shot Image-to-Image Translation

Yaxing Wang, Salman Khan, Abel Gonzalez-Garcia and
Joost van de Weijer, Fahad Shahbaz Khan

Keywords Paper

image-to-image translation, few-shot image generation, unsupervised image-to-image translation, conditional image generation, semi-supervised image-to-image translation

0

0

0

0

0:58

02/02/2021

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Ziyi Liu, Le Wang, Wei Tang and
Junsong Yuan, Nanning Zheng, Gang Hua

Keywords Paper

0

0

0

0

19:49

14/06/2020

Learning Memory-Guided Normality for Anomaly Detection

Hyunjong Park, Jongyoun Noh, Bumsub Ham

Keywords Paper

anomaly detection, unsupervised learning, memory networks, prototypical feature, pattern clustering, feature extraction, video recognition, convolutional neural networks

0

0

0

0

1:01

06/12/2021

Why Do Better Loss Functions Lead to Less Transferable Features?

Simon Kornblith, Ting Chen, Honglak Lee, Mohammad Norouzi

Keywords Paper

deep learning, machine learning, vision, transfer learning

0

0

0

0

9:26

07/09/2020

Learning Effectively from Noisy Supervision for Weakly Supervised Semantic Segmentation

Wenbin Xie, Qiaoqiao Wei, Zheng Li, Hui Zhang

Keywords Paper

Semantic Segmentation, Weakly Supervised Semantic Segmentation, Self Attention

0

0

0

0

3:46

19/08/2021

Learning Implicit Temporal Alignment for Few-shot Video Classification

Songyang Zhang, Jiale Zhou, Xuming He

Keywords Paper

Computer Vision, Action Recognition, Deep Learning

0

0

0

0

6:20

06/12/2020

Self-Adaptively Learning to Demoiré from Focused and Defocused Image Pairs

Lin Liu, Shanxin Yuan, Jianzhuang Liu and
Liping Bao, Gregory Slabaugh, Qi Tian

Keywords Paper

0

0

0

0

3:14

22/11/2021

Deep Video Decaptioning

Pengpeng Chu, Weize Quan, Tong Wang and
Pan Wang, Peiran Ren, Dong-Ming Yan

Keywords Paper

video decaptioning, caption mask extraction, frame attention, real time

0

0

0

0

2:59

07/09/2020

Meta-RetinaNet for Few-shot Object Detection

Shaoqi Li, Wenfeng Song, Shuai Li and
Aimin Hao, Hong Qin

Keywords Paper

Few shot, object detection, meta-learning, Meta-RetinaNet, Balanced Loss, coefficient vector

0

0

0

0

8:51

05/01/2021

DualSR: Zero-Shot Dual Learning for Real-World Super-Resolution

Mohammad Emad, Maurice Peemen, Henk Corporaal

Keywords Paper

0

0

0

0

4:57

14/06/2020

Scale-Space Flow for End-to-End Optimized Video Compression

Eirikur Agustsson, David Minnen, Nick Johnston and
Johannes Ballé, Sung Jin Hwang, George Toderici

Keywords Paper

learned video compression, scale-space flow, bilinear warping

0

0

0

0

0:55

22/11/2021

Siamese Prototypical Contrastive Learning

Shentong Mo, Zhun Sun, Chao Li

Keywords Paper

self-supervised learning, contrastive learning, representation learning

0

0

0

0

2:50

14/06/2020

Regularization on Spatio-Temporally Smoothed Feature for Action Recognition

Jinhyung Kim, Seunghwan Cha, Dongyoon Wee and
Soonmin Bae, Junmo Kim

Keywords Paper

regularization, action recognition, video classification

0

0

0

0

1:01

14/06/2020

Deblurring Using Analysis-Synthesis Networks Pair

Adam Kaufman, Raanan Fattal

Keywords Paper

image deblurring, blur-kernel estimation, blind deblurring, image restoration, image processing, deep learning, neural networks, generative model

0

0

0

0

1:01

06/12/2021

Compressed Video Contrastive Learning

Yuqi Huo, Mingyu Ding, Haoyu Lu and
Nanyi Fei, Zhiwu Lu, Ji-Rong Wen, Ping Luo

Keywords Paper

self-supervised learning, contrastive learning, representation learning

0

0

0

0

9:07

07/09/2020

Making a Case for 3D Convolutions for Object Segmentation in Videos

Sabarinath Mahadevan, Ali Athar, Aljosa Osep and
Laura Leal-Taixé, Bastian Leibe, Sebastian Hennen

Keywords Paper

object tracking, video segmentation, video object segmentation, video scene understanding, object segmentation

0

0

0

0

8:16

06/12/2020

Hard Negative Mixing for Contrastive Learning

Yannis Kalantidis, Mert Bulent Sariyildiz, Noe Pion and
Philippe Weinzaepfel, Diane Larlus

Keywords Paper

0

0

0

0

3:17

14/06/2020

A Spatial RNN Codec for End-to-End Image Compression

Chaoyi Lin, Jiabao Yao, Fangdong Chen, Li Wang

Keywords Paper

image compression, spatial rnn, adaptive quantization, lstm

0

0

0

0

1:01

02/02/2021

Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation

Daizong Liu, Shuangjie Xu, Xiao-Yang Liu and
Zichuan Xu, Wei Wei, Pan Zhou

Keywords Paper

0

0

0

0

14:42

05/01/2021

Intra-Class Part Swapping for Fine-Grained Image Classification

Lianbo Zhang, Shaoli Huang, Wei Liu

Keywords Paper

0

0

0

0

4:43

14/06/2020

Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection

Zhongzheng Ren, Zhiding Yu, Xiaodong Yang and
Ming-Yu Liu, Yong Jae Lee, Alexander G. Schwing, Jan Kautz

Keywords Paper

weakly-supervised, object detection, video recognition, instance-aware, context-focused, memory-efficient

0

0

0

0

0:59

07/09/2020

ViewSynth: Learning Local Features from Depth using View Synthesis

Jisan Mahmud, Rajat Vikram Singh, Peri Akiva and
Spondon Kundu, Kuan-Chuan Peng, Jan-Michael Frahm

Keywords Paper

viewpoint invariant representation learning, depth representation learning, view synthesis, correspondence learning

0

0

0

0

10:00

03/05/2021

VA-RED$^2$: Video Adaptive Redundancy Reduction

Bowen Pan, Rameswar Panda, Camilo L Fosco and
Chung-Ching Lin, Alex J Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogerio Feris

Keywords Paper

0

0

0

0

5:02

14/06/2020

Learning Event-Based Motion Deblurring

Zhe Jiang, Yu Zhang, Dongqing Zou and
Jimmy Ren, Jiancheng Lv, Yebin Liu

Keywords Paper

deblur, event camera, video reconstruction, image restoration, low-level vision, neural networks, adversarial training, adaptive sampling, supervised learning, dynamic vision sensor

0

0

0

0

1:01