Memory Aggregation Networks for Efficient Interactive Video Object Segmentation

14/06/2020

Memory Aggregation Networks for Efficient Interactive Video Object Segmentation

Jiaxu Miao, Yunchao Wei, Yi Yang

Keywords: interactive video object segmentation, pixel embedding learning, memory aggregation networks

Abstract Paper Similar Papers

Abstract: Interactive video object segmentation (iVOS) aims at efficiently harvesting high-quality segmentation masks of the target object in a video with user interactions. Most previous state-of-the-arts tackle the iVOS with two independent networks for conducting user interaction and temporal propagation, respectively, leading to inefficiencies during the inference stage. In this work, we propose a unified framework, named Memory Aggregation Networks (MA-Net), to address the challenging iVOS in a more efficient way. Our MA-Net integrates the interaction and the propagation operations into a single network, which significantly promotes the efficiency of iVOS in the scheme of multi-round interactions. More importantly, we propose a simple yet effective memory aggregation mechanism to record the informative knowledge from the previous interaction rounds, improving the robustness in discovering challenging objects of interest greatly. We conduct extensive experiments on the validation set of DAVIS Challenge 2018 benchmark. In particular, our MA-Net achieves the J@60 score of 76.1% without any bells and whistles, outperforming the state-of-the-arts with more than 2.7%.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at CVPR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching

Xuhua Huang, Jiarui Xu, Yu-Wing Tai, Chi-Keung Tang

Keywords Paper

video object segmentation, tracking, segmentation, detection, semi-supervised learning

0

0

0

0

1:02

06/12/2020

Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

Mathilde Caron, Ishan Misra, Julien Mairal and
Priya Goyal, Piotr Bojanowski, Armand Joulin

Keywords Paper

0

1

0

0

3:22

26/04/2020

Computation Reallocation for Object Detection

Feng Liang, Chen Lin, Ronghao Guo and
Ming Sun, Wei Wu, Junjie Yan, Wanli Ouyang

Keywords Paper

Neural Architecture Search, Object Detection

0

0

0

0

5:29

14/06/2020

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

Ruixu Liu, Ju Shen, He Wang and
Chen Chen, Sen-ching Cheung, Vijayan Asari

Keywords Paper

3d human pose, attention mechanism, multi-scale dilation convolution, monocular motion reconstruction

0

0

0

0

5:01

14/06/2020

Learning Video Object Segmentation From Unlabeled Videos

Xiankai Lu, Wenguan Wang, Jianbing Shen and
Yu-Wing Tai, David J. Crandall, Steven C. H. Hoi

Keywords Paper

unsupervised/weakly supervised vos, four granularity, video pattern learning

0

0

0

0

1:01

14/06/2020

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking

Junbo Yin, Wenguan Wang, Qinghao Meng and
Ruigang Yang, Jianbing Shen

Keywords Paper

mot, multi-task learning, motion, affinity, attention, online

0

0

0

0

1:03

14/06/2020

MemNAS: Memory-Efficient Neural Architecture Search With Grow-Trim Learning

Peiye Liu, Bo Wu, Huadong Ma, Mingoo Seok

Keywords Paper

neural architecture search, recurrent neural network, memory optimization

0

0

0

0

0:59

03/05/2021

Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels

Denis Yarats, Ilya Kostrikov, Rob Fergus

Keywords Paper

0

0

0

0

7:30

06/12/2021

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

Junnan Li, Ramprasaath Selvaraju, Akhilesh Gotmare and
Shafiq Joty, Caiming Xiong, Steven Chu Hong Hoi

Keywords Paper

transformers, vision, representation learning

0

0

0

0

9:40

02/02/2021

DenserNet: Weakly Supervised Visual Localization Using Multi-Scale Feature Aggregation

Dongfang Liu, Yiming Cui, Liqi Yan and
Christos Mousas, Baijian Yang, Yingjie Chen

Keywords Paper

0

0

0

0

16:15

14/06/2020

Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition

Linchao Zhu, Yi Yang

Keywords Paper

long-tailed visual recognition, region self-attention, inflated episodic memory, long-tailed video classification

0

0

0

0

1:00

23/08/2020

Time-aware user embeddings as a service

Martin Pavlovski, Jelena Gligorijevic, Ivan Stojkovic and
Shubham Agrawal, Shabhareesh Komirishetty, Djordje Gligorijevic, Narayan Bhamidipati, Zoran Obradovic

Keywords Paper

sequential models, user representation, neural embeddings

0

0

0

0

19:42

14/09/2020

Squeezing Correlated Neurons for Resource-Efficient Deep Neural Networks

Elbruz Ozen, Alex Orailoglu

Keywords Paper

deep learning, information redundancy, pruning

0

0

0

0

14:48

05/01/2021

Mask Selection and Propagation for Unsupervised Video Object Segmentation

Shubhika Garg, Vidit Goel

Keywords Paper

0

0

0

0

4:38

02/02/2021

MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection

Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

Keywords Paper

0

0

0

0

16:48

14/06/2020

Self-Trained Deep Ordinal Regression for End-to-End Video Anomaly Detection

Guansong Pang, Cheng Yan, Chunhua Shen and
Anton van den Hengel, Xiao Bai

Keywords Paper

anomaly detection, deep ordinal regression, human-in-the-loop machine learning, anomaly explanation, self-training, unsupervised representation learning, abnormal activity detection, video learning

0

0

0

0

1:01

14/06/2020

Unsupervised Learning From Video With Deep Neural Embeddings

Chengxu Zhuang, Tianwei She, Alex Andonian and
Max Sobol Mark, Daniel Yamins

Keywords Paper

unsupervised learning, self-supervised learning, video learning, contrastive learning, deep neural networks, action recognition, object recognition, two-pathway models

0

0

0

0

1:01

06/12/2021

TokenLearner: Adaptive Space-Time Tokenization for Videos

Michael S Ryoo, AJ Piergiovanni, Anurag Arnab and
Mostafa Dehghani, Anelia Angelova

Keywords Paper

transformers, representation learning

0

0

0

0

10:26

14/06/2020

EfficientDet: Scalable and Efficient Object Detection

Mingxing Tan, Ruoming Pang, Quoc V. Le

Keywords Paper

object detection, segmentation, automl, neural network, efficient models

0

0

0

0

1:00

22/11/2021

Single-Modal Entropy based Active Learning for Visual Question Answering

Dong-Jin Kim, Jae Won Cho, Jinsoo Choi and
Yunjae Jung, In So Kweon

Keywords Paper

Visual Question Answering, Vision and Language, Active Learning

0

0

0

0

2:42

02/02/2021

Query-Memory Re-Aggregation for Weakly-supervised Video Object Segmentation

Fanchao Lin, Hongtao Xie, Yan Li, Yongdong Zhang

Keywords Paper

0

0

0

0

14:19

14/06/2020

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking

Jin Gao, Weiming Hu, Yan Lu

Keywords Paper

online learning, visual tracking, continual learning, recursive least-squares estimation, deep learning, memory retention, recursive learning, mini-batch sgd, normal equation, mlp layer

0

0

0

0

5:01

06/12/2021

Neural Routing by Memory

Kaipeng Zhang, Zhenqiang Li, Zhifeng Li and
Wei Liu, Yoichi Sato

Keywords Paper

deep learning

0

0

0

0

6:41

23/08/2020

Spectrum-guided adversarial disparity learning

Zhe Liu, Lina Yao, Lei Bai and
Xianzhi Wang, Can Wang

Keywords Paper

adversarial autoencoder, generative models, intraclass variability, activity recognition

0

0

0

0

14:30

05/01/2021

InfoMax-GAN: Improved Adversarial Image Generation via Information Maximization and Contrastive Learning

Kwot Sin Lee, Ngoc-Trung Tran, Ngai-Man Cheung

Keywords Paper

0

0

0

0

5:01

06/12/2020

Delving into the Cyclic Mechanism in Semi-supervised Video Object Segmentation

Yuxi Li, Ning Xu, Jinlong Peng and
John See, Weiyao Lin

Keywords Paper

0

0

0

0

2:56

03/05/2021

VA-RED$^2$: Video Adaptive Redundancy Reduction

Bowen Pan, Rameswar Panda, Camilo L Fosco and
Chung-Ching Lin, Alex J Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogerio Feris

Keywords Paper

0

0

0

0

5:02

06/12/2021

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Ho Kei Cheng, Yu-Wing Tai, Chi-Keung Tang

Keywords Paper

0

0

0

0

14:41

06/12/2020

Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning

Jean-Bastien Grill, Florian Strub, Florent Altché and
Corentin Tallec, Pierre Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, koray kavukcuoglu, Remi Munos, Michal Valko

Keywords Paper

0

0

0

0

3:27

14/06/2020

MUXConv: Information Multiplexing in Convolutional Neural Networks

Zhichao Lu, Kalyanmoy Deb, Vishnu Naresh Boddeti

Keywords Paper

convolutional neural networks, neural architecture search, evolutionary algorithms

0

0

0

0

0:56

05/01/2021

We Don't Need Thousand Proposals: Single Shot Actor-Action Detection in Videos

Aayush J. Rana, Yogesh S. Rawat

Keywords Paper

0

0

0

0

3:53

04/07/2020

Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks

Fynn Schröder, Chris Biemann

Keywords Paper

multi-task tasks, MTL, TL, MTL setups

0

0

0

0

12:02

14/06/2020

MAST: A Memory-Augmented Self-Supervised Tracker

Zihang Lai, Erika Lu, Weidi Xie

Keywords Paper

self-supervised learning, video segmentation, memory-augmented model, video understanding, tracking, unsupervised learning, generalization, attention, representation learning, metric learning

0

0

0

0

1:01

06/12/2021

Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training

Sheng Liu, Xiao Li, Yuexiang Zhai and
Chong You, Zhihui Zhu, Carlos Fernandez-Granda, Qing Qu

Keywords Paper

deep learning, machine learning, robustness, generative model

0

0

0

0

6:45

06/12/2021

Channel Permutations for N:M Sparsity

Jeff Pool, Chong Yu

Keywords Paper

optimization

0

0

0

0

12:41

14/06/2020

Diverse Image Generation via Self-Conditioned GANs

Steven Liu, Tongzhou Wang, David Bau and
Jun-Yan Zhu, Antonio Torralba

Keywords Paper

generative adversarial networks, image synthesis, mode collapse, clustering, unsupervised learning

0

0

0

0

1:00

03/05/2021

DrNAS: Dirichlet Neural Architecture Search

Xiangning Chen, Ruochen Wang, Minhao Cheng and
Xiaocheng Tang, Cho-Jui Hsieh

Keywords Paper

0

0

0

0

5:00

06/12/2020

MMA Regularization: Decorrelating Weights of Neural Networks by Maximizing the Minimal Angles

Zhennan Wang, Canqun Xiang, Wenbin Zou, Chen Xu

Keywords Paper

0

0

0

0

3:23

14/06/2020

Scene-Adaptive Video Frame Interpolation via Meta-Learning

Myungsub Choi, Janghoon Choi, Sungyong Baik and
Tae Hyun Kim, Kyoung Mu Lee

Keywords Paper

video frame interpolation, test-time adaptation, meta-learning, self-supervision, image synthesis, slow motion, motion estimation, error correction, maml, input-adaptive neural network

0

0

0

0

0:55

06/12/2021

MLP-Mixer: An all-MLP Architecture for Vision

Ilya Tolstikhin, Neil Houlsby, Alexander Kolesnikov and
Lucas Beyer, Xiaohua Zhai, Thomas Unterthiner, Jessica Yung, Andreas Steiner, Daniel Keysers, Jakob Uszkoreit, Mario Lucic, Alexey Dosovitskiy

Keywords Paper

deep learning, machine learning, transformers, vision, transfer learning

0

0

0

0

11:18