Learning Multi-Object Tracking and Segmentation From Automatic Annotations

14/06/2020

Learning Multi-Object Tracking and Segmentation From Automatic Annotations

Lorenzo Porzi, Markus Hofinger, Idoia Ruiz, Joan Serrat, Samuel Rota Bulò, Peter Kontschieder

Keywords: multi-object tracking and segmentation, mots, object tracking, instance segmentation, automatic annotations, deep learning

Abstract Paper Similar Papers

Abstract: In this work we contribute a novel pipeline to automatically generate training data, and to improve over state-of-the-art multi-object tracking and segmentation (MOTS) methods. Our proposed track mining algorithm turns raw street-level videos into high-fidelity MOTS training data, is scalable and overcomes the need of expensive and time-consuming manual annotation approaches. We leverage state-of-the-art instance segmentation results in combination with optical flow predictions, also trained on automatically harvested training data. Our second major contribution is MOTSNet - a deep learning, tracking-by-detection architecture for MOTS - deploying a novel mask-pooling layer for improved object association over time. Training MOTSNet with our automatically extracted data leads to significantly improved sMOTSA scores on the novel KITTI MOTS dataset (+1.9%/+7.5% on cars/pedestrians), and MOTSNet improves by +4.1% over previously best methods on the MOTSChallenge dataset. Our most impressive finding is that we can improve over previous best-performing works, even in complete absence of manually annotated MOTS training data.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at CVPR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

22/11/2021

DRT: Detection Refinement for Multiple Object Tracking

Bisheng Wang, Christian Fruhwirth-Reisinger, Horst Possegger and
Horst Bischof, Guo Cao

Keywords Paper

Multiple Object Tracking, Tracking by Detection, Detection Refinement

0

0

0

0

2:57

06/12/2021

Bootstrap Your Object Detector via Mixed Training

Mengde Xu, Zheng Zhang, Fangyun Wei and
Yutong Lin, Yue Cao, Stephen Lin, Han Hu, Xiang Bai

Keywords Paper

deep learning, robustness, vision

0

0

0

0

4:59

06/12/2021

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

Junnan Li, Ramprasaath Selvaraju, Akhilesh Gotmare and
Shafiq Joty, Caiming Xiong, Steven Chu Hong Hoi

Keywords Paper

transformers, vision, representation learning

0

0

0

0

9:40

05/01/2021

Continual Representation Learning for Biometric Identification

Bo Zhao, Shixiang Tang, Dapeng Chen and
Hakan Bilen, Rui Zhao

Keywords Paper

0

0

0

0

4:36

07/09/2020

Real-Time Semantic Segmentation via Multiply Spatial Fusion Network

Haiyang Si, Zhiqiang Zhang, Feng Lu

Keywords Paper

real-time, semantic segmentation, boundary supervision

0

0

0

0

6:19

14/06/2020

Moving in the Right Direction: A Regularization for Deep Metric Learning

Deen Dayal Mohan, Nishant Sankaran, Dennis Fedorishin and
Srirangaraj Setlur, Venu Govindaraju

Keywords Paper

deep metric learning, regularization, image retrieval.

0

0

0

0

1:00

18/07/2021

Data Augmentation for Meta-Learning

Renkun Ni, Micah Goldblum, Amr Sharaf and
Kezhi Kong, Tom Goldstein

Keywords Paper

Deep Learning

0

0

0

0

5:09

02/02/2021

A Scalable Reasoning and Learning Approach for Neural-Symbolic Stream Fusion

Danh Le-Phuoc, Thomas Eiter, Anh Le-Tuan

Keywords Paper

0

0

0

0

18:49

06/12/2021

Video Instance Segmentation using Inter-Frame Communication Transformers

Sukjun Hwang, Miran Heo, Seoung Wug Oh, Seon Joo Kim

Keywords Paper

transformers

0

0

0

0

10:00

05/01/2021

Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition

Theo Ayral, Marco Pedersoli, Simon Bacon, Eric Granger

Keywords Paper

0

0

0

0

5:00

14/06/2020

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking

Junbo Yin, Wenguan Wang, Qinghao Meng and
Ruigang Yang, Jianbing Shen

Keywords Paper

mot, multi-task learning, motion, affinity, attention, online

0

0

0

0

1:03

05/01/2021

RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization

Alejandro Pardo, Humam Alwassel, Fabian Caba and
Ali Thabet, Bernard Ghanem

Keywords Paper

0

0

0

0

5:01

07/09/2020

Meta-RetinaNet for Few-shot Object Detection

Shaoqi Li, Wenfeng Song, Shuai Li and
Aimin Hao, Hong Qin

Keywords Paper

Few shot, object detection, meta-learning, Meta-RetinaNet, Balanced Loss, coefficient vector

0

0

0

0

8:51

06/12/2021

TokenLearner: Adaptive Space-Time Tokenization for Videos

Michael S Ryoo, AJ Piergiovanni, Anurag Arnab and
Mostafa Dehghani, Anelia Angelova

Keywords Paper

transformers, representation learning

0

0

0

0

10:26

14/06/2020

Self-Supervised Monocular Trained Depth Estimation Using Self-Attention and Discrete Disparity Volume

Adrian Johnston, Gustavo Carneiro

Keywords Paper

self-supervised depth estimation, self-supervised learning, self-attention, depth estimation, uncertainty

0

0

0

0

1:01

14/06/2020

Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition

Linchao Zhu, Yi Yang

Keywords Paper

long-tailed visual recognition, region self-attention, inflated episodic memory, long-tailed video classification

0

0

0

0

1:00

06/12/2020

Deep Transformation-Invariant Clustering

Tom Monnier, Thibault Groueix, Mathieu Aubry

Keywords Paper

0

0

0

0

3:22

02/02/2021

Multi-Decoder Attention Model with Embedding Glimpse for Solving Vehicle Routing Problems

Liang Xin, Wen Song, Zhiguang Cao, Jie Zhang

Keywords Paper

0

0

0

0

18:18

12/07/2020

Revisiting Training Strategies and Generalization Performance in Deep Metric Learning

Karsten Roth, Timo Milbich, Samrath Sinha and
Prateek Gupta, Bjorn Ommer, Joseph Paul Cohen

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:15

01/07/2020

Re-translation versus Streaming for Simultaneous Translation

Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, George Foster

Keywords Paper

0

0

0

0

23:21

06/12/2021

Shifted Chunk Transformer for Spatio-Temporal Representational Learning

Xuefan Zha, Wentao Zhu, Lv Xun and
Sen Yang, Ji Liu

Keywords Paper

machine learning, transformers, vision, language

0

0

0

0

6:14

22/11/2021

Robust Semantic Segmentation with Superpixel-Mix

Gianni Franchi, Nacim Belkhir, Mai Lan Ha and
Yufei Hu, Andrei Bursuc, Volker Blanz, Angela Yao

Keywords Paper

robust AI, uncertainty, semantic segmentation, semi supervised learning, mathematical morphology

0

0

0

0

2:54

02/02/2021

Augmented Partial Mutual Learning with Frame Masking for Video Captioning

Ke Lin, Zhuoxin Gan, Liwei Wang

Keywords Paper

0

0

0

0

16:57

06/12/2021

Unsupervised Object-Level Representation Learning from Scene Images

Jiahao Xie, Xiaohang Zhan, Ziwei Liu and
Yew Soon Ong, Chen Change Loy

Keywords Paper

self-supervised learning, representation learning

0

0

0

0

5:01

22/11/2021

FAST3D: Flow-Aware Self-Training for 3D Object Detectors

Christian Fruhwirth-Reisinger, Michael Opitz, Horst Possegger, Horst Bischof

Keywords Paper

unsupervised domain adaptation, self-training, 3D object detection, scene flow, LiDAR point cloud, source-free domain adaptation

0

0

0

0

3:06

05/01/2021

Intra-Class Part Swapping for Fine-Grained Image Classification

Lianbo Zhang, Shaoli Huang, Wei Liu

Keywords Paper

0

0

0

0

4:43

14/06/2020

TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model

Bo Pang, Yizhuo Li, Yifan Zhang and
Muchen Li, Cewu Lu

Keywords Paper

bounding-tube, mot, one-stage, tube-nms, fcn

0

0

0

0

4:55

22/11/2021

Single-Modal Entropy based Active Learning for Visual Question Answering

Dong-Jin Kim, Jae Won Cho, Jinsoo Choi and
Yunjae Jung, In So Kweon

Keywords Paper

Visual Question Answering, Vision and Language, Active Learning

0

0

0

0

2:42

05/01/2021

Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization

Sadbhavana Babar, Sukhendu Das

Keywords Paper

0

0

0

0

5:01

02/02/2021

CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation

Yang Fu, Linjie Yang, Ding Liu and
Thomas S. Huang, Humphrey Shi

Keywords Paper

0

0

0

0

16:24

14/06/2020

Generalized Product Quantization Network for Semi-Supervised Image Retrieval

Young Kyun Jang, Nam Ik Cho

Keywords Paper

image retrieval, vector quantization, semi-supervised learning

0

0

0

0

1:01

14/06/2020

Memory Aggregation Networks for Efficient Interactive Video Object Segmentation

Jiaxu Miao, Yunchao Wei, Yi Yang

Keywords Paper

interactive video object segmentation, pixel embedding learning, memory aggregation networks

0

0

0

0

0:59

06/12/2021

SOLQ: Segmenting Objects by Learning Queries

Bin Dong, Fangao Zeng, Tiancai Wang and
Xiangyu Zhang, Yichen Wei

Keywords Paper

machine learning, transformers

0

0

0

0

7:12

17/08/2020

Consistent video depth estimation

Xuan Luo, Jia-Bin Huang, Richard Szeliski and
Kevin Matzen, Johannes Kopf

Keywords Paper

video, depth estimation

0

0

0

1

12:43

06/12/2020

Online Decision Based Visual Tracking via Reinforcement Learning

ke Song, Wei Zhang, Ran Song, Yibin Li

Keywords Paper

0

0

0

0

3:30

06/12/2020

An Unsupervised Information-Theoretic Perceptual Quality Metric

Sangnie Bhardwaj, Ian Fischer, Johannes Ballé, Troy Chinen

Keywords Paper

0

0

0

0

3:08

06/12/2021

Focal Attention for Long-Range Interactions in Vision Transformers

Jianwei Yang, Chunyuan Li, Pengchuan Zhang and
Xiyang Dai, Bin Xiao, Lu Yuan, Jianfeng Gao

Keywords Paper

machine learning, transformers, vision

0

0

0

0

14:39

05/01/2021

Mask Selection and Propagation for Unsupervised Video Object Segmentation

Shubhika Garg, Vidit Goel

Keywords Paper

0

0

0

0

4:38

26/04/2020

Computation Reallocation for Object Detection

Feng Liang, Chen Lin, Ronghao Guo and
Ming Sun, Wei Wu, Junjie Yan, Wanli Ouyang

Keywords Paper

Neural Architecture Search, Object Detection

0

0

0

0

5:29

18/07/2021

Improved Denoising Diffusion Probabilistic Models

Alexander Nichol, Prafulla Dhariwal

Keywords Paper

Deep Learning, Generative Models, Theory, Game Theory and Computational Economics, Reinforcement Learning and Planning, Multi-Agent RL

0

0

0

0

4:25