Learning Disentangled Representations of Videos with Missing Data

06/12/2020

Learning Disentangled Representations of Videos with Missing Data

Armand Comas, Chi Zhang, Zlatan Feric, Octavia Camps, Rose Yu

Keywords:

Abstract Paper Similar Papers

Abstract: Missing data poses significant challenges while learning representations of video sequences. We present Disentangled Imputed Video autoEncoder (DIVE), a deep generative model that imputes and predicts future video frames in the presence of missing data. Specifically, DIVE introduces a missingness latent variable, disentangles the hidden video representations into static and dynamic appearance, pose, and missingness factors for each object, while it imputes each object trajectory where data is missing. On a moving MNIST dataset with various missing scenarios, DIVE outperforms the state of the art baselines by a substantial margin. We also present comparisons on a real-world MOTSChallenge pedestrian dataset, which demonstrates the practical value of our method in a more realistic setting. Our code can be found in https://github.com/Rose-STL-Lab/DIVE.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

05/01/2021

Towards Visually Explaining Video Understanding Networks With Perturbation

Zhenqiang Li, Weimin Wang, Zuoyue Li and
Yifei Huang, Yoichi Sato

Keywords Paper

0

0

0

0

4:53

05/01/2021

Only Time Can Tell: Discovering Temporal Data for Temporal Modeling

Laura Sevilla-Lara, Shengxin Zha, Zhicheng Yan and
Vedanuj Goswami, Matt Feiszli, Lorenzo Torresani

Keywords Paper

0

0

0

0

4:14

05/01/2021

Multi-Frame Recurrent Adversarial Network for Moving Object Segmentation

Prashant W. Patil, Akshay Dudhane, Subrahmanyam Murala

Keywords Paper

0

0

0

0

5:00

02/02/2021

Weakly-supervised Temporal Action Localization by Uncertainty Modeling

Pilhyeon Lee, Jinglu Wang, Yan Lu, Hyeran Byun

Keywords Paper

0

0

0

0

14:01

06/12/2020

Why Normalizing Flows Fail to Detect Out-of-Distribution Data

Polina Kirichenko, Pavel Izmailov, Andrew Wilson

Keywords Paper

0

0

0

0

3:20

14/06/2020

Scale-Space Flow for End-to-End Optimized Video Compression

Eirikur Agustsson, David Minnen, Nick Johnston and
Johannes Ballé, Sung Jin Hwang, George Toderici

Keywords Paper

learned video compression, scale-space flow, bilinear warping

0

0

0

0

0:55

06/12/2021

Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations

Shasha Li, Abhishek Aich, Shitong Zhu and
Salman Asif, Chengyu Song, Amit Roy-Chowdhury, Srikanth Krishnamurthy

Keywords Paper

machine learning, adversarial robustness and security, vision

0

0

0

0

11:16

22/11/2021

Gradient Frequency Modulation for Visually Explaining Video Understanding Models

Xin Miao Lin, Wentao Bao, Matthew Wright, Yu Kong

Keywords Paper

model explanation, model explainability, explainable AI, video action recognition, Discrete Fourier Transform, video perturbation, interpretable machine learning, video model explanation, frequency modulation, spatiotemporal consistency

0

0

0

0

2:53

22/11/2021

SVD-GAN for Real-Time Unsupervised Video Anomaly Detection

Dinesh Jackson Samuel, Fabio Cuzzolin

Keywords Paper

Unsupervised anomaly detection, SVD-GAN, depth-wise separable convolutions, spatiotemporal features, GAN convergence, Singular Value Decomposition loss, GAN reconstruction, lightweight GAN model, minimized KL divergence

0

0

0

0

2:54

02/02/2021

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Ziyi Liu, Le Wang, Wei Tang and
Junsong Yuan, Nanning Zheng, Gang Hua

Keywords Paper

0

0

0

0

19:49

30/11/2020

dpVAEs: Fixing Sample Generation for Regularized VAEs

Riddhish Bhalodia, Iain Lee, Shireen Elhabian

Keywords Paper

0

0

0

0

7:54

03/05/2021

gradSim: Differentiable simulation for system identification and visuomotor control

Krishna Murthy Jatavallabhula, Miles Macklin, Florian Golemo and
Vikram Voleti, Linda Petrini, Martin Weiss, Breandan Considine, Jérôme Parent-Lévesque, Kevin Xie, Kenny Erleben, Liam Paull, Florian Shkurti, Derek Nowrouzezahrai, Sanja Fidler

Keywords Paper

3D scene understanding, Physical parameter estimation, System identification, Differentiable simulation, Differentiable physics, Differentiable rendering, 3D vision

0

0

0

0

5:01

07/09/2020

Uncovering Hidden Challenges in Query-Based Video Moment Retrieval

Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkila

Keywords Paper

video moment retrieval, temporal sentence grounding, dataset analysis, negative result

0

0

0

0

5:17

22/11/2021

Spatial-Temporal Residual Aggregation for High Resolution Video Inpainting

Vishnu Sanjay Ramiya Srinivasan, Rui Ma, Qiang Tang and
Zili Yi, Zhan Xu

Keywords Paper

high resolution video inpainting, spatial-temporal aggregation, residual aggregation, spatial-temporal attention, image alignment

0

0

0

0

2:58

19/08/2021

Learning Implicit Temporal Alignment for Few-shot Video Classification

Songyang Zhang, Jiale Zhou, Xuming He

Keywords Paper

Computer Vision, Action Recognition, Deep Learning

0

0

0

0

6:20

06/12/2021

Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning

Kento Nozawa, Issei Sato

Keywords Paper

machine learning, representation learning

0

0

0

0

8:50

26/04/2020

Efficient and Information-Preserving Future Frame Prediction and Beyond

Wei Yu, Yichao Lu, Steve Easterbrook, Sanja Fidler

Keywords Paper

self-supervised learning, generative pre-training, video prediction, reversible architecture

0

0

0

0

4:18

14/06/2020

TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model

Bo Pang, Yizhuo Li, Yifan Zhang and
Muchen Li, Cewu Lu

Keywords Paper

bounding-tube, mot, one-stage, tube-nms, fcn

0

0

0

0

4:55

14/06/2020

Learning Event-Based Motion Deblurring

Zhe Jiang, Yu Zhang, Dongqing Zou and
Jimmy Ren, Jiancheng Lv, Yebin Liu

Keywords Paper

deblur, event camera, video reconstruction, image restoration, low-level vision, neural networks, adversarial training, adaptive sampling, supervised learning, dynamic vision sensor

0

0

0

0

1:01

22/11/2021

ERA: Entity–relationship Aware Video Summarization with Wasserstein GAN

Guande Wu, Jianzhe Peter Lin, Claudio Silva

Keywords Paper

video summarization, spatio-temporal graph neural network

0

0

0

0

2:59

26/04/2020

Self-labelling via simultaneous clustering and representation learning

Asano YM., Rupprecht C., Vedaldi A.

Keywords Paper

self-supervision, feature representation learning, clustering

0

0

0

0

4:57

26/04/2020

Adversarially Robust Representations with Smooth Encoders

Taylan Cemgil, Sumedh Ghaisas, Krishnamurthy (Dj) Dvijotham, Pushmeet Kohli

Keywords Paper

Adversarial Learning, Robust Representations, Variational AutoEncoder, Wasserstein Distance, Variational Inference

0

0

0

0

5:16

14/06/2020

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking

Jin Gao, Weiming Hu, Yan Lu

Keywords Paper

online learning, visual tracking, continual learning, recursive least-squares estimation, deep learning, memory retention, recursive learning, mini-batch sgd, normal equation, mlp layer

0

0

0

0

5:01

06/12/2020

Focus of Attention Improves Information Transfer in Visual Features

Matteo Tiezzi, Stefano Melacci, Alessandro Betti and
Marco Maggini, Marco Gori

Keywords Paper

0

0

0

0

3:17

06/12/2021

Streaming Linear System Identification with Reverse Experience Replay

Suhas Kowshik, Dheeraj Nagaraj, Prateek Jain, Praneeth Netrapalli

Keywords Paper

optimization, reinforcement learning and planning

1

0

0

0

14:17

03/05/2021

Sequential Density Ratio Estimation for Simultaneous Optimization of Speed and Accuracy

Akinori Ebihara, Taiki Miyagawa, Kazuyuki Sakurai, Hitoshi Imaoka

Keywords Paper

Density ratio estimation, Early classification, Sequential probability ratio test

0

0

0

0

9:55

02/02/2021

Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation

Daizong Liu, Shuangjie Xu, Xiao-Yang Liu and
Zichuan Xu, Wei Wei, Pan Zhou

Keywords Paper

0

0

0

0

14:42

06/12/2021

Detecting Moments and Highlights in Videos via Natural Language Queries

Jie Lei, Tamara L Berg, Mohit Bansal

Keywords Paper

transformers

0

0

0

0

13:12

22/11/2021

Conditional Model Selection for Efficient Video Understanding

Mihir Jain, Haitam Ben Yahia, Amir Ghodrati and
Amirhossein Habibian, Fatih Porikli

Keywords Paper

action recognition, efficient classification, efficient localization, conditional compute

0

0

0

0

2:49

03/05/2021

Revisiting Locally Supervised Learning: an Alternative to End-to-end Training

Yulin Wang, Zanlin Ni, Shiji Song and
Le Yang, Gao Huang

Keywords Paper

Deep learning, Locally supervised training

1

0

0

1

5:03

06/12/2021

Noise2Score: Tweedie’s Approach to Self-Supervised Image Denoising without Clean Images

Kwanyoung Kim, Jong Chul Ye

Keywords Paper

0

0

0

0

8:28

19/08/2021

Detecting Deepfake Videos with Temporal Dropout 3DCNN

Daichi Zhang, Chenyu Li, Fanzhao Lin and
Dan Zeng, Shiming Ge

Keywords Paper

Computer Vision, Biometrics, Face and Gesture Recognition, Fairness, Surveillance, Manipulation of People

0

0

0

0

8:30

14/06/2020

Multi-Scale Interactive Network for Salient Object Detection

Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Keywords Paper

saliency detection, salient object detection, feature interaction strategy, scale-insensitive loss, multi-scale features, multi-level features, fully convolutional network, deep learning

0

0

0

0

1:01

14/06/2020

Unsupervised Learning From Video With Deep Neural Embeddings

Chengxu Zhuang, Tianwei She, Alex Andonian and
Max Sobol Mark, Daniel Yamins

Keywords Paper

unsupervised learning, self-supervised learning, video learning, contrastive learning, deep neural networks, action recognition, object recognition, two-pathway models

0

0

0

0

1:01

03/05/2021

Self-Supervised Learning of Compressed Video Representations

Youngjae Yu, Sangho Lee, Gunhee Kim, Yale Song

Keywords Paper

self-supervised learning, Compressed videos

0

0

0

0

4:34

06/12/2021

Dense Unsupervised Learning for Video Segmentation

Nikita Araslanov, Simone Schaub-Meyer, Stefan Roth

Keywords Paper

self-supervised learning, representation learning

0

0

0

0

13:34

14/06/2020

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

Ruixu Liu, Ju Shen, He Wang and
Chen Chen, Sen-ching Cheung, Vijayan Asari

Keywords Paper

3d human pose, attention mechanism, multi-scale dilation convolution, monocular motion reconstruction

0

0

0

0

5:01

22/11/2021

Knowing What, Where and When to Look: Video Action modelling with Attention

Juan-Manuel Perez-Rua, Brais Martinez, Xiatian Zhu and
Antoine S Toisoul, Victor A Escorcia, Tao Xiang

Keywords Paper

Action recognition, Fine-grained action, video attention, Spatial attention, Channel attention, Temporal attention, Spatio-temporal attention, Feature refinement

0

0

0

0

2:46

07/09/2020

Semantically Adaptive Image-to-image Translation for Domain Adaptation of Semantic Segmentation

Luigi Musto, Andrea Zinelli

Keywords Paper

domain adaptation, semantic segmentation, image-to-image translation, generative models, image translation

0

0

0

0

4:32

06/12/2021

Low-Fidelity Video Encoder Optimization for Temporal Action Localization

Mengmeng Xu, Juan Manuel Perez Rua, Xiatian Zhu and
Bernard Ghanem, Brais Martinez

Keywords Paper

optimization, machine learning, transfer learning

0

0

0

0

14:34