Unsupervised object-centric video generation and decomposition in 3D

06/12/2020

Unsupervised object-centric video generation and decomposition in 3D

Paul Henderson, Christoph Lampert

Keywords:

Abstract Paper Similar Papers

Abstract: A natural approach to generative modeling of videos is to represent them as a composition of moving objects. Recent works model a set of 2D sprites over a slowly-varying background, but without considering the underlying 3D scene that gives rise to them. We instead propose to model a video as the view seen while moving through a scene with multiple 3D objects and a 3D background. Our model is trained from monocular videos without any supervision, yet learns to generate coherent 3D scenes containing several moving objects. We conduct detailed experiments on two datasets, going beyond the visual complexity supported by state-of-the-art generative approaches. We evaluate our method on depth-prediction and 3D object detection---tasks which cannot be addressed by those earlier works---and show it out-performs them even on 2D instance segmentation and tracking.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

SynSin: End-to-End View Synthesis From a Single Image

Olivia Wiles, Georgia Gkioxari, Richard Szeliski, Justin Johnson

Keywords Paper

single image view synthesis, view synthesis, differentiable rendering, point cloud, convolutional neural networks, generative networks

0

0

0

0

4:58

07/09/2020

Making a Case for 3D Convolutions for Object Segmentation in Videos

Sabarinath Mahadevan, Ali Athar, Aljosa Osep and
Laura Leal-Taixé, Bastian Leibe, Sebastian Hennen

Keywords Paper

object tracking, video segmentation, video object segmentation, video scene understanding, object segmentation

0

0

0

0

8:16

17/08/2020

Unpaired motion style transfer from video to animation

Kfir Aberman, Yijia Weng, Dani Lischinski and
Daniel Cohen-Or, Baoquan Chen

Keywords Paper

style transfer, motion analysis

0

0

0

0

16:08

14/06/2020

Single-View View Synthesis With Multiplane Images

Richard Tucker, Noah Snavely

Keywords Paper

view synthesis, monocular, multiplane image, image-based rendering, 3d deep learning, scale invariance

0

0

0

0

1:01

14/06/2020

Towards Unsupervised Learning of Generative Models for 3D Controllable Image Synthesis

Yiyi Liao, Katja Schwarz, Lars Mescheder, Andreas Geiger

Keywords Paper

image synthesis, generative adversarial network, 3d controllability, unsupervised learning, 3d representation, disentangled representation, differentiable rendering, neural rendering

0

0

0

0

1:01

14/06/2020

DeepFaceFlow: In-the-Wild Dense 3D Facial Motion Estimation

Mohammad Rami Koujan, Anastasios Roussos, Stefanos Zafeiriou

Keywords Paper

3d flow, dense 3d facial motion capture, optical flow, scene flow, 3d reconstruction and tracking, in-the-wild monocular tracking, facial reenactment, expression recognition, performance capture, non-rigid facial deformations

0

0

0

0

1:01

22/11/2021

GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes

Youssef Alami Mejjati, Isa Milefchik, Aaron K Gokaslan and
Oliver Wang, Kwang In Kim, James Tompkin

Keywords Paper

structured representation, 3D representation, 3D Gaussians, image generation, image synthesis, image editing, controlled generation, GANs

0

0

0

0

2:49

18/07/2021

Unsupervised Co-part Segmentation through Assembly

Qingzhe Gao, Bin Wang, Libin Liu, Baoquan Chen

Keywords Paper

Applications, Computer Vision

0

0

0

0

5:01

18/07/2021

NeRF-VAE: A Geometry Aware 3D Scene Generative Model

Adam Kosiorek, Heiko Strathmann, Daniel Zoran and
Pol Moreno, Rosalia Schneider, Sona Mokra, Danilo J. Rezende

Keywords Paper

Deep Learning, Generative Models

0

0

0

0

17:23

17/08/2020

Example-driven virtual cinematography by learning camera behaviors

Hongda Jiang, Bin Wang, Xi Wang and
Marc Christie, Baoquan Chen

Keywords Paper

camera behaviors, machine learning, virtual cinematography

0

0

0

0

14:17

03/05/2021

gradSim: Differentiable simulation for system identification and visuomotor control

Krishna Murthy Jatavallabhula, Miles Macklin, Florian Golemo and
Vikram Voleti, Linda Petrini, Martin Weiss, Breandan Considine, Jérôme Parent-Lévesque, Kevin Xie, Kenny Erleben, Liam Paull, Florian Shkurti, Derek Nowrouzezahrai, Sanja Fidler

Keywords Paper

3D scene understanding, Physical parameter estimation, System identification, Differentiable simulation, Differentiable physics, Differentiable rendering, 3D vision

0

0

0

0

5:01

22/11/2021

Learning to Deblur and Rotate Motion-Blurred Faces

Givi Meishvili, Attila Szabo, Simon Jenni, Paolo Favaro

Keywords Paper

deblurring, face, multi-view, video, blur, GAN, novel view synthesis, inversion, deep learning, dataset

0

0

0

0

2:50

30/11/2020

3D Object Detection and Pose Estimation of Unseen Objects in Color Images with Local Surface Embeddings

Giorgia Pitteri, Aureélie Bugeau, Slobodan Ilic, Vincent Lepetit

Keywords Paper

0

0

0

0

9:17

06/12/2020

Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views

Nanbo Li, Cian Eastwood, Robert Fisher

Keywords Paper

0

0

0

0

3:19

06/12/2020

BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images

Thu Nguyen-Phuoc, Christian Richardt, Long Mai and
Yongliang Yang, Niloy Mitra

Keywords Paper

0

0

0

0

3:24

06/12/2021

Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image

Feng Liu, Xiaoming Liu

Keywords Paper

vision

0

0

0

0

9:19

02/02/2021

Learning to Sit: Synthesizing Human-Chair Interactions via Hierarchical Control

Yu-Wei Chao, Jimei Yang, Weifeng Chen, Jia Deng

Keywords Paper

0

0

0

0

19:45

18/07/2021

Unsupervised Learning of Visual 3D Keypoints for Control

Boyuan Chen, Pieter Abbeel, Deepak Pathak

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:16

06/12/2021

TransformerFusion: Monocular RGB Scene Reconstruction using Transformers

Aljaz Bozic, Pablo Palafox, Justus Thies and
Angela Dai, Matthias Niessner

Keywords Paper

transformers

0

0

0

0

7:14

16/11/2020

Attentional Separation-and-Aggregation Network for Self-supervised Depth-Pose Learning in Dynamic Scenes

Feng Gao, Jincheng Yu, Hao Shen and
Yu Wang, Huazhong Yang

Keywords Paper

0

0

0

0

4:39

16/11/2020

Self-Supervised 3D Keypoint Learning for Ego-Motion Estimation

Jiexiong Tang, Rareș Ambruș, Vitor Guizilini and
Sudeep Pillai, Hanme Kim, Patric Jensfelt, Adrien Gaidon

Keywords Paper

0

0

0

0

5:05

14/06/2020

Multi-Path Learning for Object Pose Estimation Across Domains

Martin Sundermeyer, Maximilian Durner, En Yen Puang and
Zoltan-Csaba Marton, Narunas Vaskevicius, Kai O. Arras, Rudolph Triebel

Keywords Paper

object pose estimation, encodings, multi object, synthetic data, symmetries, autoencoder, embedding, 6d object detection, t-less, relative pose estimation

0

0

0

0

1:01

30/11/2020

Novel-View Human Action Synthesis

Mohamed Ilyes Lakhal, Davide Boscaini, Fabio Poiesi and
Oswald Lanz, Andrea Cavallaro

Keywords Paper

0

0

0

0

4:34

06/12/2020

Self-Learning Transformations for Improving Gaze and Head Redirection

Yufeng Zheng, Seonwook Park, Xucong Zhang and
Shalini De Mello, Otmar Hilliges

Keywords Paper

0

0

0

0

3:20

14/06/2020

From Image Collections to Point Clouds With Self-Supervised Shape and Pose Networks

K L Navaneet, Ansu Mathew, Shashank Kashyap and
Wei-Chih Hung, Varun Jampani, R. Venkatesh Babu

Keywords Paper

3d reconstruction, single image reconstruction, self supervised, point clouds, unsupervised, 2d to 3d, image collections

0

0

0

0

1:01

14/06/2020

Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image Synthesis

Jogendra Nath Kundu, Siddharth Seth, Varun Jampani and
Mugalodi Rakesh, R. Venkatesh Babu, Anirban Chakraborty

Keywords Paper

3d human pose estimation, self-supervised learning, disentangling factors of variation, human puppet model, pose transfer, novel view synthesis, human part segmentation

0

0

0

0

5:00

16/11/2020

3D-OES: Viewpoint-Invariant Object-Factorized Environment Simulators

Hsiao-Yu Tung, Zhou Xian, Mihir Prabhudesai and
Shamit Lal, Katerina Fragkiadaki

Keywords Paper

0

0

0

0

5:03

14/06/2020

SG-NN: Sparse Generative Neural Networks for Self-Supervised Scene Completion of RGB-D Scans

Angela Dai, Christian Diller, Matthias Nießner

Keywords Paper

3d vision, self-supervised training, generative 3d learning, 3d reconstruction

0

0

0

0

1:00

14/06/2020

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

Yancheng Wang, Yang Xiao, Fu Xiong and
Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

Keywords Paper

3d action recognition, point cloud, 3d motion, temporal rank pooling, pointnet++, multi-stream network

0

0

0

0

1:01

19/08/2021

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild (Extended Abstract)

Shangzhe Wu, Christian Rupprecht, Andrea Vedaldi

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Computational Photography, Photometry, Shape from X

0

0

0

0

8:56

26/04/2020

CATER: A diagnostic dataset for Compositional Actions & TEmporal Reasoning

Rohit Girdhar, Deva Ramanan

Keywords Paper

Video Understanding, Temporal Reasoning

0

0

0

0

14:56

03/05/2021

Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering

Yuxuan Zhang, Wenzheng Chen, Huan Ling and
Jun Gao, Yinan Zhang, Antonio Torralba, Sanja Fidler

Keywords Paper

GANs, inverse graphics, Differentiable rendering

0

0

0

0

10:15

06/12/2021

Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

Angtian Wang, Shenxiao Mei, Alan Yuille, Adam Kortylewski

Keywords Paper

robustness, vision, few shot learning, semi-supervised learning

0

0

0

0

14:54

14/06/2020

Time Flies: Animating a Still Image With Time-Lapse Video As Reference

Chia-Chi Cheng, Hung-Yu Chen, Wei-Chen Chiu

Keywords Paper

time-lapse video animation, self-supervised learning, style transfer, temporal consistency

0

0

0

0

1:01

14/06/2020

LatentFusion: End-to-End Differentiable Reconstruction and Rendering for Unseen Object Pose Estimation

Keunhong Park, Arsalan Mousavian, Yu Xiang, Dieter Fox

Keywords Paper

pose estimation, pose, neural rendering, zero-shot, shape learning, 3d reconstruction, datasets, generative models, multi-view, robotics

0

0

0

0

1:01

14/06/2020

JA-POLS: A Moving-Camera Background Model via Joint Alignment and Partially-Overlapping Local Subspaces

Irit Chelly, Vlad Winter, Dor Litvak and
David Rosen, Oren Freifeld

Keywords Paper

background subtraction, video analysis, computer vision, machine learning, robust pca, deep learning, moving camera, transfer learning, video surveillance, lie groups

0

0

0

0

1:00

22/11/2021

Unsupervised computation of salient motion maps from the interpretation of a frame-based classification network

Etienne Meunier, Patrick Bouthemy

Keywords Paper

Motion saliency, motion segmentation, interpretation neural network, LRP

0

0

0

0

2:44

05/01/2021

Learning to Generate Dense Point Clouds With Textures on Multiple Categories

Tao Hu, Geng Lin, Zhizhong Han, Matthias Zwicker

Keywords Paper

0

0

0

0

4:57

30/11/2020

Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection

Erli Ouyang, Li Zhang, Mohan Chen and
Anurag Arnab, Yanwei Fu

Keywords Paper

0

0

0

0

6:30

22/11/2021

CamLessMonoDepth: Monocular Depth Estimation with Unknown Camera Parameters

Sai Shyam Chanduri, Igor Vozniak, Zeeshan Khan Suri

Keywords Paper

monocular depth estimation, self-supervised learning, single-camera egomotion, camera intrinsics estimation, sub-pixel convolutions, uncertainty estimation

0

0

0

0

3:02