Object-Centric Representation Learning with Generative Spatial-Temporal Factorization

06/12/2021

Nanbo Li, Muhammad Ahmed Raza, Wenbin Hu, Zhaole Sun, Robert Fisher

Keywords: vision, generative model, representation learning

Abstract: Learning object-centric scene representations is essential for attaining structural understanding and abstraction of complex scenes. Yet, because current approaches to unsupervised object-centric representation learning are built upon either a stationary-observer assumption or a static-scene assumption, they often: i) suffer from single-view spatial ambiguities, or ii) infer object representations from dynamic scenes incorrectly or inaccurately. To address this, we propose Dynamics-aware Multi-Object Network (DyMON), a method that broadens the scope of multi-view object-centric representation learning to dynamic scenes. We train DyMON on multi-view-dynamic-scene data and show that DyMON learns, without supervision, to factorize the entangled effects of observer motions and scene object dynamics from a sequence of observations, and constructs scene object spatial representations suitable for rendering at arbitrary times (querying across time) and from arbitrary viewpoints (querying across space). We also show that the factorized scene representations (w.r.t. objects) support querying about a single object by space and time independently.

The talk and the respective paper are published at the NeurIPS 2021 virtual conference.

Similar Papers

06/12/2020

Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views

Nanbo Li, Cian Eastwood, Robert Fisher

3:19

03/05/2021

Spatially Structured Recurrent Modules

Nasim Rahaman, Anirudh Goyal, Waleed Gondal, Manuel Wuthrich, Stefan Bauer, Yash Sharma, Yoshua Bengio, Bernhard Schoelkopf

Keywords: spatio-temporal modelling, partially observed environments, recurrent neural networks, modular architectures

5:27

03/05/2021

Self-supervised Visual Reinforcement Learning with Object-centric Representations

Andrii Zadaianchuk, Maximilian Seitzer, Georg Martius

Keywords: object-centric representations, visual reinforcement learning, autonomous learning, self-supervision

9:58

06/12/2020

Learning Physical Graph Representations from Visual Scenes

Daniel Bear, Chaofei Fan, Damian Mrowca, Yunzhu Li, Seth Alter, Aran Nayebi, Jeremy Schwartz, Li Fei-Fei, Jiajun Wu, Josh Tenenbaum, Daniel Yamins

3:19

18/07/2021

Reinforcement Learning with Prototypical Representations

Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto

Keywords: Reinforcement Learning and Planning, Deep RL

5:15

06/12/2021

On Contrastive Representations of Stochastic Processes

Emile Mathieu, Adam Foster, Yee Teh

Keywords: machine learning, meta learning, contrastive learning, representation learning

10:59

16/11/2020

CLOUD: Contrastive Learning of Unsupervised Dynamics

Jianren Wang, Yujie Lu, Hang Zhao

5:05

02/02/2021

Deep Metric Learning with Self-Supervised Ranking

Zheren Fu, Yan Li, Zhendong Mao, Quan Wang, Yongdong Zhang

12:36

03/05/2021

Fast And Slow Learning Of Recurrent Independent Mechanisms

Kanika Madan, Nan Rosemary Ke, Anirudh Goyal, Bernhard Schoelkopf, Yoshua Bengio

Keywords: better generalization, modular representations, learning mechanisms

5:09

18/07/2021

Temporal Predictive Coding For Model-Based Planning In Latent Space

Tung Nguyen, Rui Shu, Tuan Pham, Hung Bui, Stefano Ermon

Keywords: Deep Learning, Embedding and Representation learning

5:19

18/07/2021

Structured World Belief for Reinforcement Learning in POMDP

Gautam Singh, Skand Peri, Junghyun Kim, Hyunseok Kim, Sungjin Ahn

Keywords: Deep Learning, Embedding and Representation learning

5:21

02/02/2021

A Continual Learning Framework for Uncertainty-Aware Interactive Image Segmentation

Ervine Zheng, Qi Yu, Rui Li, Pengcheng Shi, Anne Haake

14:21

14/06/2020

Steering Self-Supervised Feature Learning Beyond Local Pixel Statistics

Simon Jenni, Hailin Jin, Paolo Favaro

Keywords: self-supervised, representation learning, inpainting, unsupervised, feature learning, self-supervision, transformations, image statistics

5:01

12/07/2020

Goal-Aware Prediction: Learning to Model What Matters

Suraj Nair, Silvio Savarese, Chelsea Finn

Keywords: Reinforcement Learning - Deep RL

11:16

14/06/2020

Self-Supervised Scene De-Occlusion

Xiaohang Zhan, Xingang Pan, Bo Dai, Ziwei Liu, Dahua Lin, Chen Change Loy

Keywords: de-occlusion, self-supervised, occlusion ordering, scene understanding, amodal completion, inpainting, amodal instance segmentation, decomposition, image editing, manipulation

4:59

06/12/2020

Self-Learning Transformations for Improving Gaze and Head Redirection

Yufeng Zheng, Seonwook Park, Xucong Zhang, Shalini De Mello, Otmar Hilliges

3:20

05/01/2021

Integrating Human Gaze Into Attention for Egocentric Activity Recognition

Kyle Min, Jason J. Corso

4:56

06/12/2021

Object-aware Contrastive Learning for Debiased Scene Representation

Sangwoo Mo, Hyunwoo Kang, Kihyuk Sohn, Chun-Liang Li, Jinwoo Shin

Keywords: self-supervised learning, contrastive learning, representation learning

10:25

05/01/2021

Multi-Frame Recurrent Adversarial Network for Moving Object Segmentation

Prashant W. Patil, Akshay Dudhane, Subrahmanyam Murala

5:00

06/12/2021

Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

Angtian Wang, Shenxiao Mei, Alan Yuille, Adam Kortylewski

Keywords: robustness, vision, few shot learning, semi-supervised learning

14:54

03/05/2021

Disentangled Recurrent Wasserstein Autoencoder

Jun Han, Martin Min, Ligong Han, Li Erran Li, Xuan Zhang

Keywords: Recurrent Generative Model, Sequential Representation Learning, Disentanglement

9:17

05/01/2021

Weakly-Supervised Object Representation Learning for Few-Shot Semantic Segmentation

Xiaowen Ying, Xin Li, Mooi Choo Chuah

5:00

14/06/2020

Spatio-Temporal Graph for Video Captioning With Knowledge Distillation

Boxiao Pan, Haoye Cai, De-An Huang, Kuan-Hui Lee, Adrien Gaidon, Ehsan Adeli, Juan Carlos Niebles

Keywords: video captioning, spatio-temporal graph, video understanding, vision and language, knowledge distillation, transformer, computer vision

1:01

05/01/2021

Towards Contextual Learning in Few-Shot Object Classification

Mathieu Page Fortin, Brahim Chaib-draa

4:57

14/06/2020

Weakly-Supervised Semantic Segmentation via Sub-Category Exploration

Yu-Ting Chang, Qiaosong Wang, Wei-Chih Hung, Robinson Piramuthu, Yi-Hsuan Tsai, Ming-Hsuan Yang

Keywords: weakly-supervised learning, semantic segmentation, class activation map, unsupervised sub-category classification, self-supervised learning

1:00

03/05/2021

Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning

Dong Bok Lee, Dongchan Min, Seanie Lee, Sung Ju Hwang

Keywords: Unsupervised Learning, Variational Autoencoders, Unsupervised Meta-learning, Meta-Learning

13:31

06/12/2021

Meta Internal Learning

Raphael Bensadoun, Shir Gur, Tomer Galanti, Lior Wolf

Keywords: vision, generative model, meta learning

7:41

22/11/2021

Hierarchical Contrastive Motion Learning for Video Action Recognition

Xitong Yang, Xiaodong Yang, Sifei Liu, Deqing Sun, Larry Davis, Jan Kautz

Keywords: action recognition, motion hierarchy, motion representation, contrastive learning

8:29

14/06/2020

Unsupervised Learning for Intrinsic Image Decomposition From a Single Image

Yunfei Liu, Yu Li, Shaodi You, Feng Lu

Keywords: intrinsic image decomposition, unsupervised learning, distribution, priors, independence constraint, physical consistency constraint

1:00

14/06/2020

Adaptive Dilated Network With Self-Correction Supervision for Counting

Shuai Bai, Zhiqun He, Yu Qiao, Hanzhe Hu, Wei Wu, Junjie Yan

Keywords: crowd counting, self-correction, convolutional neural network

0:59

14/06/2020

Focus on Defocus: Bridging the Synthetic to Real Domain Gap for Depth Estimation

Maxim Maximov, Kevin Galim, Laura Leal-Taixé

Keywords: depth estimation, generalisation, depth from focus, blur estimation, depth

1:01

26/04/2020

Contrastive Learning of Structured World Models

Thomas Kipf, Elise van der Pol, Max Welling

Keywords: state representation learning, graph neural networks, model-based reinforcement learning, relational learning, object discovery

14:51

12/07/2020

Automated Synthetic-to-Real Generalization

Wuyang Chen, Zhiding Yu, Zhangyang Wang, Anima Anandkumar

Keywords: Transfer, Multitask and Meta-learning

9:24

06/12/2020

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Masha Itkina, Boris Ivanovic, Ransalu Senanayake, Mykel J Kochenderfer, Marco Pavone

3:39

22/11/2021

Attention to Action: Leveraging Attention for Object Navigation

Shi Chen, Qi Zhao

Keywords: Object-goal Navigation, Attention, Visual Navigation

2:51

22/11/2021

Grid Cell Path Integration For Movement-Based Visual Object Recognition

Niels Leadholm, Marcus Lewis, Subutai Ahmad

Keywords: biologically plausible, translation invariance, robustness, sequential vision, transsaccadic vision, grid cells, path integration, continual learning, predictive representations, Hebbian learning

11:21

02/02/2021

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Ziyi Liu, Le Wang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua

19:49

02/02/2021

Improving Sample Efficiency in Model-Free Reinforcement Learning from Images

Denis Yarats, Amy Zhang, Ilya Kostrikov, Brandon Amos, Joelle Pineau, Rob Fergus

12:19

26/04/2020

Neural Outlier Rejection for Self-Supervised Keypoint Learning

Jiexiong Tang, Hanme Kim, Vitor Guizilini, Sudeep Pillai, Rares Ambrus

Keywords: Self-Supervised Learning, Keypoint Detection, Outlier Rejection, Deep Learning

4:55

06/12/2021

Towards Context-Agnostic Learning Using Synthetic Data

Charles Jin, Martin Rinard

Keywords: machine learning, vision

14:20