BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images

Abstract: We present BlockGAN, an image generative model that learns object-aware 3D scene representations directly from unlabelled 2D images. Current work on scene representation learning either ignores scene background or treats the whole scene as one object. Meanwhile, work that considers scene compositionality treats scene objects only as image patches or 2D layers with alpha maps. Inspired by the computer graphics pipeline, we design BlockGAN to learn to first generate 3D features of background and foreground objects, then combine them into 3D features for the whole scene, and finally render them into realistic images. This allows BlockGAN to reason over occlusion and interaction between objects’ appearance, such as shadow and lighting, and provides control over each object’s 3D pose and identity, while maintaining image realism. BlockGAN is trained end-to-end, using only unlabelled single images, without the need for 3D geometry, pose labels, object masks, or multiple views of the same scene. Our experiments show that using explicit 3D features to represent objects allows BlockGAN to learn disentangled representations both in terms of objects (foreground and background) and their properties (pose and identity).

BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images

Thu Nguyen-Phuoc, Christian Richardt, Long Mai, Yongliang Yang, Niloy Mitra

Comments

Similar Papers

3D Object Detection and Pose Estimation of Unseen Objects in Color Images with Local Surface Embeddings

Giorgia Pitteri, Aureélie Bugeau, Slobodan Ilic, Vincent Lepetit

Keywords Abstract Paper

From Image Collections to Point Clouds With Self-Supervised Shape and Pose Networks

K L Navaneet, Ansu Mathew, Shashank Kashyap and Wei-Chih Hung, Varun Jampani, R. Venkatesh Babu

Keywords Abstract Paper

3d reconstruction, single image reconstruction, self supervised, point clouds, unsupervised, 2d to 3d, image collections

Shape-Pose Ambiguity in Learning 3D Reconstruction from Images

Yunjie Wu, Zhengxing Sun, Youcheng Song and Yunhan Sun, YiJie Zhong, Jinlong Shi

Keywords Abstract Paper

Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

Angtian Wang, Shenxiao Mei, Alan Yuille, Adam Kortylewski

Keywords Abstract Paper

robustness, vision, few shot learning, semi-supervised learning

SG-NN: Sparse Generative Neural Networks for Self-Supervised Scene Completion of RGB-D Scans

Angela Dai, Christian Diller, Matthias Nießner

Keywords Abstract Paper

3d vision, self-supervised training, generative 3d learning, 3d reconstruction

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild (Extended Abstract)

Shangzhe Wu, Christian Rupprecht, Andrea Vedaldi

Keywords Abstract Paper

Computer Vision, 2D and 3D Computer Vision, Computational Photography, Photometry, Shape from X

Unsupervised Learning of Intrinsic Structural Representation Points

Nenglun Chen, Lingjie Liu, Zhiming Cui and Runnan Chen, Duygu Ceylan, Changhe Tu, Wenping Wang

Keywords Abstract Paper

3d point cloud learning, structure point, unsupervised learning

Scene Recomposition by Learning-Based ICP

Hamid Izadinia, Steven M. Seitz

Keywords Abstract Paper

3d scene recomposition, 3d scene reconstruction, deep reinforcement learning, learning-based icp (licp), 3d geometry learning, 3d cad models, 3d shapes, room layout estimation, 3d geometry deep network, iterative closest point

NeRF-VAE: A Geometry Aware 3D Scene Generative Model

Adam Kosiorek, Heiko Strathmann, Daniel Zoran and Pol Moreno, Rosalia Schneider, Sona Mokra, Danilo J. Rezende

Keywords Abstract Paper

Deep Learning, Generative Models

Learning 3D Face Reconstruction with a Pose Guidance Network

Pengpeng Liu, Xintong Han, Michael Lyu and Irwin King, Jia Xu

Keywords Abstract Paper

GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes

Youssef Alami Mejjati, Isa Milefchik, Aaron K Gokaslan and Oliver Wang, Kwang In Kim, James Tompkin

Keywords Abstract Paper

structured representation, 3D representation, 3D Gaussians, image generation, image synthesis, image editing, controlled generation, GANs

Learning Complex 3D Human Self-Contact

Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata and Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu

Keywords Abstract Paper

Deductive Learning for Weakly-Supervised 3D Human Pose Estimation via Uncalibrated Cameras

Xipeng Chen, Pengxu Wei, Liang Lin

Keywords Abstract Paper

Leveraging SE(3) Equivariance for Self-supervised Category-Level Object Pose Estimation from Point Clouds

Xiaolong Li, Yijia Weng, Li Yi and Leonidas Guibas, A. Abbott, Shuran Song, He Wang

Keywords Abstract Paper

self-supervised learning, vision

Sharf: Shape-conditioned Radiance Fields from a Single View

Konstantinos Rematas, Ricardo Martin-Brualla, Vittorio Ferrari

Keywords Abstract Paper

Applications, Computer Vision

Lighthouse: Predicting Lighting Volumes for Spatially-Coherent Illumination

Pratul P. Srinivasan, Ben Mildenhall, Matthew Tancik and Jonathan T. Barron, Richard Tucker, Noah Snavely

Keywords Abstract Paper

lighting estimation, relighting, object insertion, deep learning, augmented reality, view synthesis, 3d convolutional network

UCSG-NET- Unsupervised Discovering of Constructive Solid Geometry Tree

Kacper Kania, Maciej Zieba, Tomasz Kajdanowicz

Keywords Abstract Paper

DRWR: A Differentiable Renderer without Rendering for Unsupervised 3D Structure Learning from Silhouette Images

Zhizhong Han, Chao Chen, Yushen Liu, Matthias Zwicker

Keywords Abstract Paper

Applications - Computer Vision

BSP-Net: Generating Compact Meshes via Binary Space Partitioning

Zhiqin Chen, Andrea Tagliasacchi, Hao Zhang

Keywords Abstract Paper

generative neural network, 3d shape, polygonal mesh, binary space partitioning, convex decomposition, structured single view reconstruction

Self-Supervised Learning of Interpretable Keypoints From Unlabelled Videos

Tomas Jakab, Ankush Gupta, Hakan Bilen, Andrea Vedaldi

Keywords Abstract Paper

self-supervised, unsupervised, keypoints, landmarks, pose, videos, adversarial, gan, disentanglement, factorizations

Novel Object Viewpoint Estimation Through Reconstruction Alignment

Mohamed El Banani, Jason J. Corso, David F. Fouhey

Keywords Paper

K L Navaneet, Ansu Mathew, Shashank Kashyap and
Wei-Chih Hung, Varun Jampani, R. Venkatesh Babu

Keywords Paper

Yunjie Wu, Zhengxing Sun, Youcheng Song and
Yunhan Sun, YiJie Zhong, Jinlong Shi

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Nenglun Chen, Lingjie Liu, Zhiming Cui and
Runnan Chen, Duygu Ceylan, Changhe Tu, Wenping Wang

Keywords Paper

Keywords Paper

Adam Kosiorek, Heiko Strathmann, Daniel Zoran and
Pol Moreno, Rosalia Schneider, Sona Mokra, Danilo J. Rezende

Keywords Paper

Pengpeng Liu, Xintong Han, Michael Lyu and
Irwin King, Jia Xu

Keywords Paper

Youssef Alami Mejjati, Isa Milefchik, Aaron K Gokaslan and
Oliver Wang, Kwang In Kim, James Tompkin

Keywords Paper

Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata and
Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu

Keywords Paper

Keywords Paper

Xiaolong Li, Yijia Weng, Li Yi and
Leonidas Guibas, A. Abbott, Shuran Song, He Wang

Keywords Paper

Keywords Paper

Pratul P. Srinivasan, Ben Mildenhall, Matthew Tancik and
Jonathan T. Barron, Richard Tucker, Noah Snavely

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Emilien Dupont, Miguel Angel Bautista Martin, Alex Colburn and
Aditya Sankar, Joshua Susskind, Qi Shan

Keywords Paper

Keywords Paper

Keywords Paper

Zhimin Chen, Longlong Jing, Yang Liang and
YingLi Tian, Bing Li

Keywords Paper

Zhaoyuan Fang, Ayush Jain, Gabriel Sarch and
Adam Harley, Katerina Fragkiadaki

Keywords Paper

Vitor Guizilini, Rui Hou, Jie Li and
Rares Ambrus, Adrien Gaidon

Keywords Paper

Keywords Paper

Dmitriy Smirnov, MICHAEL GHARBI, Matthew Fisher and
Vitor Guizilini, Alexei A Efros, Justin Solomon

Keywords Paper

Sicheng Xu, Jiaolong Yang, Dong Chen and
Fang Wen, Yu Deng, Yunde Jia, Xin Tong

Keywords Paper

Keywords Paper

Ayush Tewari, Mohamed Elgharib, Gaurav Bharaj and
Florian Bernard, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt

Keywords Paper

Keywords Paper

Keywords Paper