RevealNet: Seeing Behind Objects in RGB-D Scans

14/06/2020

RevealNet: Seeing Behind Objects in RGB-D Scans

Ji Hou, Angela Dai, Matthias Nießner

Keywords: instance completion, shape completion, instance segmentation, 3d scene understanding, rgb-d, multi-view geometry, segmentation

Abstract Paper Similar Papers

Abstract: During 3D reconstruction, it is often the case that people cannot scan each individual object from all views, resulting in missing geometry in the captured scan. This missing geometry can be fundamentally limiting for many applications, e.g., a robot needs to know the unseen geometry to perform a precise grasp on an object. Thus, we introduce the task of semantic instance completion: from an incomplete RGB-D scan of a scene, we aim to detect the individual object instances and infer their complete object geometry. This will open up new possibilities for interactions with objects in a scene, for instance for virtual or robotic agents. We tackle this problem by introducing RevealNet, a new data-driven approach that jointly detects object instances and predicts their complete geometry. This enables a semantically meaningful decomposition of a scanned scene into individual, complete 3D objects, including hidden and unobserved object parts. RevealNet is an end-to-end 3D neural network architecture that leverages joint color and geometry feature learning. The fully-convolutional nature of our 3D network enables efficient inference of semantic instance completion for 3D scans at scale of large indoor environments in a single forward pass. We show that predicting complete object geometry improves both 3D detection and instance segmentation performance. We evaluate on both real and synthetic scan benchmark data for the new task, where we outperform state-of-the-art approaches by over 15 in mAP@0.5 on ScanNet, and over 18 in mAP@0.5 on SUNCG.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at CVPR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

30/11/2020

3D Object Detection and Pose Estimation of Unseen Objects in Color Images with Local Surface Embeddings

Giorgia Pitteri, Aureélie Bugeau, Slobodan Ilic, Vincent Lepetit

Keywords Paper

0

0

0

0

9:17

06/12/2020

Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D

Ankit Goyal, Kaiyu Yang, Dawei Yang, Jia Deng

Keywords Paper

0

0

0

0

3:27

14/06/2020

SG-NN: Sparse Generative Neural Networks for Self-Supervised Scene Completion of RGB-D Scans

Angela Dai, Christian Diller, Matthias Nießner

Keywords Paper

3d vision, self-supervised training, generative 3d learning, 3d reconstruction

0

0

0

0

1:00

16/11/2020

S3CNet: A Sparse Semantic Scene Completion Network for LiDAR Point Clouds

Ran Cheng, Christopher Agia, Yuan Ren and
Xinhai Li, Liu Bingbing

Keywords Paper

0

0

0

0

6:13

22/11/2021

Leveraging Geometry for Shape Estimation from a Single RGB Image

Florian Maximilian Langer, Ignas Budvytis, Roberto Cipolla

Keywords Paper

3D shape, CAD model, deformation, image-based retrieval, key points, geometry

0

0

0

0

3:02

14/06/2020

Self-Supervised Learning of Interpretable Keypoints From Unlabelled Videos

Tomas Jakab, Ankush Gupta, Hakan Bilen, Andrea Vedaldi

Keywords Paper

self-supervised, unsupervised, keypoints, landmarks, pose, videos, adversarial, gan, disentanglement, factorizations

0

0

0

0

5:01

03/05/2021

NeMo: Neural Mesh Models of Contrastive Features for Robust 3D Pose Estimation

Angtian Wang, Adam Kortylewski, Alan Yuille

Keywords Paper

Contrastive Learning, Render-and-Compare, Robust Deep Learning, Pose Estimation

0

0

0

0

5:08

02/02/2021

ASHF-Net: Adaptive Sampling and Hierarchical Folding Network for Robust Point Cloud Completion

Daoming Zong, Shiliang Sun, Jing Zhao

Keywords Paper

0

0

0

0

16:49

06/12/2020

Grasp Proposal Networks: An End-to-End Solution for Visual Learning of Robotic Grasps

Chaozheng Wu, Jian Chen, Qiaoyu Cao and
Jianchi Zhang, Yunxin Tai, Lin Sun, Kui Jia

Keywords Paper

0

0

0

0

3:19

14/06/2020

Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion

Julian Chibane, Thiemo Alldieck, Gerard Pons-Moll

Keywords Paper

shape reconstruction, shape completion, implicit function learning, 3d scene understanding, single-view reconstruction, point cloud completion, voxel super-resolution, representation learning, surface reconstruction, 3d vision

0

0

0

0

1:01

14/06/2020

Unsupervised Learning of Intrinsic Structural Representation Points

Nenglun Chen, Lingjie Liu, Zhiming Cui and
Runnan Chen, Duygu Ceylan, Changhe Tu, Wenping Wang

Keywords Paper

3d point cloud learning, structure point, unsupervised learning

0

0

0

0

1:00

16/11/2020

Self-Supervised Object-in-Gripper Segmentation from Robotic Motions

Wout Boerdijk, Martin Sundermeyer, Maximilian Durner, Rudolph Triebel

Keywords Paper

0

0

0

0

5:03

14/06/2020

Leveraging 2D Data to Learn Textured 3D Mesh Generation

Paul Henderson, Vagia Tsiminaki, Christoph H. Lampert

Keywords Paper

generative models, inverse graphics, single-image 3d reconstruction, variational autoencoders, meshes, unsupervised learning

0

0

0

0

4:56

14/06/2020

Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking

Peiliang Li, Jieqi Shi, Shaojie Shen

Keywords Paper

3d object tracking, stereo cameras, autonomous driving

0

0

0

0

1:01

14/06/2020

DSGN: Deep Stereo Geometry Network for 3D Object Detection

Yilun Chen, Shu Liu, Xiaoyong Shen, Jiaya Jia

Keywords Paper

3d object detection, autonomous vehicle, stereo matching, depth estimation, kitti, 3d perception, lidar sensor, stereo camera, point cloud

0

0

0

0

1:00

14/06/2020

ARCH: Animatable Reconstruction of Clothed Humans

Zeng Huang, Yuanlu Xu, Christoph Lassner and
Hao Li, Tony Tung

Keywords Paper

body modeling, 3d reconstruction, implicit function learning, differentiable rendering, 3d pose, monocular rgb, deep learning

0

0

0

0

1:01

14/06/2020

Generating 3D People in Scenes Without People

Yan Zhang, Mohamed Hassan, Heiko Neumann and
Michael J. Black, Siyu Tang

Keywords Paper

person-scene interaction, 3d human body, 3d scene, generative model, geometry-aware fitting

0

0

0

0

4:55

14/06/2020

Learning Unsupervised Hierarchical Part Decomposition of 3D Objects From a Single RGB Image

Despoina Paschalidou, Luc Van Gool, Andreas Geiger

Keywords Paper

3d reconstruction, primitive-based representations, structure-aware representations, part-based decomposition, primitives, semantic shape abstractions, single-view 3d reconstruction, unsupervised learning, 3d deep learning

0

0

0

0

1:01

14/06/2020

Coherent Reconstruction of Multiple Humans From a Single Image

Wen Jiang, Nikos Kolotouros, Georgios Pavlakos and
Xiaowei Zhou, Kostas Daniilidis

Keywords Paper

3d human pose, multiple humans, 3d from single image, body pose and shape, coherent reconstruction

0

0

0

0

1:01

05/01/2021

Deep Template-Based Object Instance Detection

Jean-Philippe Mercier, Mathieu Garon, Philippe Giguere, Jean-Francois Lalonde

Keywords Paper

0

0

0

0

5:01

14/06/2020

From Image Collections to Point Clouds With Self-Supervised Shape and Pose Networks

K L Navaneet, Ansu Mathew, Shashank Kashyap and
Wei-Chih Hung, Varun Jampani, R. Venkatesh Babu

Keywords Paper

3d reconstruction, single image reconstruction, self supervised, point clouds, unsupervised, 2d to 3d, image collections

0

0

0

0

1:01

14/06/2020

LatentFusion: End-to-End Differentiable Reconstruction and Rendering for Unseen Object Pose Estimation

Keunhong Park, Arsalan Mousavian, Yu Xiang, Dieter Fox

Keywords Paper

pose estimation, pose, neural rendering, zero-shot, shape learning, 3d reconstruction, datasets, generative models, multi-view, robotics

0

0

0

0

1:01

16/11/2020

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation

Yu Xiang, Christopher Xie, Arsalan Mousavian, Dieter Fox

Keywords Paper

0

0

0

0

5:17

14/06/2020

SynSin: End-to-End View Synthesis From a Single Image

Olivia Wiles, Georgia Gkioxari, Richard Szeliski, Justin Johnson

Keywords Paper

single image view synthesis, view synthesis, differentiable rendering, point cloud, convolutional neural networks, generative networks

0

0

0

0

4:58

02/02/2021

Shape-Pose Ambiguity in Learning 3D Reconstruction from Images

Yunjie Wu, Zhengxing Sun, Youcheng Song and
Yunhan Sun, YiJie Zhong, Jinlong Shi

Keywords Paper

0

0

0

0

15:50

06/12/2020

Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views

Nanbo Li, Cian Eastwood, Robert Fisher

Keywords Paper

0

0

0

0

3:19

26/04/2020

Learning to Move with Affordance Maps

William Qi, Ravi Teja Mullapudi, Saurabh Gupta, Deva Ramanan

Keywords Paper

navigation, exploration

0

0

0

0

5:28

16/11/2020

Amodal 3D Reconstruction for Robotic Manipulation via Stability and Connectivity

William Agnew, Christopher Xie, Aaron Walsman and
Octavian Murad, Yubo Wang, Pedro Domingos, Siddhartha Srinivasa

Keywords Paper

0

0

0

0

4:37

06/12/2021

Active 3D Shape Reconstruction from Vision and Touch

Edward Smith, David Meger, Luis Pineda and
Roberto Calandra, Jitendra Malik, Adriana Romero Soriano, Michal Drozdzal

Keywords Paper

deep learning, reinforcement learning and planning

0

0

0

0

8:31

14/06/2020

RL-CycleGAN: Reinforcement Learning Aware Simulation-to-Real

Kanishka Rao, Chris Harris, Alex Irpan and
Sergey Levine, Julian Ibarz, Mohi Khansari

Keywords Paper

robotics, sim2real, cyclegan, reinforcement learning, grasping, q-learning

0

0

0

0

4:55

14/06/2020

DIST: Rendering Deep Implicit Signed Distance Function With Differentiable Sphere Tracing

Shaohui Liu, Yinda Zhang, Songyou Peng and
Boxin Shi, Marc Pollefeys, Zhaopeng Cui

Keywords Paper

differentiable rendering, 3d reconstruction, implicit representations, multi-view reconstruction, depth completion, 3d deep learning

0

0

0

0

1:01

05/01/2021

SMPLpix: Neural Avatars From 3D Human Models

Sergey Prokudin, Michael J. Black, Javier Romero

Keywords Paper

0

0

0

0

4:55

14/06/2020

KeyPose: Multi-View 3D Labeling and Keypoint Estimation for Transparent Objects

Xingyu Liu, Rico Jonschkowski, Anelia Angelova, Kurt Konolige

Keywords Paper

pose estimation, stereo, keypoint, transparent object, robotic manipulation, depth sensor, dataset

0

0

0

0

1:00

05/01/2021

Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization

Sadbhavana Babar, Sukhendu Das

Keywords Paper

0

0

0

0

5:01

14/06/2020

Three-Dimensional Reconstruction of Human Interactions

Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata and
Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu

Keywords Paper

human, interaction, 3d, reconstruction, contact, dataset, pose, shape, body, person

0

0

0

0

1:00

14/06/2020

ImVoteNet: Boosting 3D Object Detection in Point Clouds With Image Votes

Charles R. Qi, Xinlei Chen, Or Litany, Leonidas J. Guibas

Keywords Paper

3d object detection, rgb-d, voting, point clouds, multi-modality, fusion, deep learning, object recognition.

0

0

0

0

1:00

16/11/2020

S3K: Self-Supervised Semantic Keypoints for Robotic Manipulation via Multi-View Consistency

Mel Vecerik, Jean-Baptiste Regli, Oleg Sushkov and
David Barker, Rugile Pevceviciute, Thomas Roth ̈orl, Raia Hadsell, Lourdes Agapito, Jonathan Scholz

Keywords Paper

0

0

0

0

5:06

14/06/2020

Self-Supervised Scene De-Occlusion

Xiaohang Zhan, Xingang Pan, Bo Dai and
Ziwei Liu, Dahua Lin, Chen Change Loy

Keywords Paper

de-occlusion, self-supervised, occlusion ordering, scene understanding, amodal completion, inpainting, amodal instance segmentation, decomposition, image editing, manipulation

0

0

0

0

4:59

06/12/2021

Progressive Coordinate Transforms for Monocular 3D Object Detection

Li Wang, Li Zhang, Yi Zhu and
Zhi Zhang, Tong He, Mu Li, Xiangyang Xue

Keywords Paper

vision

0

0

0

0

13:21

14/06/2020

Leveraging Photometric Consistency Over Time for Sparsely Supervised Hand-Object Reconstruction

Yana Hasson, Bugra Tekin, Federica Bogo and
Ivan Laptev, Marc Pollefeys, Cordelia Schmid

Keywords Paper

hand-object reconstruction, pose estimation, object manipulation, photometric consistency, self-supervised learning, hands, objects, 3d reconstruction, manipulation

0

0

0

0

1:01