FroDO: From Detections to 3D Objects

14/06/2020

FroDO: From Detections to 3D Objects

Martin Rünz, Kejie Li, Meng Tang, Lingni Ma, Chen Kong, Tanner Schmidt, Ian Reid, Lourdes Agapito, Julian Straub, Steven Lovegrove, Richard Newcombe

Keywords: reconstruction, shape embedding, 3d vision, object detection, shape prior, object representation, monocular, sdf, pointcloud, inference

Abstract Paper Similar Papers

Abstract: Object-oriented maps are important for scene understanding since they jointly capture geometry and semantics, allow individual instantiation and meaningful reasoning about objects. We introduce FroDO, a method for accurate 3D reconstruction of object instances from RGB video that infers their location, pose and shape in a coarse to fine manner. Key to FroDO is to embed object shapes in a novel learnt shape space that allows seamless switching between sparse point cloud and dense DeepSDF decoding. Given an input sequence of localized RGB frames, FroDO first aggregates 2D detections to instantiate a 3D bounding box per object. A shape code is regressed using an encoder network before optimizing shape and pose further under the learnt shape priors using sparse or dense shape representations. The optimization uses multi-view geometric, photometric and silhouette losses. We evaluate on real-world datasets, including Pix3D, Redwood-OS, and ScanNet, for single-view, multi-view, and multi-object reconstruction.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at CVPR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

22/11/2021

GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes

Youssef Alami Mejjati, Isa Milefchik, Aaron K Gokaslan and
Oliver Wang, Kwang In Kim, James Tompkin

Keywords Paper

structured representation, 3D representation, 3D Gaussians, image generation, image synthesis, image editing, controlled generation, GANs

0

0

0

0

2:49

06/12/2021

Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image

Feng Liu, Xiaoming Liu

Keywords Paper

vision

0

0

0

0

9:19

14/06/2020

Learning Canonical Shape Space for Category-Level 6D Object Pose and Size Estimation

Dengsheng Chen, Jun Li, Zheng Wang, Kai Xu

Keywords Paper

object pose estimation, 3d vision

0

0

0

0

1:01

14/06/2020

StructEdit: Learning Structural Shape Variations

Kaichun Mo, Paul Guerrero, Li Yi and
Hao Su, Peter Wonka, Niloy J. Mitra, Leonidas J. Guibas

Keywords Paper

3d vision, 3d graphics, shape editing, generative modeling, shape analysis, edit transfer, shape parts, shape structure, conditional generative model, variational auto-encoder

0

0

0

0

1:01

06/12/2021

To The Point: Correspondence-driven monocular 3D category reconstruction

Filippos Kokkinos, Iasonas Kokkinos

Keywords Paper

optimization

0

0

0

0

5:27

06/12/2020

Multi-Plane Program Induction with 3D Box Priors

Yikai Li, Jiayuan Mao, Xiuming Zhang and
Bill Freeman, Josh Tenenbaum, Noah Snavely, Jiajun Wu

Keywords Paper

0

0

0

0

3:24

06/12/2021

3DP3: 3D Scene Perception via Probabilistic Programming

Nishad Gothoskar, Marco Cusumano-Towner, Ben Zinberg and
Matin Ghavamizadeh, Falk Pollok, Austin Garrett, Josh Tenenbaum, Dan Gutfreund, Vikash Mansinghka

Keywords Paper

deep learning, vision, generative model, graph learning

0

0

0

0

6:16

06/12/2021

Panoptic 3D Scene Reconstruction From a Single RGB Image

Manuel Dahnert, Ji Hou, Matthias Niessner, Angela Dai

Keywords Paper

vision

0

0

0

0

6:06

30/11/2020

Novel-View Human Action Synthesis

Mohamed Ilyes Lakhal, Davide Boscaini, Fabio Poiesi and
Oswald Lanz, Andrea Cavallaro

Keywords Paper

0

0

0

0

4:34

05/01/2021

Learning to Generate Dense Point Clouds With Textures on Multiple Categories

Tao Hu, Geng Lin, Zhizhong Han, Matthias Zwicker

Keywords Paper

0

0

0

0

4:57

06/12/2020

Generative View Synthesis: From Single-view Semantics to Novel-view Images

Tewodros Amberbir Habtegebrial, Varun Jampani, Orazio Gallo, Didier Stricker

Keywords Paper

0

0

0

0

3:20

14/06/2020

DIST: Rendering Deep Implicit Signed Distance Function With Differentiable Sphere Tracing

Shaohui Liu, Yinda Zhang, Songyou Peng and
Boxin Shi, Marc Pollefeys, Zhaopeng Cui

Keywords Paper

differentiable rendering, 3d reconstruction, implicit representations, multi-view reconstruction, depth completion, 3d deep learning

0

0

0

0

1:01

14/06/2020

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

Yancheng Wang, Yang Xiao, Fu Xiong and
Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

Keywords Paper

3d action recognition, point cloud, 3d motion, temporal rank pooling, pointnet++, multi-stream network

0

0

0

0

1:01

14/06/2020

On Joint Estimation of Pose, Geometry and svBRDF From a Handheld Scanner

Carolin Schmitt, Simon Donné, Gernot Riegler and
Vladlen Koltun, Andreas Geiger

Keywords Paper

3d reconstruction, mobile lightstage, mulitview photometric stereo, svbrdf estimation, shape from shading, material segmentation, handheld 3d sensor, non-lambertian surfaces

0

0

0

0

1:01

14/06/2020

Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking

Peiliang Li, Jieqi Shi, Shaojie Shen

Keywords Paper

3d object tracking, stereo cameras, autonomous driving

0

0

0

0

1:01

14/06/2020

G2L-Net: Global to Local Network for Real-Time 6D Pose Estimation With Embedding Vector Features

Wei Chen, Xi Jia, Hyung Jin Chang and
Jinming Duan, Aleš Leonardis

Keywords Paper

object pose estimation, point cloud, embedding vector features, real time

0

0

0

0

1:01

14/06/2020

Joint Texture and Geometry Optimization for RGB-D Reconstruction

Yanping Fu, Qingan Yan, Jie Liao, Chunxia Xiao

Keywords Paper

rgb-d reconstruction, 3d reconstruction, texture optimization, geometry optimization, joint texture and geometry optimization

0

0

0

0

0:57

14/06/2020

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation

Li Jiang, Hengshuang Zhao, Shaoshuai Shi and
Shu Liu, Chi-Wing Fu, Jiaya Jia

Keywords Paper

instance segmentation, point cloud, 3d, scene understanding, indoor scenes, bottom-up, grouping, dual-set, scannet, s3dis

0

0

0

0

5:01

06/12/2021

Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis

Tianchang Shen, Jun Gao, Kangxue Yin and
Ming-Yu Liu, Sanja Fidler

Keywords Paper

deep learning, optimization, generative model

0

0

0

0

14:32

06/12/2020

Hard Example Generation by Texture Synthesis for Cross-domain Shape Similarity Learning

Huan Fu, Shunming Li, Rongfei Jia and
Mingming Gong, Binqiang Zhao, Dacheng Tao

Keywords Paper

0

0

0

0

3:21

06/12/2021

Differentiable rendering with perturbed optimizers

Quentin Le Lidec, Ivan Laptev, Cordelia Schmid, Justin Carpentier

Keywords Paper

optimization, vision

0

0

0

0

3:18

15/06/2020

Synthesizing structured CAD models with equality saturation and inverse transformations

Chandrakana Nandi, Max Willsey, Adam Anderson and
James R. Wilcox, Eva Darulova, Dan Grossman, Zachary Tatlock

Keywords Paper

Decompilation, Program Synthesis, Computer-Aided Design, Equality Saturation

0

0

0

0

16:25

14/06/2020

Fusion-Aware Point Convolution for Online Semantic 3D Scene Segmentation

Jiazhao Zhang, Chenyang Zhu, Lintao Zheng, Kai Xu

Keywords Paper

online segmentation, scene understanding, semantic segmentation, point cloud, 3d vision

0

0

0

0

1:01

02/02/2021

Text-Guided Graph Neural Networks for Referring 3D Instance Segmentation

Pin-Hao Huang, Han-Hung Lee, Hwann-Tzong Chen, Tyng-Luh Liu

Keywords Paper

0

0

0

0

15:17

14/06/2020

Perspective Plane Program Induction From a Single Image

Yikai Li, Jiayuan Mao, Xiuming Zhang and
William T. Freeman, Joshua B. Tenenbaum, Jiajun Wu

Keywords Paper

inverse graphics, program synthesis, image manipulation, camera pose estimation, repeated pattern detection, image inpainting, image extrapolation

0

0

0

0

1:01

14/06/2020

PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes

Rundi Wu, Yixin Zhuang, Kai Xu and
Hao Zhang, Baoquan Chen

Keywords Paper

3d shape modeling, generative models, shape generation, representation learning, sequence-to-sequence network, implicit function

0

0

0

0

1:01

30/11/2020

3D Object Detection and Pose Estimation of Unseen Objects in Color Images with Local Surface Embeddings

Giorgia Pitteri, Aureélie Bugeau, Slobodan Ilic, Vincent Lepetit

Keywords Paper

0

0

0

0

9:17

06/12/2021

ViSER: Video-Specific Surface Embeddings for Articulated 3D Shape Reconstruction

Gengshan Yang, Deqing Sun, Varun Jampani and
Daniel Vlasic, Forrester Cole, Ce Liu, Deva Ramanan

Keywords Paper

0

0

0

0

10:42

22/11/2021

AniFormer: Data-driven 3D Animation with Transformer

Haoyu Chen, Hao Tang, Nicu Sebe, Guoying Zhao

Keywords Paper

3D motion, 3D generation, 3D style transfer, Transformer, 3D animation

0

0

0

0

2:51

06/12/2020

Geo-PIFu: Geometry and Pixel Aligned Implicit Functions for Single-view Human Reconstruction

Tong He, John Collomosse, Hailin Jin, Stefano Soatto

Keywords Paper

0

0

0

0

3:16

30/11/2020

Image Captioning through Image Transformer

Sen He, Wentong Liao, Hamed R. Tavakoli and
Michael Yang, Bodo Rosenhahn, Nicolas Pugeault

Keywords Paper

0

0

0

0

9:49

14/06/2020

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

Xiaotian Li, Shuzhe Wang, Yi Zhao and
Jakob Verbeek, Juho Kannala

Keywords Paper

visual localization, camera relocalization, scene coordinate regression

0

0

0

0

1:01

14/06/2020

OccuSeg: Occupancy-Aware 3D Instance Segmentation

Lei Han, Tian Zheng, Lan Xu, Lu Fang

Keywords Paper

instance segmentation, multi-task learning, occupancy

0

0

0

0

1:02

14/06/2020

SynSin: End-to-End View Synthesis From a Single Image

Olivia Wiles, Georgia Gkioxari, Richard Szeliski, Justin Johnson

Keywords Paper

single image view synthesis, view synthesis, differentiable rendering, point cloud, convolutional neural networks, generative networks

0

0

0

0

4:58

17/08/2020

Cut-enhanced PolyCube-maps for feature-aware all-hex meshing

Hao-Xiang Guo, Xiaohan Liu, Dong-Ming Yan, Yang Liu

Keywords Paper

PolyCube-Map, cut-enhanced, feature-aware, hexahedral meshing

0

0

0

0

16:47

06/12/2021

TransformerFusion: Monocular RGB Scene Reconstruction using Transformers

Aljaz Bozic, Pablo Palafox, Justus Thies and
Angela Dai, Matthias Niessner

Keywords Paper

transformers

0

0

0

0

7:14

06/12/2020

Learning Implicit Functions for Topology-Varying Dense 3D Shape Correspondence

Feng Liu, Xiaoming Liu

Keywords Paper

Deep Learning -> Adversarial Networks, Algorithms -> Adversarial Learning

0

0

0

0

3:16

30/11/2020

3D Guided Weakly Supervised Semantic Segmentation

Weixuan Sun, Jing Zhang, Nick Barnes

Keywords Paper

0

0

0

0

9:20

22/11/2021

FacialGAN: Style Transfer and Attribute Manipulation on Synthetic Faces

Ricard Durall Lopez, Jireh Jam, Dominik Strassel and
Moi Hoon Yap, Janis Keuper

Keywords Paper

GAN, attribute manipulation, style transfer, face editing

0

0

0

0

2:55

14/06/2020

Sequential 3D Human Pose and Shape Estimation From Point Clouds

Kangkan Wang, Jin Xie, Guofeng Zhang and
Lei Liu, Jian Yang

Keywords Paper

3d reconstruction, 3d human pose and shape estimation, point clouds, depth sensor, sequential modeling, spatial-temporal features, mesh convolution, attention model, weakly-supervised fine-tuning, deep learning

0

0

0

0

1:00