Anisotropic Convolutional Networks for 3D Semantic Scene Completion

14/06/2020

Jie Li, Kai Han, Peng Wang, Yu Liu, Xia Yuan

Keywords: semantic scene completion, dense voxel prediction, shape completion, semantic segmentation, rgb-d, anisotropic convolution, voxel-wise receptive fields, 3d convolution

Abstract: As a voxel-wise labeling task, semantic scene completion (SSC) aims to simultaneously infer the occupancy and semantic labels of a scene from a single depth and/or RGB image. The key challenge for SSC is how to effectively exploit 3D context to model objects and stuff that vary severely in shape, layout, and visibility. To handle such variations, we propose a novel module called anisotropic convolution, which offers flexibility and modeling power that competing methods, such as standard 3D convolution and some of its variants, cannot provide. In contrast to standard 3D convolution, which is limited to a fixed 3D receptive field, our module can model dimensional anisotropy on a per-voxel basis. The basic idea is to enable an anisotropic 3D receptive field by decomposing a 3D convolution into three consecutive 1D convolutions, with the kernel size of each 1D convolution determined adaptively on the fly. By stacking multiple such anisotropic convolution modules, the voxel-wise modeling capability can be further enhanced while keeping the number of model parameters under control. Extensive experiments on two SSC benchmarks, NYU-Depth-v2 and NYUCAD, show the superior performance of the proposed method.
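The decomposition described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it is a pure-Python toy that runs three consecutive 1D passes over a nested-list volume and, at every voxel, blends the responses of several candidate kernel sizes with softmax weights. The soft blending, the box-filter kernels, and the names `conv1d_along_axis`, `anisotropic_conv`, `BOX_KERNELS`, and `wide_along_x` are all assumptions made for illustration; in a trained network the gate would presumably be learned rather than hand-written.

```python
import math

def conv1d_along_axis(vol, kernel, axis):
    """Zero-padded 1D filtering of a nested-list volume vol[z][y][x] along one axis."""
    dims = (len(vol), len(vol[0]), len(vol[0][0]))
    radius = len(kernel) // 2
    out = [[[0.0] * dims[2] for _ in range(dims[1])] for _ in range(dims[0])]
    for z in range(dims[0]):
        for y in range(dims[1]):
            for x in range(dims[2]):
                acc = 0.0
                for t, w in enumerate(kernel):
                    pos = [z, y, x]
                    pos[axis] += t - radius  # neighbour offset along the chosen axis
                    if 0 <= pos[axis] < dims[axis]:
                        acc += w * vol[pos[0]][pos[1]][pos[2]]
                out[z][y][x] = acc
    return out

def softmax(logits):
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def anisotropic_conv(vol, kernels, gate):
    """Three consecutive 1D passes (axes 0, 1, 2).

    kernels: candidate odd-length 1D kernels, shared by all axes here.
    gate(axis, value) -> one logit per candidate kernel. This is a
    hypothetical stand-in for whatever selects the kernel size per voxel.
    """
    cur = vol
    for axis in range(3):
        responses = [conv1d_along_axis(cur, k, axis) for k in kernels]
        nxt = []
        for z, plane in enumerate(cur):
            new_plane = []
            for y, row in enumerate(plane):
                new_row = []
                for x, value in enumerate(row):
                    weights = softmax(gate(axis, value))
                    new_row.append(sum(w * r[z][y][x]
                                       for w, r in zip(weights, responses)))
                new_plane.append(new_row)
            nxt.append(new_plane)
        cur = nxt
    return cur

# Assumed candidates: box filters of length 1, 3 and 5.
BOX_KERNELS = [[1.0], [1.0 / 3] * 3, [1.0 / 5] * 5]

def wide_along_x(axis, value):
    """Hand-written gate: length-5 kernel along x (axis 2), length-1 elsewhere."""
    return [0.0, 0.0, 50.0] if axis == 2 else [50.0, 0.0, 0.0]

# A single active voxel in the centre of a 5x5x5 volume.
impulse = [[[0.0] * 5 for _ in range(5)] for _ in range(5)]
impulse[2][2][2] = 1.0
out = anisotropic_conv(impulse, BOX_KERNELS, wide_along_x)
```

With this gate the impulse is spread along x only, so the effective receptive field is 1 x 1 x 5 rather than a cube. On the parameter side, three 1D kernels of length 5 hold 15 weights per input/output channel pair, whereas a dense 5 x 5 x 5 kernel holds 125.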

The talk and the corresponding paper were published at the CVPR 2020 virtual conference.

Similar Papers

14/06/2020

Towards Unsupervised Learning of Generative Models for 3D Controllable Image Synthesis

Yiyi Liao, Katja Schwarz, Lars Mescheder, Andreas Geiger

Keywords: image synthesis, generative adversarial network, 3d controllability, unsupervised learning, 3d representation, disentangled representation, differentiable rendering, neural rendering

1:01

06/12/2020

Generative View Synthesis: From Single-view Semantics to Novel-view Images

Tewodros Amberbir Habtegebrial, Varun Jampani, Orazio Gallo, Didier Stricker

3:20

14/06/2020

OccuSeg: Occupancy-Aware 3D Instance Segmentation

Lei Han, Tian Zheng, Lan Xu, Lu Fang

Keywords: instance segmentation, multi-task learning, occupancy

1:02

14/06/2020

Joint Texture and Geometry Optimization for RGB-D Reconstruction

Yanping Fu, Qingan Yan, Jie Liao, Chunxia Xiao

Keywords: rgb-d reconstruction, 3d reconstruction, texture optimization, geometry optimization, joint texture and geometry optimization

0:57

06/12/2021

Panoptic 3D Scene Reconstruction From a Single RGB Image

Manuel Dahnert, Ji Hou, Matthias Niessner, Angela Dai

Keywords: vision

6:06

14/06/2020

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

Yancheng Wang, Yang Xiao, Fu Xiong, Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

Keywords: 3d action recognition, point cloud, 3d motion, temporal rank pooling, pointnet++, multi-stream network

1:01

14/06/2020

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

Xiaotian Li, Shuzhe Wang, Yi Zhao, Jakob Verbeek, Juho Kannala

Keywords: visual localization, camera relocalization, scene coordinate regression

1:01

17/08/2020

Radiative backpropagation: An adjoint method for lightning-fast differentiable rendering

Merlin Nimier-David, Sébastien Speierer, Benoît Ruiz, Wenzel Jakob

Keywords: ray tracing, global illumination, differentiable rendering

17:54

30/11/2020

Robust High Dynamic Range (HDR) Imaging with Complex Motion and Parallax

Zhiyuan Pu, Peiyao Guo, M. Salman Asif, Zhan Ma

7:38

14/06/2020

Novel View Synthesis of Dynamic Scenes With Globally Coherent Depths From a Monocular Camera

Jae Shin Yoon, Kihwan Kim, Orazio Gallo, Hyun Soo Park, Jan Kautz

Keywords: view synthesis, depth estimation, dynamic scene, depth fusion, globally coherent depth, monocular camera

1:00

22/11/2021

Multi-Modality Task Cascade for 3D Object Detection

Jinhyung Park, Xinshuo Weng, Yunze Man, Kris Kitani

Keywords: Multi Modality Learning, Object Detection, Semantic Segmentation

3:03

06/12/2021

A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis

Xingang Pan, Xudong Xu, Chen Change Loy, Christian Theobalt, Bo Dai

Keywords: generative model

11:15

14/06/2020

UCTGAN: Diverse Image Inpainting Based on Unsupervised Cross-Space Translation

Lei Zhao, Qihang Mo, Sihuan Lin, Zhizhong Wang, Zhiwen Zuo, Haibo Chen, Wei Xing, Dongming Lu

Keywords: image inpainting, diverse image inpainting, image completion, unsupervised cross-space translation, diverse image generation, deep-learning based inpainting, deep learning, multiple-solution inpainting

1:01

14/06/2020

SynSin: End-to-End View Synthesis From a Single Image

Olivia Wiles, Georgia Gkioxari, Richard Szeliski, Justin Johnson

Keywords: single image view synthesis, view synthesis, differentiable rendering, point cloud, convolutional neural networks, generative networks

4:58

14/06/2020

On Joint Estimation of Pose, Geometry and svBRDF From a Handheld Scanner

Carolin Schmitt, Simon Donné, Gernot Riegler, Vladlen Koltun, Andreas Geiger

Keywords: 3d reconstruction, mobile lightstage, multiview photometric stereo, svbrdf estimation, shape from shading, material segmentation, handheld 3d sensor, non-lambertian surfaces

1:01

26/04/2020

Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation

Hang Gao, Xizhou Zhu, Stephen Lin, Jifeng Dai

Keywords: Effective Receptive Fields, Deformation Modeling, Dynamic Inference

4:13

14/06/2020

SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization

Yue Jiang, Dantong Ji, Zhizhong Han, Matthias Zwicker

Keywords: differentiable rendering, signed distance field, image-based 3d reconstruction, 3d shape optimization, deep learning, inverse graphics

5:01

02/02/2021

ASHF-Net: Adaptive Sampling and Hierarchical Folding Network for Robust Point Cloud Completion

Daoming Zong, Shiliang Sun, Jing Zhao

16:49

14/06/2020

ASLFeat: Learning Local Features of Accurate Shape and Localization

Zixin Luo, Lei Zhou, Xuyang Bai, Hongkai Chen, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan

Keywords: image matching, local feature keypoints, local feature descriptors, deep learning

1:01

14/06/2020

ReDA: Reinforced Differentiable Attribute for 3D Face Reconstruction

Wenbin Zhu, HsiangTao Wu, Zeyu Chen, Noranart Vesdapunt, Baoyuan Wang

Keywords: 3d face reconstruction, soft rasterization, differentiable rendering, free-form deformation, 3d morphable model, face parsing

5:01

06/12/2021

Sparse Steerable Convolutions: An Efficient Learning of SE(3)-Equivariant Features for Estimation and Tracking of Object Poses in 3D Space

Jiehong Lin, Hongyang Li, Ke Chen, Jiangbo Lu, Kui Jia

Keywords: vision

12:29

14/06/2020

Spatially-Attentive Patch-Hierarchical Network for Adaptive Motion Deblurring

Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan

Keywords: motion blur, spatially varying, attention, dynamic filter, adaptive, dynamic scene, deformable, encoder decoder, hierarchical, convolutional neural network

1:00

06/12/2020

GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis

Katja Schwarz, Yiyi Liao, Michael Niemeyer, Andreas Geiger

Keywords: Algorithms -> Large Scale Learning, Optimization -> Stochastic Optimization

3:23

06/12/2020

Neural Star Domain as Primitive Representation

Yuki Kawana, Yusuke Mukuta, Tatsuya Harada

3:15

02/02/2021

Object-Centric Image Generation from Layouts

Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio, R Devon Hjelm, Shikhar Sharma

17:44

03/08/2020

Locally Masked Convolution for Autoregressive Models

Ajay Jain, Pieter Abbeel, Deepak Pathak

8:28

22/11/2021

AniFormer: Data-driven 3D Animation with Transformer

Haoyu Chen, Hao Tang, Nicu Sebe, Guoying Zhao

Keywords: 3D motion, 3D generation, 3D style transfer, Transformer, 3D animation

2:51

14/06/2020

MARMVS: Matching Ambiguity Reduced Multiple View Stereo for Efficient Large Scale Scene Reconstruction

Zhenyu Xu, Yiguang Liu, Xuelei Shi, Ying Wang, Yunan Zheng

Keywords: multiple view stereo, dense matching, matching ambiguity, cpu efficient, scale selection, differential geometry, epipolar geometry, depth map fusion, accuracy and completeness, 3d reconstruction

1:01

22/11/2021

Learning Attention Map for 3D Human Recovery from a Single RGB Image

Peng Xu, Na Jiang, Jun Li, Zhiping Shi

Keywords: 3D Human Recover, Human Parsing, Depth Estimation

8:14

14/06/2020

KeypointNet: A Large-Scale 3D Keypoint Dataset Aggregated From Numerous Human Annotations

Yang You, Yujing Lou, Chengkun Li, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Cewu Lu, Weiming Wang

Keywords: dataset, 3d vision, 3d keypoints, object analysis

1:01

06/12/2021

Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image

Feng Liu, Xiaoming Liu

Keywords: vision

9:19

30/11/2020

3D Guided Weakly Supervised Semantic Segmentation

Weixuan Sun, Jing Zhang, Nick Barnes

9:20

06/12/2021

Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving Objects

Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Marc Pollefeys

Keywords: optimization

10:44

30/11/2020

Low-light Color Imaging via Dual Camera Acquisition

Peiyao Guo, Zhan Ma

7:28

14/06/2020

DIST: Rendering Deep Implicit Signed Distance Function With Differentiable Sphere Tracing

Shaohui Liu, Yinda Zhang, Songyou Peng, Boxin Shi, Marc Pollefeys, Zhaopeng Cui

Keywords: differentiable rendering, 3d reconstruction, implicit representations, multi-view reconstruction, depth completion, 3d deep learning

1:01

25/07/2020

3D self-attention for unsupervised video quantization

Jingkuan Song, Ruimin Lang, Xiaosu Zhu, Xing Xu, Lianli Gao, Heng Tao Shen

Keywords: quantization, video retrieval, ann search

9:44

30/11/2020

Localin Reshuffle Net: Toward Naturally and Efficiently Facial Image Blending

Chengyao Zheng, Siyu Xia, Joseph Robinson, Changsheng Lu, Wayne Wu, Chen Qian, Ming Shao

2:19

14/06/2020

Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking

Peiliang Li, Jieqi Shi, Shaojie Shen

Keywords: 3d object tracking, stereo cameras, autonomous driving

1:01

14/06/2020

HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection

Maosheng Ye, Shuangjie Xu, Tongyi Cao

Keywords: hybrid voxel network, hybrid voxel feature encoding, 3d object detection, autonomous driving, lidar based methods, hybrid scales voxelization, attentive voxel feature encoding, feature fusion pyramid network

1:00

14/06/2020

Self-Supervised Monocular Scene Flow Estimation

Junhwa Hur, Stefan Roth

Keywords: monocular scene flow, self-supervised learning, 3d scene flow, optical flow, monocular depth estimation

5:00