Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and Segmentation

22/11/2021

Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and Segmentation

Hanz Cuevas, Antonio-Javier Gallego, Robert B Fisher

Keywords: point cloud, point cloud segmentation, 3D, point cloud classification, shape classification, deep 3D segmentation, deep learning, scene understanding, 3D scene understanding, light network, local agregation, gnn

Abstract Paper Code Similar Papers

Abstract: We present an innovative two-headed attention layer that combines geometric and latent features to segment a 3D scene into semantically meaningful subsets. Each head combines local and global information, using either the geometric or latent features, of a neighborhood of points and uses this information to learn better local relationships. This Geometric-Latent attention layer (Ge-Latto) is combined with a sub-sampling strategy to capture global features. Our method is invariant to permutation thanks to the use of shared-MLP layers, and it can also be used with point clouds with varying densities because the local attention layer does not depend on the neighbor order. Our proposal is simple yet robust, which allows it to achieve competitive results in the ShapeNetPart and ModelNet40 datasets, and the state-of-the-art when segmenting the complex dataset S3DIS, with 69.2% IoU on Area 5, and 89.7% overall accuracy using K-fold cross-validation on the 6 areas.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at BMVC 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

30/11/2020

Bi-Directional Attention for Joint Instance and Semantic Segmentation in Point Clouds

guangnan wu, Zhiyi Pan, Peng Jiang, Changhe Tu

Keywords Paper

0

0

0

0

9:15

06/12/2021

Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving Objects

Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Marc Pollefeys

Keywords Paper

optimization

0

0

0

0

10:44

14/06/2020

OccuSeg: Occupancy-Aware 3D Instance Segmentation

Lei Han, Tian Zheng, Lan Xu, Lu Fang

Keywords Paper

instance segmentation, multi-task learning, occupancy

0

0

0

0

1:02

30/11/2020

D2D: Keypoint Extraction with Describe to Detect Approach

Yurun Tian, Vassileios Balntas, Tony Ng and
Axel Barroso-Laguna, Yiannis Demiris, Krystian Mikolajczyk

Keywords Paper

0

0

0

0

4:34

14/06/2020

PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection

Shaoshuai Shi, Chaoxu Guo, Li Jiang and
Zhe Wang, Jianping Shi, Xiaogang Wang, Hongsheng Li

Keywords Paper

3d object detection, point cloud, 3d scene understanding, lidar, autonomous driving, kitti dataset, waymo open dataset

0

0

0

0

1:01

30/11/2020

SDP-Net: Scene Flow Based Real-time Object Detection and Prediction from Sequential 3D Point Clouds

Yi Zhang, Yuwen Ye, Zhiyu Xiang, Jiaqi Gu

Keywords Paper

0

0

0

0

9:45

17/08/2020

Uncertainty quantification for multi-scan registration

Xiangru Huang, Zhenxiao Liang, Qixing Huang

Keywords Paper

multi-scan registration, uncertainty quantification, approximation error, view planning

0

0

0

0

21:04

22/11/2021

Multi-Modality Task Cascade for 3D Object Detection

Jinhyung Park, Xinshuo Weng, Yunze Man, Kris Kitani

Keywords Paper

Multi Modality Learning, Object Detection, Semantic Segmentation

0

0

0

0

3:03

14/06/2020

FPConv: Learning Local Flattening for Point Convolution

Yiqun Lin, Zizheng Yan, Haibin Huang and
Dong Du, Ligang Liu, Shuguang Cui, Xiaoguang Han

Keywords Paper

scene understanding, point cloud analysis, 3d deep learning, 3d semantic segmentation

0

0

0

0

1:01

19/08/2021

Spline Positional Encoding for Learning 3D Implicit Signed Distance Fields

Peng-Shuai Wang, Yang Liu, Yu-Qi Yang, Xin Tong

Keywords Paper

Computer Vision, 2D and 3D Computer Vision

0

0

0

0

9:42

05/01/2021

The Devil Is in the Boundary: Exploiting Boundary Representation for Basis-Based Instance Segmentation

Myungchul Kim, Sanghyun Woo, Dahun Kim, In So Kweon

Keywords Paper

0

0

0

0

4:47

14/06/2020

What You See is What You Get: Exploiting Visibility for 3D Object Detection

Peiyun Hu, Jason Ziglar, David Held, Deva Ramanan

Keywords Paper

freespace reasoning, 3d object detection, lidar processing, autonomous driving

0

0

0

0

5:01

06/12/2020

Deep Transformation-Invariant Clustering

Tom Monnier, Thibault Groueix, Mathieu Aubry

Keywords Paper

0

0

0

0

3:22

06/12/2021

Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis

Tianchang Shen, Jun Gao, Kangxue Yin and
Ming-Yu Liu, Sanja Fidler

Keywords Paper

deep learning, optimization, generative model

0

0

0

0

14:32

14/06/2020

DualConvMesh-Net: Joint Geodesic and Euclidean Convolutions on 3D Meshes

Jonas Schult, Francis Engelmann, Theodora Kontogianni, Bastian Leibe

Keywords Paper

3d, semantic, segmentation, meshes, semantic segmentation, 3d semantic segmentation

0

0

0

0

4:57

02/02/2021

Explicitly Modeled Attention Maps for Image Classification

Andong Tan, Duc Tam Nguyen, Maximilian Dax and
Matthias Nießner, Thomas Brox

Keywords Paper

0

0

0

0

16:59

02/02/2021

Object-Centric Image Generation from Layouts

Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio and
R Devon Hjelm, Shikhar Sharma

Keywords Paper

0

0

0

0

17:44

22/11/2021

Enhancing Local Feature Learning for 3D Point Cloud Processing using Unary-Pairwise Attention

Haoyi Xiu, Xin Liu, Weimin Wang and
Kyoung-Sook Kim, Takayuki Shinohara, Qiong Chang, Masashi Matsuoka

Keywords Paper

3D point clouds, self-attention, 3D point cloud deep learning

0

0

0

0

2:51

02/02/2021

Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

Ardhendu Behera, Zachary Wharton, Pradeep R P G Hewage, Asish Bera

Keywords Paper

0

0

0

0

18:54

02/02/2021

Exploiting Relationship for Complex-scene Image Generation

Tianyu Hua, Hongdong Zheng, Yalong Bai and
Wei Zhang, Xiao-Ping Zhang, Tao Mei

Keywords Paper

0

0

0

0

15:01

05/01/2021

Structured Visual Search via Composition-Aware Learning

Mert Kilickaya, Arnold W.M. Smeulders

Keywords Paper

0

0

0

0

0:44

14/06/2020

ASLFeat: Learning Local Features of Accurate Shape and Localization

Zixin Luo, Lei Zhou, Xuyang Bai and
Hongkai Chen, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan

Keywords Paper

image matching, local feature keypoints, local feature descriptors, deep learning

0

0

0

0

1:01

25/07/2020

Regional relation modeling for visual place recognition

Yingying Zhu, Biao Li, Jiong Wang, Zhou Zhao

Keywords Paper

convolutional neural network, visual place recognition, content-based image retrieval, relation modeling

0

0

0

0

14:11

14/06/2020

PointGMM: A Neural GMM Network for Point Clouds

Amir Hertz, Rana Hanocka, Raja Giryes, Daniel Cohen-Or

Keywords Paper

point clouds, gmm, hierarchical models, shape generation, registration, 3d, attention, 3d representation, deep learning, machine learning

0

0

0

0

1:01

14/06/2020

Deep 3D Capture: Geometry and Reflectance From Sparse Multi-View Images

Sai Bi, Zexiang Xu, Kalyan Sunkavalli and
David Kriegman, Ravi Ramamoorthi

Keywords Paper

appearance acquisition, 3d reconstruction, multi-view stereo

0

0

0

0

1:01

14/06/2020

PVN3D: A Deep Point-Wise 3D Keypoints Voting Network for 6DoF Pose Estimation

Yisheng He, Wei Sun, Haibin Huang and
Jianran Liu, Haoqiang Fan, Jian Sun

Keywords Paper

6d pose estimation, 3d instance segmentation, 3d semantic segmentation, 3d keypoint, 3d scene understanding, vision for robotics, 3d single view, rgbd, 3d computer vision

0

0

0

0

1:01

02/02/2021

Patch-Wise Attention Network for Monocular Depth Estimation

Sihaeng Lee, Janghyeon Lee, Byungju Kim and
Eojindl Yi, Junmo Kim

Keywords Paper

0

0

0

0

14:15

22/11/2021

Planar Shape Based Registration for Multi-modal Geometry

Muxingzi Li, Florent Lafarge

Keywords Paper

global registration, energy minimization, geometric primitives, point cloud, polygonal mesh

0

0

0

0

3:00

02/02/2021

Geodesic-HOF: 3D Reconstruction Without Cutting Corners

Ziyun Wang, Eric A. Mitchell, Volkan Isler, Daniel D. Lee

Keywords Paper

0

0

0

0

19:26

14/06/2020

Joint Texture and Geometry Optimization for RGB-D Reconstruction

Yanping Fu, Qingan Yan, Jie Liao, Chunxia Xiao

Keywords Paper

rgb-d reconstruction, 3d reconstruction, texture optimization, geometry optimization, joint texture and geometry optimization

0

0

0

0

0:57

06/12/2021

Twins: Revisiting the Design of Spatial Attention in Vision Transformers

Xiangxiang Chu, Zhi Tian, Yuqing Wang and
Bo Zhang, Haibing Ren, Xiaolin Wei, Huaxia Xia, Chunhua Shen

Keywords Paper

deep learning, machine learning, transformers, vision

0

0

0

0

5:29

30/11/2020

Local Context Attention for Salient Object Segmentation

Jing Tan Research, Pengfei Xiong Research, Zhengyi Lv Research and
Kuntao Xiao Research, Yuwen He Research

Keywords Paper

0

0

0

0

9:35

07/09/2020

Image Harmonization with Attention-based Deep Feature Modulation

Guoqing Hao, Satoshi Iizuka, Kazuhiro Fukui

Keywords Paper

image harmonization, feature map modulation, attention

0

0

0

0

5:03

14/06/2020

Interactive Object Segmentation With Inside-Outside Guidance

Shiyin Zhang, Jun Hao Liew, Yunchao Wei and
Shikui Wei, Yao Zhao

Keywords Paper

interactive segmentation, annotation, dataset

0

0

0

0

4:58

17/08/2020

Cut-enhanced PolyCube-maps for feature-aware all-hex meshing

Hao-Xiang Guo, Xiaohan Liu, Dong-Ming Yan, Yang Liu

Keywords Paper

PolyCube-Map, cut-enhanced, feature-aware, hexahedral meshing

0

0

0

0

16:47

02/02/2021

Dual-level Collaborative Transformer for Image Captioning

Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun and
Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji

Keywords Paper

0

0

0

0

14:58

14/06/2020

Novel View Synthesis of Dynamic Scenes With Globally Coherent Depths From a Monocular Camera

Jae Shin Yoon, Kihwan Kim, Orazio Gallo and
Hyun Soo Park, Jan Kautz

Keywords Paper

view synthesis, depth estimation, dynamic scene, depth fusion, globally coherent depth, monocular camera

0

0

0

0

1:00

14/06/2020

Self-Supervised Monocular Scene Flow Estimation

Junhwa Hur, Stefan Roth

Keywords Paper

monocular scene flow, self-supervised learning, 3d scene flow, optical flow, monocular depth estimation

0

0

0

0

5:00

14/06/2020

Efficient and Robust Shape Correspondence via Sparsity-Enforced Quadratic Assignment

Rui Xiang, Rongjie Lai, Hongkai Zhao

Keywords Paper

shape matching, non-rigid transformation, point cloud matching, quadratic assignment, sparsity control, laplace-beltrami operator, local distortion

0

0

0

0

1:01

14/06/2020

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

Xiaotian Li, Shuzhe Wang, Yi Zhao and
Jakob Verbeek, Juho Kannala

Keywords Paper

visual localization, camera relocalization, scene coordinate regression

0

0

0

0

1:01