RGB-D Co-attention Network for Semantic Segmentation

30/11/2020

RGB-D Co-attention Network for Semantic Segmentation

Hao Zhou, Lu Qi, Zhaoliang Wan, Hai Huang, Xu Yang

Keywords:

Abstract Paper Similar Papers

Abstract: Incorporating the depth (D) information for RGB images has proven the effectiveness and robustness in semantic segmentation. However, the fusion between them is still a challenge due to their meaning discrepancy, in which RGB represents the color but D depth information. In this paper, we propose a co-attention Network (CANet) to capture the fine-grained interplay between RGB�_and D�_ features. The key part in our CANet is co-attention fusion part. It includes three modules. At first, the position and channel co-attention fusion modules adaptively fuse color and depth features in spatial and channel dimension. Finally, a final fusion module integrates the outputs of the two co-attention fusion modules for forming a more representative feature. Our extensive experiments validate the effectiveness of CANet in fusing RGB and D features, achieving the state-of-the-art performance on two challenging RGB-D semantic segmentation datasets, i.e., NYUDv2, SUN-RGBD.

The video of this talk cannot be embedded. You can watch it here:

https://accv2020.github.io/miniconf/poster_681.html

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACCV 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

22/11/2021

FETNet: Feature Exchange Transformer Network for RGB-D Object Detection

Zhibin Xiao, Jing-Hao Xue, Pengwei Xie, Guijin Wang

Keywords Paper

RGB-D object detection, Multi-modal Fusion, Vision Transformer, Feature Exchange

0

0

0

0

2:54

22/11/2021

Multiple Fusion Adaptation: A Strong Framework for Unsupervised Semantic Segmentation Adaptation

Kai Zhang, Yifan Sun, Rui Wang and
Haichang Li, Xiaohui Hu

Keywords Paper

domain adaptation, semantic segmentation, pseudo label learning

0

0

0

0

2:48

14/06/2020

Learning Selective Self-Mutual Attention for RGB-D Saliency Detection

Nian Liu, Ni Zhang, Junwei Han

Keywords Paper

rgb-d saliency detection, middle fusion, self-attention, mutual-attention, non-local network, two-stream cnn

0

0

0

0

1:01

05/01/2021

Cross-Modality 3D Object Detection

Ming Zhu, Chao Ma, Pan Ji, Xiaokang Yang

Keywords Paper

0

0

0

0

4:48

14/06/2020

Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation

Gen Luo, Yiyi Zhou, Xiaoshuai Sun and
Liujuan Cao, Chenglin Wu, Cheng Deng, Rongrong Ji

Keywords Paper

referring expression comprehension, referring expression segmentation, multi-task learning, visual grounding, object detection

0

0

0

0

5:00

14/06/2020

Multi-Modality Cross Attention Network for Image and Sentence Matching

Xi Wei, Tianzhu Zhang, Yan Li and
Yongdong Zhang, Feng Wu

Keywords Paper

cross modal, retrieval, transformer, attention, intra-modality, inter-modality

0

0

0

0

0:59

06/12/2020

Deep Multimodal Fusion by Channel Exchanging

Yikai Wang, Wenbing Huang, Fuchun Sun and
Tingyang Xu, Yu Rong, Junzhou Huang

Keywords Paper

0

0

0

0

3:12

02/02/2021

Attention-based Multi-Level Fusion Network for Light Field Depth Estimation

Jiaxin Chen, Shuo Zhang, Youfang Lin

Keywords Paper

0

0

0

0

14:19

02/02/2021

RGB-D Salient Object Detection via 3D Convolutional Neural Networks

Qian Chen, Ze Liu, Yi Zhang and
Keren Fu, Qijun Zhao, Hongwei Du

Keywords Paper

0

0

0

0

14:04

14/06/2020

JL-DCF: Joint Learning and Densely-Cooperative Fusion Framework for RGB-D Salient Object Detection

Keren Fu, Deng-Ping Fan, Ge-Peng Ji, Qijun Zhao

Keywords Paper

visual saliency, salient object detection, rgb-d, depth information, joint learning, dense connections, multi-modal features, feature fusion, deep learning, encoder-decoder

0

0

0

0

1:01

30/11/2020

Low-light Color Imaging via Dual Camera Acquisition

Peiyao Guo, Zhan Ma

Keywords Paper

0

0

0

0

7:28

06/12/2021

End-to-end Multi-modal Video Temporal Grounding

Yi-Wen Chen, Yi-Hsuan Tsai, Ming-Hsuan Yang

Keywords Paper

self-supervised learning, transformers, vision, contrastive learning

0

0

0

0

8:46

22/11/2021

Paying Attention to Varying Receptive Fields: Object Detection with Atrous Filters and Vision Transformers

Arthur Jian Shun Lam, Jun Yi Lim, Ricky Sutopo, Vishnu Monn Baskaran

Keywords Paper

object detection, atrous convolution, vision transformers, attention mechanism

0

0

0

0

3:01

07/09/2020

Centroid Based Concept Learning for RGB-D Indoor Scene Classification

Ali Ayub, Alan Wagner

Keywords Paper

cognitively-inspired learning, RGBD analysis, scene classification, category merging, labeling flaws analysis

0

0

0

0

10:03

14/06/2020

A U-Net Based Discriminator for Generative Adversarial Networks

Edgar Schönfeld, Bernt Schiele, Anna Khoreva

Keywords Paper

gan, image synthesis, u-net, discriminator, consistency regularization, equivariance, generative adversarial networks, ffhq, biggan

0

0

0

0

1:01

02/02/2021

Correlative Channel-Aware Fusion for Multi-View Time Series Classification

Yue Bai, Lichen Wang, Zhiqiang Tao and
Sheng Li, Yun Fu

Keywords Paper

0

0

0

0

14:04

02/02/2021

Learning Visual Context for Group Activity Recognition

Hangjie Yuan, Dong Ni

Keywords Paper

0

0

0

0

16:54

19/08/2021

Step-Wise Hierarchical Alignment Network for Image-Text Matching

Zhong Ji, Kexin Chen, Haoran Wang

Keywords Paper

Computer Vision, Language and Vision

0

0

0

0

6:07

06/12/2021

Class-agnostic Reconstruction of Dynamic Objects from Videos

Zhongzheng Ren, Xiaoming Zhao, Alex Schwing

Keywords Paper

0

0

0

0

13:29

30/11/2020

Jointly Discriminating and Frequent Visual Representation Mining

Qiannan Wang, Ying Zhou, ZhaoYan Zhu and
Xuefeng Liang, Yu Gu

Keywords Paper

0

0

0

0

8:13

14/06/2020

A2dele: Adaptive and Attentive Depth Distiller for Efficient RGB-D Salient Object Detection

Yongri Piao, Zhengkun Rong, Miao Zhang and
Weisong Ren, Huchuan Lu

Keywords Paper

rgb-d, salient object dection, knowledge distillation, attention, computer vision, cnn

0

0

0

0

1:00

25/07/2020

3D self-attention for unsupervised video quantization

Jingkuan Song, Ruimin Lang, Xiaosu Zhu and
Xing Xu, Lianli Gao, Heng Tao Shen

Keywords Paper

quantization, video retrieval, ann search

0

0

0

0

9:44

02/02/2021

Similarity Reasoning and Filtration for Image-Text Matching

Haiwen Diao, Ying Zhang, Lin Ma, Huchuan Lu

Keywords Paper

0

0

0

0

16:34

22/11/2021

Multi-Modality Task Cascade for 3D Object Detection

Jinhyung Park, Xinshuo Weng, Yunze Man, Kris Kitani

Keywords Paper

Multi Modality Learning, Object Detection, Semantic Segmentation

0

0

0

0

3:03

05/01/2021

Dual-Stream Fusion Network for Spatiotemporal Video Super-Resolution

Min-Yuan Tseng, Yen-Chung Chen, Yi-Lun Lee and
Wei-Sheng Lai, Yi-Hsuan Tsai, Wei-Chen Chiu

Keywords Paper

0

0

0

0

4:58

06/12/2021

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

Chaoqun Wang, Shaobo Min, Xuejin Chen and
Xiaoyan Sun, Houqiang Li

Keywords Paper

0

0

0

0

10:51

03/05/2021

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning

Xuebo Liu, Longyue Wang, Derek Wong and
Liam Ding, Lidia Chao, Zhaopeng Tu

Keywords Paper

Sequence-to-sequence learning, Encoder layer fusion, Transformer, Grammatical error correction, Summarization, Machine translation

0

0

0

0

4:53

14/06/2020

Learning Fused Pixel and Feature-Based View Reconstructions for Light Fields

Jinglei Shi, Xiaoran Jiang, Christine Guillemot

Keywords Paper

light field, view synthesis, feature-based reconstruction, pixel-based reconstruction, deep learning, angular super-resolution

0

0

0

0

4:56

14/06/2020

Squeeze-and-Attention Networks for Semantic Segmentation

Zilong Zhong, Zhong Qiu Lin, Rene Bidart and
Xiaodan Hu, Ibrahim Ben Daya, Zhifeng Li, Wei-Shi Zheng, Jonathan Li, Alexander Wong

Keywords Paper

semantic segmentation, squeeze-and-attention, pixel grouping

0

0

0

0

1:01

06/12/2020

Discover, Hallucinate, and Adapt: Open Compound Domain Adaptation for Semantic Segmentation

KwanYong Park, Sanghyun Woo, Inkyu Shin, In So Kweon

Keywords Paper

Probabilistic Methods -> Bayesian Nonparametrics, Algorithms -> Meta-Learning

0

0

0

0

3:25

14/06/2020

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

Yancheng Wang, Yang Xiao, Fu Xiong and
Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

Keywords Paper

3d action recognition, point cloud, 3d motion, temporal rank pooling, pointnet++, multi-stream network

0

0

0

0

1:01

14/06/2020

Image Search With Text Feedback by Visiolinguistic Attention Learning

Yanbei Chen, Shaogang Gong, Loris Bazzani

Keywords Paper

vision and language, image search, text feedback, attention mechanism, transformer, multimodal learning, representation learning, composition, image retrieval, interactive image search

0

0

0

0

1:00

30/11/2020

MIX'EM: Unsupervised Image Classification using a Mixture of Embeddings

Ali Varamesh, Tinne Tuytelaars

Keywords Paper

0

0

0

0

6:40

14/06/2020

Anisotropic Convolutional Networks for 3D Semantic Scene Completion

Jie Li, Kai Han, Peng Wang and
Yu Liu, Xia Yuan

Keywords Paper

semantic scene completion, dense voxel prediction, shape completion, semantic segmentation, rgb-d, anisotropic convolution, voxel-wise receptive fields, 3d convolution

0

0

0

0

1:01

02/02/2021

F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation

Daizong Liu, Dongdong Yu, Changhu Wang, Pan Zhou

Keywords Paper

0

0

0

0

16:59

14/06/2020

Structure-Preserving Super Resolution With Gradient Guidance

Cheng Ma, Yongming Rao, Yean Cheng and
Ce Chen, Jiwen Lu, Jie Zhou

Keywords Paper

super resolution, image restoration, image enhancement, structure preserving, generative model, generative adversarial network, gan, deep-learning

0

0

0

0

1:01

18/07/2021

A Bit More Bayesian: Domain-Invariant Learning with Uncertainty

Zehao Xiao, Jiayi Shen, Xiantong Zhen and
Ling Shao, Cees Snoek

Keywords Paper

Algorithms, Model Selection and Structure Learning, Applications, Computational Biology and Bioinformatics; Applications, Health; Deep Learning, Adversarial Networks; Theory, Deep Learning, Bayesian Deep Learning

0

0

0

0

5:46

22/11/2021

Hierarchical Interaction Network for Video Object Segmentation from Referring Expressions

Zhao Yang, Yansong Tang, Luca Bertinetto and
Hengshuang Zhao, Philip Torr

Keywords Paper

segmentation, video object segmentation, referring segmentation, referring video object segmentation, video object segmentation from referring expressions, referring image segmentation, referring image comprehension, optical flow, visual grounding

0

0

0

0

2:57

19/08/2021

Modality-aware Style Adaptation for RGB-Infrared Person Re-Identification

Ziling Miao, Hong Liu, Wei Shi and
Wanlu Xu, Hanrong Ye

Keywords Paper

Computer Vision, Recognition, Learning Generative Models, Transfer, Adaptation, Multi-task Learning

0

0

0

0

12:18

07/09/2020

MagnifierNet: Towards Semantic Adversary and Fusion for Person Re-identification

Yushi Lan, Yuan Liu, Xinchi Zhou and
Tian Maoqing, Xuesen Zhang, Shuai Yi, Hongsheng Li

Keywords Paper

person re-identification, adversarial samples, metric learning, multi-task learning, image retrieval

0

0

0

0

5:58