22/11/2021

Rethinking local and global feature representation for semantic segmentation

Mohan Chen, Xinxuan Zhao, Bingfei Fu, Li Zhang, Xiangyang Xue

Keywords: Semantic Segmentation, Transformer

Abstract: Although fully convolutional networks (FCNs) have dominated semantic segmentation since their inception, they are inherently limited in capturing long-range structured relationships with layers of local kernels. While recent Transformer-based models have proven extremely successful in computer vision tasks by capturing global representations, they can deteriorate semantic segmentation by over-smoothing regions that contain fine details (e.g., boundaries and small objects). To this end, we propose a Dual-Stream Convolution-Transformer segmentation framework, called DSCT, which takes advantage of both convolutions and Transformers to learn a rich feature representation for semantic segmentation. Specifically, DSCT extracts high-resolution local feature information from convolution layers and a global feature representation across the Transformer layers. Moreover, a feature fusion module is plugged in to exchange information between the spatial stream and the context stream at each stage. With local and global context modeled explicitly in every layer, the two streams can be combined with a simple decoder to yield a powerful segmentation model. Extensive experiments show that our model sets a new state of the art on the Cityscapes dataset (83.31% mIoU) with only 80K training iterations and achieves appealing performance (49.27% mIoU) on ADE20K, outperforming most alternatives with a new perspective.
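
The abstract describes a two-stream design: a convolutional spatial stream for high-resolution local features, a Transformer context stream for global features, and a fusion module exchanging information between them at each stage. Below is a minimal PyTorch sketch of that idea, not the authors' implementation; the module names (FusionModule, DualStreamStage), channel sizes, and the specific fusion design are illustrative assumptions.

```python
# Sketch of a dual-stream convolution + Transformer stage with fusion.
# All design details here are assumptions for illustration only.
import torch
import torch.nn as nn
import torch.nn.functional as F


class FusionModule(nn.Module):
    """Hypothetical bidirectional exchange between the two streams."""

    def __init__(self, channels: int):
        super().__init__()
        self.local_to_global = nn.Conv2d(channels, channels, kernel_size=1)
        self.global_to_local = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, local_feat, global_feat):
        # Upsample the coarser global map to the local resolution before mixing.
        g = F.interpolate(global_feat, size=local_feat.shape[-2:],
                          mode="bilinear", align_corners=False)
        fused_local = local_feat + self.global_to_local(g)
        # Pool the local map down to the global resolution for the reverse path.
        l = F.adaptive_avg_pool2d(local_feat, global_feat.shape[-2:])
        fused_global = global_feat + self.local_to_global(l)
        return fused_local, fused_global


class DualStreamStage(nn.Module):
    """One stage: conv block (spatial stream) + Transformer block (context stream) + fusion."""

    def __init__(self, channels: int, num_heads: int = 4):
        super().__init__()
        self.conv_block = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )
        self.transformer_block = nn.TransformerEncoderLayer(
            d_model=channels, nhead=num_heads, batch_first=True)
        self.fusion = FusionModule(channels)

    def forward(self, local_feat, global_feat):
        local_feat = self.conv_block(local_feat)
        # Flatten the global stream to tokens for self-attention, then restore.
        b, c, h, w = global_feat.shape
        tokens = global_feat.flatten(2).transpose(1, 2)          # (B, HW, C)
        tokens = self.transformer_block(tokens)
        global_feat = tokens.transpose(1, 2).reshape(b, c, h, w)
        return self.fusion(local_feat, global_feat)


if __name__ == "__main__":
    stage = DualStreamStage(channels=64)
    local = torch.randn(1, 64, 64, 64)    # high-resolution spatial stream
    global_ = torch.randn(1, 64, 16, 16)  # downsampled context stream
    local, global_ = stage(local, global_)
    print(local.shape, global_.shape)
```

Stacking several such stages and feeding both outputs to a light decoder would mirror the framework the abstract outlines, though the paper itself should be consulted for the actual architecture.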

Talk and the corresponding paper were published at the BMVC 2021 virtual conference.
