Deep Grouping Model for Unified Perceptual Parsing

14/06/2020

Deep Grouping Model for Unified Perceptual Parsing

Zhiheng Li, Wenxuan Bao, Jiayang Zheng, Chenliang Xu

Keywords: perceptual grouping, hierarhical graph, bottom-up segmentation, interpretability, unified perceptual parsing, graph neural network, interactive-segmentation, weakly-supervised segmentation

Abstract Paper Similar Papers

Abstract: The perceptual-based grouping process produces a hierarchical and compositional image representation that helps both human and machine vision systems recognize heterogeneous visual concepts. Examples can be found in the classical hierarchical superpixel segmentation or image parsing works. However, the grouping process is largely overlooked in modern CNN-based image segmentation networks due to many challenges, including the inherent incompatibility between the grid-shaped CNN feature map and the irregular-shaped perceptual grouping hierarchy. Overcoming these challenges, we propose a deep grouping model (DGM) that tightly marries the two types of representations and defines a bottom-up and a top-down process for feature exchanging. When evaluating the model on the recent Broden+ dataset for the unified perceptual parsing task, it achieves state-of-the-art results while having a small computational overhead compared to other contextual-based segmentation models. Furthermore, the DGM has better interpretability compared with modern CNN methods.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at CVPR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

05/01/2021

Hierarchical Generative Adversarial Networks for Single Image Super-Resolution

Weimin Chen, Yuqing Ma, Xianglong Liu, Yi Yuan

Keywords Paper

0

0

0

0

4:46

14/06/2020

Erasing Integrated Learning: A Simple Yet Effective Approach for Weakly Supervised Object Localization

Jinjie Mai, Meng Yang, Wenfeng Luo

Keywords Paper

weakly supervised, object localization, adversarial erasing

0

0

0

0

5:00

02/02/2021

Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

Ardhendu Behera, Zachary Wharton, Pradeep R P G Hewage, Asish Bera

Keywords Paper

0

0

0

0

18:54

14/06/2020

On Translation Invariance in CNNs: Convolutional Layers Can Exploit Absolute Spatial Location

Osman Semih Kayhan, Jan C. van Gemert

Keywords Paper

inductive prior, equivariance, translation invariance, shift invariance, data efficiency, convolution, boundary effects, padding

0

0

0

0

0:59

02/02/2021

Looking Wider for Better Adaptive Representation in Few-Shot Learning

Jiabao Zhao, Yifan Yang, Xin Lin and
Jing Yang, Liang He

Keywords Paper

0

0

0

0

16:58

06/12/2020

DVERGE: Diversifying Vulnerabilities for Enhanced Robust Generation of Ensembles

Huanrui Yang, Jingyang Zhang, Hongliang Dong and
Nathan Inkawhich, Andrew Gardner, Andrew Touchet, Wesley Wilkes, Heath Berry, Helen Li

Keywords Paper

0

0

0

0

3:25

06/12/2021

Do Vision Transformers See Like Convolutional Neural Networks?

Maithra Raghu, Thomas Unterthiner, Simon Kornblith and
Chiyuan Zhang, Alexey Dosovitskiy

Keywords Paper

deep learning, machine learning, transformers, vision, representation learning, transfer learning

0

0

0

0

13:13

06/12/2021

Encoding Spatial Distribution of Convolutional Features for Texture Representation

Yong Xu, Feng Li, Zhile Chen and
Jinxiu Liang, Yuhui Quan

Keywords Paper

deep learning, machine learning

0

0

0

0

15:10

14/06/2020

Attentive Normalization for Conditional Image Generation

Yi Wang, Ying-Cong Chen, Xiangyu Zhang and
Jian Sun, Jiaya Jia

Keywords Paper

image generation, generative adversarial network, self-attention, normalization, long-range dependency, image inpainting

0

0

0

0

4:59

05/01/2021

Facial Expression Recognition in the Wild via Deep Attentive Center Loss

Amir Hossein Farzaneh, Xiaojun Qi

Keywords Paper

0

0

0

0

4:59

22/11/2021

Global Context and Geometric Priors for Effective Non-Local Self-Attention

Sanghyun Woo, Dahun Kim, Joon-Young Lee, In So Kweon

Keywords Paper

self-attention, non-local attention, attention, transformer, context, position encoding

0

0

0

0

3:03

14/06/2020

Local Context Normalization: Revisiting Local Normalization

Anthony Ortiz, Caleb Robinson, Dan Morris and
Olac Fuentes, Christopher Kiekintveld, Md Mahmudulla Hassan, Nebojsa Jojic

Keywords Paper

normalization layer, image segmentation, instance segmentation, contrast enhancement

0

0

0

0

4:55

02/02/2021

Learning Comprehensive Motion Representation for Action Recognition

Mingyu Wu, Boyuan Jiang, Donghao Luo and
Junchi Yan, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xiaokang Yang

Keywords Paper

0

0

0

0

15:15

02/02/2021

Multi-Proxy Wasserstein Classifier for Image Classification

Benlin Liu, Yongming Rao, Jiwen Lu and
Jie Zhou, Cho-Jui Hsieh

Keywords Paper

0

0

0

0

12:05

22/11/2021

Feature Fusion Vision Transformer for Fine-Grained Visual Categorization

Jun Wang, Xiaohan Yu, Yongsheng Gao

Keywords Paper

Fine-grained visual categorization, Vision transformer, Self-attention, Feature Fusion

0

0

0

0

3:02

14/06/2020

Context Prior for Scene Segmentation

Changqian Yu, Jingbo Wang, Changxin Gao and
Gang Yu, Chunhua Shen, Nong Sang

Keywords Paper

semantic segmentation, scene segmentation, context prior, context aggregation, affinity loss, affinity matrix

0

0

0

0

1:01

30/11/2020

Attention-Aware Feature Aggregation for Real-time Stereo Matching on Edge Devices

Jia-Ren Chang National Chiao Tung University, aetherAI, Pei-Chun Chang, Yong-Sheng Chen

Keywords Paper

0

0

0

0

9:53

14/06/2020

Improving Convolutional Networks With Self-Calibrated Convolutions

Jiang-Jiang Liu, Qibin Hou, Ming-Ming Cheng and
Changhu Wang, Jiashi Feng

Keywords Paper

self-calibrated, feature transformation, image classification, network architecture, convolutional neural networks

0

0

0

0

1:00

06/12/2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training

Hongwei Xue, Yupan Huang, Bei Liu and
Houwen Peng, Jianlong Fu, Houqiang Li, Jiebo Luo

Keywords Paper

optimization, transformers, vision

0

0

0

0

10:13

30/11/2020

MLIFeat: Multi-level information fusion based deep local features

Yuyang Zhang Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences and
Jinge Wang, Shibiao Xu, Xiao Liu, Xiaopeng Zhang

Keywords Paper

0

0

0

0

5:28

07/09/2020

Few-Shot Learning with Complex-valued Neural Networks

Zhen Liu, Baochang Zhang, Guodong Guo

Keywords Paper

few-shot learning, complex-valued network, metric-learning, image classification

0

0

0

0

7:15

14/06/2020

Deep Facial Non-Rigid Multi-View Stereo

Ziqian Bai, Zhaopeng Cui, Jamal Ahmed Rahim and
Xiaoming Liu, Ping Tan

Keywords Paper

multi-view 3d, non-rigid reconstruction, face reconstruction, deep learning

0

0

0

0

1:01

02/02/2021

Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection

Jiajun Deng, Shaoshuai Shi, Peiwei Li and
Wengang Zhou, Yanyong Zhang, Houqiang Li

Keywords Paper

0

0

0

0

16:42

14/06/2020

Deep Degradation Prior for Low-Quality Image Classification

Yang Wang, Yang Cao, Zheng-Jun Zha and
Jing Zhang, Zhiwei Xiong

Keywords Paper

low quality image classification, degraded image recognition, deep degradation prior, without semantic supervision, non classical receptive field

0

0

0

0

4:56

03/05/2021

Attentional Constellation Nets for Few-Shot Learning

Weijian Xu, Yifan Xu, Huaijin Wang, Zhuowen Tu

Keywords Paper

few-shot learning, constellation models

0

0

0

0

5:10

06/12/2020

Soft Contrastive Learning for Visual Localization

Janine Thoma, Danda Pani Paudel, Luc V Gool

Keywords Paper

0

0

0

0

3:18

22/11/2021

Rethinking Token-Mixing MLP for MLP-based Vision Backbone

Tan Yu, XU LI, Yunfeng Cai and
Mingming Sun, Ping Li

Keywords Paper

vision backbone, MLP, image recognition

0

0

0

0

1:59

22/11/2021

Median Pixel Difference Convolutional Network for Robust Face Recognition

Jiehua Zhang, Zhuo Su, Li Liu

Keywords Paper

face recognition, noise robustness, efficient CNN

0

0

0

0

3:03

06/12/2020

Learning Physical Graph Representations from Visual Scenes

Daniel Bear, Chaofei Fan, Damian Mrowca and
Yunzhu Li, Seth Alter, Aran Nayebi, Jeremy Schwartz, Li Fei-Fei, Jiajun Wu, Josh Tenenbaum, Daniel Yamins

Keywords Paper

0

0

0

0

3:19

14/06/2020

Adaptive Fractional Dilated Convolution Network for Image Aesthetics Assessment

Qiuyu Chen, Wei Zhang, Ning Zhou and
Peng Lei, Yi Xu, Yu Zheng, Jianping Fan

Keywords Paper

image aesthetics assessment, kernel embedding, adaptive convolution, parameter-free, aspect ratio

0

0

0

0

1:01

06/12/2020

The Origins and Prevalence of Texture Bias in Convolutional Neural Networks

Katherine L. Hermann, Ting Chen, Simon Kornblith

Keywords Paper

0

0

0

1

3:11

06/12/2021

Efficient Training of Visual Transformers with Small Datasets

Yahui Liu, Enver Sangineto, Wei Bi and
Nicu Sebe, Bruno Lepri, Marco Nadai

Keywords Paper

robustness, transformers, vision

0

0

0

0

8:23

22/11/2021

MVT: Multi-view Vision Transformer for 3D Object Recognition

Shuo Chen, Tan Yu, Ping Li

Keywords Paper

3D object recognition, Transformer-based methods

0

0

0

0

2:51

14/06/2020

Light Field Spatial Super-Resolution via Deep Combinatorial Geometry Embedding and Structural Consistency Regularization

Jing Jin, Junhui Hou, Jie Chen, Sam Kwong

Keywords Paper

light field, super-resolution, deep learning, convolutional neural networks

0

0

0

0

1:00

06/12/2021

MLP-Mixer: An all-MLP Architecture for Vision

Ilya Tolstikhin, Neil Houlsby, Alexander Kolesnikov and
Lucas Beyer, Xiaohua Zhai, Thomas Unterthiner, Jessica Yung, Andreas Steiner, Daniel Keysers, Jakob Uszkoreit, Mario Lucic, Alexey Dosovitskiy

Keywords Paper

deep learning, machine learning, transformers, vision, transfer learning

0

0

0

0

11:18

02/02/2021

Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation

Jilin Tang, Yi Yuan, Tianjia Shao and
Yong Liu, Mengmeng Wang, Kun Zhou

Keywords Paper

0

0

0

0

14:33

05/01/2021

On the Texture Bias for Few-Shot CNN Segmentation

Reza Azad, Abdur R. Fayjie, Claude Kauffmann and
Ismail Ben Ayed, Marco Pedersoli, Jose Dolz

Keywords Paper

0

0

0

0

4:55

19/08/2021

DeepME: Deep Mixture Experts for Large-scale Image Classification

Ming He, Guangyi Lv, Weidong He and
Jianping Fan, Guihua Zeng

Keywords Paper

Computer Vision, Recognition, Deep Learning, Classification

0

0

0

0

12:22

14/06/2020

Regularizing CNN Transfer Learning With Randomised Regression

Yang Zhong, Atsuto Maki

Keywords Paper

transfer learning, network regularization, randomised regression, pseudo task regularization, limited samples

0

0

0

0

0:58

02/02/2021

Explaining Convolutional Neural Networks through Attribution-Based Input Sampling and Block-Wise Feature Aggregation

Sam Sattarzadeh, Mahesh Sudhakar, Anthony Lem and
Shervin Mehryar, Konstantinos N Plataniotis, Jongseong Jang, Hyunwoo Kim, Yeonjeong Jeong, Sangmin Lee, Kyunghoon Bae

Keywords Paper

0

0

0

0

19:59