06/12/2021

HRFormer: High-Resolution Vision Transformer for Dense Prediction

Yuhui Yuan, Rao Fu, Lang Huang, Weihong Lin, Chao Zhang, Xilin Chen, Jingdong Wang

Keywords: transformers, vision

Abstract: We present the High-Resolution Transformer (HRFormer), which learns high-resolution representations for dense prediction tasks, in contrast to the original Vision Transformer, which produces low-resolution representations and has high memory and computational cost. We take advantage of the multi-resolution parallel design introduced in high-resolution convolutional networks (HRNet [45]), along with local-window self-attention that performs self-attention over small non-overlapping image windows [21], to improve memory and computation efficiency. In addition, we introduce a convolution into the FFN to exchange information across the disconnected image windows. We demonstrate the effectiveness of the High-Resolution Transformer on both human pose estimation and semantic segmentation tasks, e.g., HRFormer outperforms the Swin Transformer [27] by 1.3 AP on COCO pose estimation with 50% fewer parameters and 30% fewer FLOPs. Code is available at: https://github.com/HRNet/HRFormer
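The abstract's two key ideas, self-attention restricted to small non-overlapping windows and a depth-wise convolution inside the FFN that re-connects those windows, can be sketched in a few lines of PyTorch. This is a minimal illustration based only on the description above; class names such as LocalWindowAttention and ConvFFN are invented here, and normalization layers are omitted for brevity, so consult the linked repository for the actual implementation.

```python
# Minimal sketch of the two ideas described in the abstract:
# (1) self-attention computed independently inside small non-overlapping
#     windows, and (2) a 3x3 depth-wise convolution in the FFN so that
#     information can flow across window boundaries.
# Module names are illustrative, not the HRFormer repository's API.
import torch
import torch.nn as nn


class LocalWindowAttention(nn.Module):
    """Multi-head self-attention computed per non-overlapping window."""

    def __init__(self, dim, num_heads=4, window_size=7):
        super().__init__()
        self.window_size = window_size
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, x):  # x: (B, C, H, W); H and W divisible by window_size
        B, C, H, W = x.shape
        s = self.window_size
        # Partition the feature map into (H/s * W/s) windows of s*s tokens.
        x = x.view(B, C, H // s, s, W // s, s)
        x = x.permute(0, 2, 4, 3, 5, 1).reshape(-1, s * s, C)
        x, _ = self.attn(x, x, x)  # attention only within each window
        # Reverse the window partition back to (B, C, H, W).
        x = x.view(B, H // s, W // s, s, s, C)
        x = x.permute(0, 5, 1, 3, 2, 4).reshape(B, C, H, W)
        return x


class ConvFFN(nn.Module):
    """FFN with a depth-wise 3x3 conv between the two point-wise layers,
    letting neighboring windows exchange information."""

    def __init__(self, dim, expansion=4):
        super().__init__()
        hidden = dim * expansion
        self.net = nn.Sequential(
            nn.Conv2d(dim, hidden, kernel_size=1),
            nn.GELU(),
            nn.Conv2d(hidden, hidden, kernel_size=3, padding=1, groups=hidden),
            nn.GELU(),
            nn.Conv2d(hidden, dim, kernel_size=1),
        )

    def forward(self, x):
        return self.net(x)


class HRFormerBlockSketch(nn.Module):
    def __init__(self, dim, num_heads=4, window_size=7):
        super().__init__()
        self.attn = LocalWindowAttention(dim, num_heads, window_size)
        self.ffn = ConvFFN(dim)

    def forward(self, x):
        x = x + self.attn(x)  # windowed attention, residual connection
        x = x + self.ffn(x)   # conv-augmented FFN, residual connection
        return x


if __name__ == "__main__":
    block = HRFormerBlockSketch(dim=32)
    out = block(torch.randn(1, 32, 28, 28))  # 28 is divisible by 7
    print(out.shape)  # torch.Size([1, 32, 28, 28])
```

Because attention never crosses window boundaries, the depth-wise convolution in the FFN is what propagates information between adjacent windows, which is cheaper than the shifted-window scheme used by Swin.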

The talk and the corresponding paper were published at the NeurIPS 2021 virtual conference.
