Exploring Self-Attention for Image Recognition

14/06/2020

Exploring Self-Attention for Image Recognition

Hengshuang Zhao, Jiaya Jia, Vladlen Koltun

Keywords: self-attention, pairwise, patchwise, vector attention, image recognition

Abstract Paper Similar Papers

Abstract: Recent work has shown that self-attention can serve as a basic building block for image recognition models. We explore variations of self-attention and assess their effectiveness for image recognition. We consider two forms of self-attention. One is pairwise self-attention, which generalizes standard dot-product attention and is fundamentally a set operator. The other is patchwise self-attention, which is strictly more powerful than convolution. Our pairwise self-attention networks match or outperform their convolutional counterparts, and the patchwise models substantially outperform the convolutional baselines. We also conduct experiments that probe the robustness of learned representations and conclude that self-attention networks may have significant benefits in terms of robustness and generalization.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at CVPR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Learning Selective Self-Mutual Attention for RGB-D Saliency Detection

Nian Liu, Ni Zhang, Junwei Han

Keywords Paper

rgb-d saliency detection, middle fusion, self-attention, mutual-attention, non-local network, two-stream cnn

0

0

0

0

1:01

06/12/2021

Passive attention in artificial neural networks predicts human visual selectivity

Thomas Langlois, Haicheng Zhao, Erin Grant and
Ishita Dasgupta, Tom Griffiths, Nori Jacoby

Keywords Paper

deep learning, machine learning, vision, interpretability

0

0

0

0

19:23

06/12/2021

Residual Relaxation for Multi-view Representation Learning

Yifei Wang, Zhengyang Geng, Feng Jiang and
Chuming Li, Yisen Wang, Jiansheng Yang, Zhouchen Lin

Keywords Paper

self-supervised learning, representation learning

0

0

0

0

11:48

06/12/2020

GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network

Prune Truong, Martin Danelljan, Luc V Gool, Radu Timofte

Keywords Paper

0

0

0

0

3:18

06/12/2021

Looking Beyond Single Images for Contrastive Semantic Segmentation Learning

FEIHU ZHANG, Philip Torr, Rene Ranftl, Stephan Richter

Keywords Paper

machine learning, vision, contrastive learning, representation learning

0

0

0

0

14:48

30/11/2020

Second Order enhanced Multi-glimpse Attention in Visual Question Answering

Qiang Sun, Binghui Xie, Yanwei Fu

Keywords Paper

0

0

0

0

7:20

19/04/2021

Crisscrossed captions: Extended intramodal and intermodal semantic similarity judgments for MS-COCO

Zarana Parekh, Jason Baldridge, Daniel Cer and
Austin Waters, Yinfei Yang

Keywords Paper

0

0

0

0

10:19

06/12/2021

Variational Multi-Task Learning with Gumbel-Softmax Priors

Jiayi Shen, Xiantong Zhen, Marcel Worring, Ling Shao

Keywords Paper

machine learning, generative model

0

0

0

0

13:09

30/11/2020

Transforming Multi-Concept Attention into Video Summarization

Yen-Ting Liu, Yu-Jhe Li, Yu-Chiang Frank Wang

Keywords Paper

0

0

0

0

7:07

26/04/2020

On the Relationship between Self-Attention and Convolutional Layers

Jean-Baptiste Cordonnier, Andreas Loukas, Martin Jaggi

Keywords Paper

self-attention, attention, transformers, convolution, CNN, image, expressivity, capacity

0

0

0

0

5:18

14/09/2020

On Saliency Maps and Adversarial Robustness

Puneet Mangla, Vedant Singh, Vineeth Balasubramanian

Keywords Paper

adversarial robustness, saliency maps, deep neural networks

0

0

0

0

17:29

02/02/2021

Bridging Towers of Multi-task Learning with a Gating Mechanism for Aspect-based Sentiment Analysis and Sequential Metaphor Identification

Rui Mao, Xiao Li

Keywords Paper

0

0

0

0

19:27

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

Keywords Paper

0

0

0

0

19:54

03/05/2021

Improving Transformation Invariance in Contrastive Representation Learning

Adam Foster, Rattana Pukdee, Tom Rainforth

Keywords Paper

transformation invariance, contrastive learning, representation learning

0

0

0

0

5:23

12/07/2020

Weakly-Supervised Disentanglement Without Compromises

Francesco Locatello, Ben Poole, Gunnar Raetsch and
Bernhard Schölkopf, Olivier Bachem, Michael Tschannen

Keywords Paper

Representation Learning

0

0

0

0

14:47

14/06/2020

Interpretable and Accurate Fine-grained Recognition via Region Grouping

Zixuan Huang, Yin Li

Keywords Paper

interpretable deep model, fine-grained recognition, region-based recognition

0

0

0

0

4:58

02/02/2021

Similarity Reasoning and Filtration for Image-Text Matching

Haiwen Diao, Ying Zhang, Lin Ma, Huchuan Lu

Keywords Paper

0

0

0

0

16:34

26/04/2020

Theory and Evaluation Metrics for Learning Disentangled Representations

Kien Do, Truyen Tran

Keywords Paper

disentanglement, metrics

0

0

0

0

3:37

03/05/2021

Learning Robust State Abstractions for Hidden-Parameter Block MDPs

Amy Zhang, Shagun Sodhani, Khimya Khetarpal, Joelle Pineau

Keywords Paper

bisimulation, block mdp, hidden-parameter mdp, multi-task reinforcement learning

0

0

0

0

4:17

06/12/2021

MAU: A Motion-Aware Unit for Video Prediction and Beyond

Zheng Chang, Xinfeng Zhang, Shanshe Wang and
Siwei Ma, Yan Ye, Xiang Xinguang, Wen Gao

Keywords Paper

vision

0

0

0

0

9:54

07/09/2020

Adversarial Concurrent Training: Optimizing Robustness and Accuracy Trade-off of Deep Neural Networks

Elahe Arani, Fahad Sarfraz, Bahram Zonooz

Keywords Paper

Adversarial Robustness, Generalization, Adversarial Training, Deep Learning, Collaborative Learning

0

0

0

0

3:39

06/12/2021

Grounding inductive biases in natural images: invariance stems from variations in data

Diane Bouchacourt, Mark Ibrahim, Ari Morcos

Keywords Paper

machine learning, transformers

0

0

0

0

14:19

06/12/2020

Contrastive Learning with Adversarial Examples

Chih-Hui Ho, Nuno Nvasconcelos

Keywords Paper

0

0

0

0

3:13

06/12/2021

Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning

Yifan Zhang, Bryan Hooi, Dapeng Hu and
Jian Liang, Jiashi Feng

Keywords Paper

optimization, machine learning, self-supervised learning, vision, contrastive learning, representation learning, transfer learning

0

0

0

0

14:34

07/09/2020

Attentive Action and Context Factorization

Yang Wang, Vinh Tran, Gedas Bertasius and
Lorenzo Torresani, Minh Hoai Nguyen

Keywords Paper

action factorization, attention, conjugate samples

0

0

0

0

9:59

03/05/2021

CPR: Classifier-Projection Regularization for Continual Learning

Sungmin Cha, Hsiang Hsu, Taebaek Hwang and
Flavio Calmon, Taesup Moon

Keywords Paper

regularization, wide local minima, continual learning

0

0

0

1

5:21

22/11/2021

Paying Attention to Varying Receptive Fields: Object Detection with Atrous Filters and Vision Transformers

Arthur Jian Shun Lam, Jun Yi Lim, Ricky Sutopo, Vishnu Monn Baskaran

Keywords Paper

object detection, atrous convolution, vision transformers, attention mechanism

0

0

0

0

3:01

07/09/2020

Attention Distillation for Learning Video Representations

Miao Liu, Xin Chen, Yun Zhang and
Yin Li, James Rehg

Keywords Paper

Action Recognition, Deep Learning, Representation Learning

0

0

0

0

9:50

02/02/2021

Domain General Face Forgery Detection by Learning to Weight

Ke Sun, Hong Liu, Qixiang Ye and
Yue Gao, Jianzhuang Liu, Ling Shao, Rongrong Ji

Keywords Paper

0

0

0

0

14:07

25/07/2020

Regional relation modeling for visual place recognition

Yingying Zhu, Biao Li, Jiong Wang, Zhou Zhao

Keywords Paper

convolutional neural network, visual place recognition, content-based image retrieval, relation modeling

0

0

0

0

14:11

26/04/2020

Sharing Knowledge in Multi-Task Deep Reinforcement Learning

Carlo D'Eramo, Davide Tateo, Andrea Bonarini and
Marcello Restelli, Jan Peters

Keywords Paper

Deep Reinforcement Learning, Multi-Task

0

0

0

0

4:27

02/02/2021

Understanding Deformable Alignment in Video Super-Resolution

Kelvin C.K. Chan, Xintao Wang, Ke Yu and
Chao Dong, Chen Change Loy

Keywords Paper

0

0

0

0

14:15

03/05/2021

Exploring Balanced Feature Spaces for Representation Learning

Bingyi Kang, Yu Li, Sain Xie and
Zehuan Yuan, Jiashi Feng

Keywords Paper

Representation Learning, Contrastive Learning, Long-Tailed Recognition

0

0

0

0

7:18

05/01/2021

Cross-Domain Latent Modulation for Variational Transfer Learning

Jinyong Hou, Jeremiah D. Deng, Stephen Cranefield, Xuejie Ding

Keywords Paper

0

0

0

0

4:52

03/05/2021

Representation Learning via Invariant Causal Mechanisms

Jovana Mitrovic, Brian McWilliams, Jacob C Walker and
Lars Buesing, Charles Blundell

Keywords Paper

Self-supervised Learning, Representation Learning, Causality, Contrastive Methods

1

0

0

0

7:03

22/11/2021

Image-Text Alignment using Adaptive Cross-attention with Transformer Encoder for Scene Graphs

Juyong Song, Sunghyun Choi

Keywords Paper

cross-attention, multi-modal, retrieval, scene-graphs, graph neural networks, contrastive loss

0

0

0

0

3:01

18/11/2020

CCA-flow: Deep multi-view subspace learning with inverse autoregressive flow

Jia He, Feiyang Pan, Fuzhen Zhuang, Qing He

Keywords Paper

0

0

0

0

11:33

14/06/2020

Relation-Aware Global Attention for Person Re-Identification

Zhizheng Zhang, Cuiling Lan, Wenjun Zeng and
Xin Jin, Zhibo Chen

Keywords Paper

relation-aware global attention, attention mechanism, person re-identification, feature relations, global structural information

0

0

0

0

1:01

26/04/2020

Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Piyush Gupta, Nikaash Puri, Sukriti Verma and
Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

Keywords Paper

Deep Reinforcement Learning, Saliency maps, Chess, Atari games, Interpretable AI

0

0

0

0

4:59

02/02/2021

Learning Intact Features by Erasing-Inpainting for Few-shot Classification

Junjie Li, Zilei Wang, Xiaoming Hu

Keywords Paper

0

0

0

0

15:15