Auto Learning Attention

06/12/2020

Benteng Ma, Jing Zhang, Yong Xia, Dacheng Tao

Keywords: Algorithms -> Representation Learning, Algorithms -> Relational Learning

Abstract: Attention modules have proven effective at strengthening the representation ability of a neural network by reweighting spatial or channel features, or by stacking both operations sequentially. However, designing the structures of different attention operations requires substantial computation and extensive expertise. In this paper, we devise an Auto Learning Attention (AutoLA) method, which is the first attempt at automatic attention design. Specifically, we define a novel attention module named high order group attention (HOGA) as a directed acyclic graph (DAG) in which each group represents a node and each edge represents an operation of heterogeneous attentions. A typical HOGA architecture can be searched automatically via the differentiable AutoLA method within 1 GPU day using the ResNet-20 backbone on CIFAR10. Further, the searched attention module generalizes to various backbones as a plug-and-play component and outperforms popular manually designed channel and spatial attentions on many vision tasks, including image classification on CIFAR100 and ImageNet, and object detection and human keypoint detection on the COCO dataset. The code will be released.
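
The DAG formulation in the abstract can be illustrated with a minimal sketch. It assumes a DARTS-style continuous relaxation, in which every edge computes a softmax-weighted mixture of candidate attention operations and the strongest candidate is kept after the search. The candidate operations, the function names (`mixed_op`, `hoga_like_forward`, `derive_architecture`) and the way nodes are combined are hypothetical choices made for illustration and are not taken from the paper. The architecture weights `alphas` are fixed here, whereas an actual search would learn them by gradient descent.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - np.max(x))
    return e / e.sum()

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Hypothetical, parameter-free candidate operations on a (C, H, W) tensor.
def identity(x):
    return x

def channel_attention(x):
    # Gate each channel by the sigmoid of its global average.
    return x * sigmoid(x.mean(axis=(1, 2), keepdims=True))

def spatial_attention(x):
    # Gate each location by the sigmoid of its channel-wise mean.
    return x * sigmoid(x.mean(axis=0, keepdims=True))

CANDIDATES = [identity, channel_attention, spatial_attention]

def mixed_op(x, alpha):
    # Continuous relaxation: softmax-weighted sum of all candidates.
    weights = softmax(alpha)
    return sum(w * op(x) for w, op in zip(weights, CANDIDATES))

def hoga_like_forward(x, alphas, num_groups):
    # Each channel group is a node; edge (i, j) with i < j is a mixed op.
    groups = np.split(x, num_groups, axis=0)
    nodes = [groups[0]]
    for j in range(1, num_groups):
        out = groups[j]
        for i in range(j):
            out = out + mixed_op(nodes[i], alphas[(i, j)])
        nodes.append(out)
    return np.concatenate(nodes, axis=0)

def derive_architecture(alphas):
    # Discretize: keep the highest-weighted candidate on every edge.
    return {e: CANDIDATES[int(np.argmax(a))].__name__ for e, a in alphas.items()}

num_groups = 4
x = np.random.default_rng(0).standard_normal((8, 4, 4))
alphas = {(i, j): np.zeros(len(CANDIDATES))
          for j in range(num_groups) for i in range(j)}
y = hoga_like_forward(x, alphas, num_groups)
print(y.shape)
```

The output keeps the input shape, which is what would let such a module be dropped into a backbone as a plug-and-play component.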

The talk and the respective paper were published at the NeurIPS 2020 virtual conference.

Similar Papers

14/06/2020

MemNAS: Memory-Efficient Neural Architecture Search With Grow-Trim Learning

Peiye Liu, Bo Wu, Huadong Ma, Mingoo Seok

Keywords: neural architecture search, recurrent neural network, memory optimization

0:59

18/07/2021

AutoAttend: Automated Attention Representation Search

Chaoyu Guan, Xin Wang, Wenwu Zhu

Keywords: Algorithms, AutoML

4:49

30/11/2020

MTNAS: Search Multi-Task Networks for Autonomous Driving

Hao Liu, Dong Li, JinZhang Peng, Qingjie Zhao, Lu Tian, Yi Shan

9:06

14/06/2020

Densely Connected Search Space for More Flexible Neural Architecture Search

Jiemin Fang, Yuzhu Sun, Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang

Keywords: neural architecture search (NAS), densely connected search space

1:00

14/06/2020

Non-Local Neural Networks With Grouped Bilinear Attentional Transforms

Lu Chi, Zehuan Yuan, Yadong Mu, Changhu Wang

Keywords: attention, non-local, bilinear, image classification, video classification, grouped, data-adaptive

1:01

03/05/2021

A Trainable Optimal Transport Embedding for Feature Aggregation and its Relationship to Attention

Grégoire Mialon, Dexiong Chen, Alexandre d'Aspremont, Julien Mairal

Keywords: attention, bioinformatics, transformers, optimal transport, kernel methods

5:29

26/04/2020

Computation Reallocation for Object Detection

Feng Liang, Chen Lin, Ronghao Guo, Ming Sun, Wei Wu, Junjie Yan, Wanli Ouyang

Keywords: Neural Architecture Search, Object Detection

5:29

06/12/2021

Rethinking Neural Operations for Diverse Tasks

Nicholas Roberts, Mikhail Khodak, Tri Dao, Liam Li, Christopher Ré, Ameet S Talwalkar

Keywords: deep learning, machine learning

10:26

06/12/2020

GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network

Prune Truong, Martin Danelljan, Luc V Gool, Radu Timofte

3:18

04/07/2020

Highway Transformer: Self-Gating Enhanced Self-Attentive Networks

Yekun Chai, Shuo Jin, Xinwen Hou

Keywords: sequence tasks, optimization process, Highway Transformer, Self-Gating Networks

12:14

02/02/2021

Visual Concept Reasoning Networks

Taesup Kim, Sungwoong Kim, Yoshua Bengio

13:01

05/01/2021

MPRNet: Multi-Path Residual Network for Lightweight Image Super Resolution

Armin Mehri, Parichehr B. Ardakani, Angel D. Sappa

4:57

26/04/2020

Tensor Decompositions for Temporal Knowledge Base Completion

Timothée Lacroix, Guillaume Obozinski, Nicolas Usunier

Keywords: knowledge base completion, temporal embeddings

4:38

07/09/2020

Automated Search for Resource-Efficient Branched Multi-Task Networks

David Brüggemann, Menelaos Kanakis, Stamatios Georgoulis, Luc Van Gool

Keywords: multi task, neural architecture search, resource efficient networks, dense prediction, encoder branching, proxyless resource loss, differentiable search space, branched networks, tree-like networks, Gumbel-Softmax

8:31

02/02/2021

Transformer-Style Relational Reasoning with Dynamic Memory Updating for Temporal Network Modeling

Dongkuan Xu, Junjie Liang, Wei Cheng, Hua Wei, Haifeng Chen, Xiang Zhang

15:47

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer, Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords: Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

5:05

02/02/2021

Learning Visual Context for Group Activity Recognition

Hangjie Yuan, Dong Ni

16:54

18/07/2021

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Jianfei Chen, Lianmin Zheng, Zhewei Yao, Dequan Wang, Ion Stoica, Michael Mahoney, Joseph E Gonzalez

Keywords: Algorithms, Large Scale Learning

18:54

22/11/2021

Spatiotemporal Deformable Scene Graphs for Complex Activity Detection

Salman Khan, Fabio Cuzzolin

Keywords: action detection, activity detection, complex activity detection, scene graph, graph convolutional network, autonomous driving, surgical robotics, deformable pooling, parts deformation

3:02

22/11/2021

Logsig-RNN: a novel network for robust and efficient skeleton-based action recognition

Hao Ni, Shujian Liao, Weixin Yang, Kevin Schlegel, Terry J Lyons

Keywords: skeleton-based action recognition, recurrent neural network, log-signature

2:58

06/12/2021

Alignment Attention by Matching Key and Query Distributions

Shujian Zhang, Xinjie Fan, Huangjie Zheng, Korawat Tanwisuth, Mingyuan Zhou

Keywords: deep learning, robustness, adversarial robustness and security, vision, graph learning, language

7:16

02/02/2021

End-to-end Semantic Role Labeling with Neural Transition-based Model

Hao Fei, Meishan Zhang, Bobo Li, Donghong Ji

18:47

16/11/2020

An Exploration of Arbitrary-Order Sequence Labeling via Energy-Based Inference Networks

Lifu Tu, Tianyu Liu, Kevin Gimpel

Keywords: natural processing, sequence labeling, semantic labeling, parsing

10:07

06/12/2020

Timeseries Anomaly Detection using Temporal Hierarchical One-Class Network

Lifeng Shen, Zhuocong Li, James Kwok

3:12

06/12/2020

Interstellar: Searching Recurrent Architecture for Knowledge Graph Embedding

Yongqi Zhang, Quanming Yao, Lei Chen

3:23

03/05/2021

Hopfield Networks is All You Need

Hubert Ramsauer, Bernhard Schäfl, Johannes Lehner, Philipp Seidl, Michael Widrich, Lukas Gruber, Markus Holzleitner, Thomas Adler, David Kreil, Michael K Kopp, Günter Klambauer, Johannes Brandstetter, Sepp Hochreiter

Keywords: Attention, Associative Memory, Hopfield layer, Storage Capacity, Convergence, Energy, Modern Hopfield Network

5:11

25/07/2020

Web table retrieval using multimodal deep learning

Roee Shraga, Haggai Roitman, Guy Feigenblat, Mustafa Cannim

Keywords: experimentation, multimodal deep-learning, table retrieval

14:08

06/12/2020

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning

Ximeng Sun, Rameswar Panda, Rogerio Feris, Kate Saenko

3:13

19/08/2021

Multi-hop Attention Graph Neural Networks

Guangtao Wang, Rex Ying, Jing Huang, Jure Leskovec

Keywords: Machine Learning, Deep Learning, Learning Graphical Models, Relational Learning

12:35

02/02/2021

Encoder-Decoder Based Unified Semantic Role Labeling with Label-Aware Syntax

Hao Fei, Fei Li, Bobo Li, Donghong Ji

16:10

23/08/2020

Controllable multi-interest framework for recommendation

Yukuo Cen, Jianwei Zhang, Xu Zou, Chang Zhou, Hongxia Yang, Jie Tang

Keywords: recommender system, multi-interest framework, sequential recommendation

15:59

26/04/2020

Progressive learning and disentanglement of hierarchical representations

Zhiyuan Li, Jaideep Vitthal Murkute, Prashnna Kumar Gyawali, Linwei Wang

Keywords: generative model, disentanglement, progressive learning, VAE

5:06

14/06/2020

APQ: Joint Search for Network Architecture, Pruning and Quantization Policy

Tianzhe Wang, Kuan Wang, Han Cai, Ji Lin, Zhijian Liu, Hanrui Wang, Yujun Lin, Song Han

Keywords: efficiency, model compression, joint design, neural architecture search, channel pruning, mixed-precision quantization

1:00

06/12/2021

Generalized Shape Metrics on Neural Representations

Alex H Williams, Erin Kunz, Simon Kornblith, Scott Linderman

Keywords: deep learning, machine learning, generative model, representation learning

10:55

22/11/2021

Searching for TrioNet: Combining Convolution with Local and Global Self-Attention

Huaijin Pi, Huiyu Wang, Yingwei Li, Zizhang Li, Alan Yuille

Keywords: Self-Attention, Neural Architecture Search

2:56

19/08/2021

Asynchronous Multi-grained Graph Network For Interpretable Multi-hop Reading Comprehension

Ronghan Li, Lifang Wang, Shengli Wang, Zejun Jiang

Keywords: Natural Language Processing, Question Answering

14:40

16/11/2020

Supertagging Combinatory Categorial Grammar with Attentive Graph Convolutional Networks

Yuanhe Tian, Yan Song, Fei Xia

Keywords: supertagging, combinatory parsing, neural supertagging, parsing

6:53

19/08/2021

Progressive Open-Domain Response Generation with Multiple Controllable Attributes

Haiqin Yang, Xiaoyuan Yao, Yiqun Duan, Jianping Shen, Jie Zhong, Kun Zhang

Keywords: Machine Learning, Learning Generative Models, Dialogue

14:43

26/04/2020

Fast Neural Network Adaptation via Parameter Remapping and Architecture Search

Jiemin Fang, Yuzhu Sun, Kangjian Peng, Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang

4:39

23/08/2020

Spectrum-guided adversarial disparity learning

Zhe Liu, Lina Yao, Lei Bai, Xianzhi Wang, Can Wang

Keywords: adversarial autoencoder, generative models, intraclass variability, activity recognition

14:30