Searching the Search Space of Vision Transformer

06/12/2021

Searching the Search Space of Vision Transformer

Minghao Chen, Kan Wu, Bolin Ni, Houwen Peng, Bei Liu, Jianlong Fu, Hongyang Chao, Haibin Ling

Keywords: deep learning, transformers, vision

Abstract Paper Similar Papers

Abstract: Vision Transformer has shown great visual representation power in substantial vision tasks such as recognition and detection, and thus been attracting fast-growing efforts on manually designing more effective architectures. In this paper, we propose to use neural architecture search to automate this process, by searching not only the architecture but also the search space. The central idea is to gradually evolve different search dimensions guided by their E-T Error computed using a weight-sharing supernet. Moreover, we provide design guidelines of general vision transformers with extensive analysis according to the space searching process, which could promote the understanding of vision transformer. Remarkably, the searched models, named S3 (short for Searching the Search Space), from the searched space achieve superior performance to recently proposed models, such as Swin, DeiT and ViT, when evaluated on ImageNet. The effectiveness of S3 is also illustrated on object detection, semantic segmentation and visual question answering, demonstrating its generality to downstream vision and vision-language tasks. Code and models will be available at https://github.com/microsoft/Cream.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

30/11/2020

MTNAS: Search Multi-Task Networks for Autonomous Driving

Hao Liu, Dong Li, JinZhang Peng and
Qingjie Zhao, Lu Tian, Yi Shan

Keywords Paper

0

0

0

0

9:06

14/06/2020

Deep Non-Line-of-Sight Reconstruction

Javier Grau Chopite, Matthias B. Hullin, Michael Wand, Julian Iseringhausen

Keywords Paper

non-line-of-sight, time-of-flight, transient imaging, deep learning, geometry reconstruction, synthetic training

0

0

0

0

1:00

22/11/2021

Self-Supervised Monocular Depth Estimation with Internal Feature Fusion

Hang Zhou, David Greenwood, Sarah Taylor

Keywords Paper

depth estimation, structure from motion

0

0

0

0

2:49

06/12/2021

SOLQ: Segmenting Objects by Learning Queries

Bin Dong, Fangao Zeng, Tiancai Wang and
Xiangyu Zhang, Yichen Wei

Keywords Paper

machine learning, transformers

0

0

0

0

7:12

14/06/2020

SP-NAS: Serial-to-Parallel Backbone Search for Object Detection

Chenhan Jiang, Hang Xu, Wei Zhang and
Xiaodan Liang, Zhenguo Li

Keywords Paper

object detection, nas, autonomous driving scene

0

0

0

0

1:01

02/02/2021

A Scalable Reasoning and Learning Approach for Neural-Symbolic Stream Fusion

Danh Le-Phuoc, Thomas Eiter, Anh Le-Tuan

Keywords Paper

0

0

0

0

18:49

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer and
Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords Paper

Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

0

0

0

0

5:05

02/02/2021

Generate Your Counterfactuals: Towards Controlled Counterfactual Generation for Text

Nishtha Madaan, Inkit Padhi, Naveen Panwar, Diptikalyan Saha

Keywords Paper

0

0

0

0

20:15

19/08/2021

AMEIR: Automatic Behavior Modeling, Interaction Exploration and MLP Investigation in the Recommender System

Pengyu Zhao, Kecheng Xiao, Yuanxing Zhang and
Kaigui Bian, Wei Yan

Keywords Paper

Knowledge Representation and Reasoning, Preference Modelling and Preference-Based Reasoning, Recommender Systems, Recommender Systems

0

0

0

0

15:05

14/06/2020

Image Search With Text Feedback by Visiolinguistic Attention Learning

Yanbei Chen, Shaogang Gong, Loris Bazzani

Keywords Paper

vision and language, image search, text feedback, attention mechanism, transformer, multimodal learning, representation learning, composition, image retrieval, interactive image search

0

0

0

0

1:00

03/05/2021

Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search

Peidong Liu, Gengwei Zhang, Bochao Wang and
Hang Xu, Xiaodan Liang, Yong Jiang, Zhenguo Li

Keywords Paper

AutoML, Loss function search, Evolutionary algorithm, Object detection

0

0

0

0

5:15

14/06/2020

Learning Saliency Propagation for Semi-Supervised Instance Segmentation

Yanzhao Zhou, Xin Wang, Jianbin Jiao and
Trevor Darrell, Fisher Yu

Keywords Paper

semi-supervised, instance segmentation, saliency, propagation, message passing, multiple instance learning, partial-supervised, generalization

0

0

0

0

1:01

03/05/2021

A Design Space Study for LISTA and Beyond

Tianjian Meng, Xiaohan Chen, Yifan Jiang, Zhangyang Wang

Keywords Paper

0

0

0

0

5:50

15/06/2020

Verifying concurrent search structure templates

Siddharth Krishna, Nisarg Patel, Dennis Shasha, Thomas Wies

Keywords Paper

separation logic, concurrent data structures, flow framework, template-based verification

0

0

0

0

14:56

14/06/2020

DNU: Deep Non-Local Unrolling for Computational Spectral Imaging

Lizhi Wang, Chen Sun, Maoqing Zhang and
Ying Fu, Hua Huang

Keywords Paper

computational spectral imaging, spectral image reconstruction, deep unrolling, non-local similarity, deep prior, image compressive sensing

0

0

0

0

1:01

14/06/2020

MemNAS: Memory-Efficient Neural Architecture Search With Grow-Trim Learning

Peiye Liu, Bo Wu, Huadong Ma, Mingoo Seok

Keywords Paper

neural architecture search, recurrent neural network, memory optimization

0

0

0

0

0:59

06/12/2021

Associating Objects with Transformers for Video Object Segmentation

Zongxin Yang, Yunchao Wei, Yi Yang

Keywords Paper

transformers

0

0

0

0

12:29

12/07/2020

MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time

XICHUAN ZHOU, YiCong Peng, Chunqiao Long and
Fengbo Ren, Cong Shi

Keywords Paper

Applications - Computer Vision

0

0

0

0

11:57

15/11/2020

Satune: Synthesizing Efficient SAT Encoders

Hamed Gorjiara, Guoqing Harry Xu, Brian Demsky

Keywords Paper

Auto-tuning, SAT encoding, Constraint Solvers

0

0

0

0

15:23

14/06/2020

3DRegNet: A Deep Neural Network for 3D Point Registration

G. Dias Pais, Srikumar Ramalingam, Venu Madhav Govindu and
Jacinto C. Nascimento, Rama Chellappa, Pedro Miraldo

Keywords Paper

3d registration, deep learning, pose regression, classification

0

0

0

0

0:59

26/04/2020

Computation Reallocation for Object Detection

Feng Liang, Chen Lin, Ronghao Guo and
Ming Sun, Wei Wu, Junjie Yan, Wanli Ouyang

Keywords Paper

Neural Architecture Search, Object Detection

0

0

0

0

5:29

06/12/2020

GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network

Prune Truong, Martin Danelljan, Luc V Gool, Radu Timofte

Keywords Paper

0

0

0

0

3:18

05/04/2021

Larq Compute Engine: Design, Benchmark and Deploy State-of-the-Art Binarized Neural Networks

Tom Bannink, Adam Hillier, Lukas Geiger and
Tim de Bruin, Leon Overweel, Jelmer Neeven, Koen Helwegen

Keywords Paper

0

0

0

0

22:15

23/06/2021

AKG: Automatic Kernel Generation for Neural Processing Units using Polyhedral Transformations

Jie Zhao, Bojie Li, Wang Nie and
Zhen Geng, Renwei Zhang, Xiong Gao, Bin Cheng, Chen Wu, Yun Cheng, Zheng Li, Peng Di, Kun Zhang, Xuefeng Jin

Keywords Paper

neural networks, neural processing units, polyhedral model, code generation, auto-tuning

0

0

0

0

21:49

06/12/2020

RANet: Region Attention Network for Semantic Segmentation

Dingguo Shen, Yuanfeng Ji, Ping Li and
Yi Wang, Di Lin

Keywords Paper

0

0

0

0

3:13

14/06/2020

PointAugment: An Auto-Augmentation Framework for Point Cloud Classification

Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu

Keywords Paper

auto-augmentation framework, point cloud processing, sample-aware, jointly optimizing, classification

0

0

0

0

5:01

30/11/2020

Project to Adapt: Domain Adaptation for Depth Completion from Noisy and Sparse Sensor Data

Adrian Lopez-Rodriguez, Benjamin Busam, Krystian Mikolajczyk

Keywords Paper

0

0

0

0

10:00

26/04/2020

Fast Neural Network Adaptation via Parameter Remapping and Architecture Search

Jiemin Fang, Yuzhu Sun, Kangjian Peng* and
Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang

Keywords Paper

1

0

0

0

4:39

02/02/2021

AnchorFace: An Anchor-based Facial Landmark Detector Across Large Poses

Zixuan Xu, Banghuai Li, Ye Yuan, Miao Geng

Keywords Paper

0

0

0

0

15:02

02/02/2021

RGB-D Salient Object Detection via 3D Convolutional Neural Networks

Qian Chen, Ze Liu, Yi Zhang and
Keren Fu, Qijun Zhao, Hongwei Du

Keywords Paper

0

0

0

0

14:04

23/08/2020

AutoML pipeline selection: Efficiently navigating the combinatorial space

Chengrun Yang, Jicong Fan, Ziyang Wu, Madeleine Udell

Keywords Paper

pipeline search, greedy algorithms, experiment design, AutoML, tensor decomposition, submodular optimization, meta-learning

0

0

0

0

13:40

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24

14/06/2020

Dynamic Refinement Network for Oriented and Densely Packed Object Detection

Xingjia Pan, Yuqiang Ren, Kekai Sheng and
Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu

Keywords Paper

object detection, oriented, densely packed, sku110k, feature selection, dynamic, anchor-free

0

0

0

0

5:01

22/11/2021

Point3D: tracking actions as moving points with 3D CNNs

Shentong Mo, Jingfei Xia, Xiaoqing Tan, Bhiksha Raj

Keywords Paper

Spatio-temporal action detection

0

0

0

0

3:02

19/08/2021

DACBench: A Benchmark Library for Dynamic Algorithm Configuration

Theresa Eimer, André Biedenkapp, Maximilian Reimer and
Steven Adriansen, Frank Hutter, Marius Lindauer

Keywords Paper

Heuristic Search and Game Playing, Evaluation and Analysis, Heuristic Search and Machine Learning, Meta-Reasoning and Meta-Heuristics

0

0

0

0

13:51

14/06/2020

BFBox: Searching Face-Appropriate Backbone and Feature Pyramid Network for Face Detector

Yang Liu, Xu Tang

Keywords Paper

face detection, neural architecture search, feature pyramid layer

0

0

0

0

0:59

06/12/2020

DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation

Alexandre Carlier, Martin Danelljan, Alexandre Alahi, Radu Timofte

Keywords Paper

0

0

0

0

3:23

22/11/2021

Spatiotemporal Deformable Scene Graphs for Complex Activity Detection

Salman Khan, Fabio Cuzzolin

Keywords Paper

action detection, activity detection, complex activity detection, scene graph, graph convolutional network, autonomous driving, surgical robotics, deformable pooling, parts deformation

0

0

0

0

3:02

06/12/2020

Synbols: Probing Learning Algorithms with Synthetic Datasets

Alexandre Lacoste, Pau Rodríguez López, Frederic Branchaud-Charron and
Parmida Atighehchian, Massimo Caccia, Issam Hadj Laradji, Alexandre Drouin, Matthew Craddock, Laurent Charlin, David Vázquez

Keywords Paper

0

0

0

0

3:17

18/07/2021

AutoAttend: Automated Attention Representation Search

Chaoyu Guan, Xin Wang, wenwu zhu

Keywords Paper

Algorithms, AutoML

0

0

0

0

4:49