NAS-Bench-x11 and the Power of Learning Curves

06/12/2021

Shen Yan, Colin White, Yash Savani, Frank Hutter

Keywords: deep learning

Abstract: While early research in neural architecture search (NAS) required extreme computational resources, the recent releases of tabular and surrogate benchmarks have greatly increased the speed and reproducibility of NAS research. However, two of the most popular benchmarks do not provide the full training information for each architecture. As a result, on these benchmarks it is not possible to evaluate many types of multi-fidelity algorithms, such as learning curve extrapolation, that require evaluating architectures at arbitrary epochs. In this work, we present a method using singular value decomposition and noise modeling to create surrogate benchmarks, NAS-Bench-111, NAS-Bench-311, and NAS-Bench-NLP11, that output the full training information for each architecture, rather than just the final validation accuracy. We demonstrate the power of using the full training information by introducing a learning curve extrapolation framework to modify single-fidelity algorithms, showing that it leads to improvements over popular single-fidelity algorithms which claimed to be state-of-the-art upon release.
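As a rough illustration of the singular value decomposition and noise modeling idea mentioned in the abstract, the sketch below compresses a set of learning curves into a few coefficients per architecture, reconstructs the curves from those coefficients, and adds per-epoch noise estimated from the residuals. It is not the authors' implementation: the synthetic curves, the rank `k=4`, the Gaussian noise model, and the helper names `fit_curve_basis`, `compress`, and `reconstruct` are all assumptions made for this example.

```python
import numpy as np


def fit_curve_basis(curves, k):
    """Return the mean curve and the top-k right singular vectors (k x epochs)."""
    mean = curves.mean(axis=0)
    _, _, vt = np.linalg.svd(curves - mean, full_matrices=False)
    return mean, vt[:k]


def compress(curves, mean, basis):
    """Project each curve onto the basis, giving k coefficients per architecture."""
    return (curves - mean) @ basis.T


def reconstruct(coeffs, mean, basis):
    """Map k coefficients back to a full learning curve."""
    return coeffs @ basis + mean


# Synthetic validation-accuracy curves (hypothetical data, one row per architecture).
rng = np.random.default_rng(0)
epochs = np.arange(1, 51)
n_archs = 200
final_acc = rng.uniform(0.80, 0.95, size=(n_archs, 1))
rate = rng.uniform(0.05, 0.30, size=(n_archs, 1))
clean = final_acc * (1.0 - np.exp(-rate * epochs))
noisy = clean + rng.normal(0.0, 0.01, size=clean.shape)

# Compress to k coefficients; a surrogate model could then predict these
# coefficients from an architecture encoding instead of the full curve.
mean, basis = fit_curve_basis(noisy, k=4)
coeffs = compress(noisy, mean, basis)
smooth = reconstruct(coeffs, mean, basis)

# Simple noise model (an assumption): independent Gaussian noise per epoch,
# with standard deviation estimated from the reconstruction residuals.
residual_std = (noisy - smooth).std(axis=0)
sampled = smooth + rng.normal(0.0, residual_std, size=smooth.shape)
```

In this setup a predictor only needs to output four numbers per architecture rather than fifty, and the full curve at any epoch is recovered through `reconstruct`.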

The talk and the corresponding paper were published at the NeurIPS 2021 virtual conference.

Similar Papers

14/06/2020

Computing the Testing Error Without a Testing Set

Ciprian A. Corneanu, Sergio Escalera, Aleix M. Martinez

Keywords: deep learning, algebraic topology, generalization, object recognition, facial analysis, semantic segmentation

4:43

26/04/2020

Towards Fast Adaptation of Neural Architectures with Meta Learning

Dongze Lian, Yin Zheng, Yintao Xu, Yanxiong Lu, Leyu Lin, Peilin Zhao, Junzhou Huang, Shenghua Gao

Keywords: Fast adaptation, Meta learning, NAS

4:55

08/12/2020

E.T.: Entity-Transformers. Coreference augmented Neural Language Model for richer mention representations via Entity-Transformer blocks

Nikolaos Stylianou, Ioannis Vlahavas

8:49

16/11/2020

Reproducible and Efficient Benchmarks for Hyperparameter Optimization of Neural Machine Translation Systems

Xuan Zhang, Kevin Duh

Keywords: hyperparameter selection, neural systems, automatic optimization, nmt

11:38

04/07/2020

Learning Architectures from an Extended Search Space for Language Modeling

Yinqiao Li, Chi Hu, Yuhao Zhang, Nuo Xu, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu, Changliang Li

Keywords: Language Modeling, intra-cell NAS, recurrent modeling, CoNLL task

10:28

05/04/2021

Pipelined Backpropagation at Scale: Training Large Models without Batches

Atli Kosson, Vitaliy Chiley, Abhi Venigalla, Joel Hestness, Urs Koster

4:14

05/04/2021

Pipelined Backpropagation at Scale: Training Large Models without Batches

Atli Kosson, Vitaliy Chiley, Abhi Venigalla, Joel Hestness, Urs Koster

18:00

12/07/2020

Evolving Machine Learning Algorithms From Scratch

Esteban Real, Chen Liang, David So, Quoc Le

Keywords: Transfer, Multitask and Meta-learning

15:01

12/07/2020

Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search

Yong Guo, Yaofo Chen, Yin Zheng, Peilin Zhao, Jian Chen, Junzhou Huang, Mingkui Tan

Keywords: Deep Learning - General

13:38

06/12/2021

Learning Transferable Adversarial Perturbations

Krishna kanth Nakka, Mathieu Salzmann

Keywords: deep learning, optimization, adversarial robustness and security

12:00

12/07/2020

Small Data, Big Decisions: Model Selection in the Small-Data Regime

Jorg Bornschein, Francesco Visin, Simon Osindero

Keywords: Deep Learning - General

11:47

08/12/2020

Dynamic Curriculum Learning for Low-Resource Neural Machine Translation

Chen Xu, Bojie Hu, Yufan Jiang, Kai Feng, Zeyang Wang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

13:28

02/02/2021

Any-Precision Deep Neural Networks

Haichao Yu, Haoxiang Li, Humphrey Shi, Thomas S. Huang, Gang Hua

14:26

03/05/2021

Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective

Wuyang Chen, Xinyu Gong, Zhangyang Wang

Keywords: number of linear regions, neural tangent kernel, Neural Architecture Search

5:01

06/12/2021

What can linearized neural networks actually say about generalization?

Guillermo Ortiz-Jimenez, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

Keywords: theory, deep learning

9:46

26/04/2020

Understanding Architectures Learnt by Cell-based Neural Architecture Search

Yao Shu, Wei Wang, Shaofeng Cai

Keywords: Neural Architecture Search, connection pattern, optimization, convergence, Lipschitz smoothness, gradient variance, generalization

4:21

06/12/2021

GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training

Chen Zhu, Renkun Ni, Zheng Xu, Kezhi Kong, W. Ronny Huang, Tom Goldstein

Keywords: deep learning, transformers, vision

13:17

08/12/2020

TransQuest: Translation Quality Estimation with Cross-lingual Transformers

Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

14:27

06/12/2021

How Powerful are Performance Predictors in Neural Architecture Search?

Colin White, Arber Zela, Robin Ru, Yang Liu, Frank Hutter

Keywords: deep learning, optimization

15:02

14/06/2020

Block-Wisely Supervised Neural Architecture Search With Knowledge Distillation

Changlin Li, Jiefeng Peng, Liuchun Yuan, Guangrun Wang, Xiaodan Liang, Liang Lin, Xiaojun Chang

Keywords: neural architecture search, knowledge distillation, automated machine learning, weight sharing nas, one-shot nas, auto-ml, efficient model, image classification

1:00

02/02/2021

Large Batch Optimization for Deep Learning Using New Complete Layer-Wise Adaptive Rate Scaling

Zhouyuan Huo, Bin Gu, Heng Huang

15:17

02/02/2021

Provable Benefits of Overparameterization in Model Compression: From Double Descent to Pruning Neural Networks

Xiangyu Chang, Yingcong Li, Samet Oymak, Christos Thrampoulidis

18:14

12/07/2020

Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data

Felipe Petroski Such, Aditya Rawal, Joel Lehman, Kenneth Stanley, Jeffrey Clune

Keywords: Transfer, Multitask and Meta-learning

7:25

03/05/2021

Robust Pruning at Initialization

Soufiane Hayou, Jean-Francois Ton, Arnaud Doucet, Yee Whye Teh

Keywords: Pruning, Compression, Initialization

4:07

06/12/2020

Adapting Neural Architectures Between Domains

Yanxi Li, Zhaohui Yang, Yunhe Wang, Chang Xu

3:20

18/07/2021

Learning Neural Network Subspaces

Mitchell Wortsman, Maxwell Horton, Carlos Guestrin, Ali Farhadi, Mohammad Rastegari

Keywords: Deep Learning, Applications, Dialog- or Communication-Based Learning, Algorithms, Representation Learning

5:07

26/04/2020

NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search

Xuanyi Dong, Yi Yang

Keywords: Neural Architecture Search, AutoML, Benchmark

4:56

02/02/2021

NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search

Rameswar Panda, Michele Merler, Mayoore S Jaiswal, Hui Wu, Kandan Ramakrishnan, Ulrich Finkler, Chun-Fu Richard Chen, Minsik Cho, Rogerio Feris, David Kung, Bishwaranjan Bhattacharjee

13:54

03/05/2021

NAS-Bench-ASR: Reproducible Neural Architecture Search for Speech Recognition

Abhinav Mehrotra, Alberto Gil Couto Pimentel Ramos, Sourav Bhattacharya, Łukasz Dudziak, Ravichander Vipperla, Thomas C Chau, Mohamed Abdelfattah, Samin Ishtiaq, Nic Lane

Keywords: Benchmark, NAS, ASR

4:50

03/05/2021

HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark

Chaojian Li, Zhongzhi Yu, Yonggan Fu, Yongan Zhang, Yang Zhao, Haoran You, Qixuan Yu, Yue Wang, Cong Hao, Yingyan Lin

Keywords: AutoML, Benchmark, Hardware-Aware Neural Architecture Search

11:02

19/04/2021

Bootstrapping relation extractors using syntactic search by examples

Matan Eyal, Asaf Amrami, Hillel Taub-Tabib, Yoav Goldberg

9:55

06/12/2020

Optimizing Neural Networks via Koopman Operator Theory

Akshunna S. Dogra, Will Redman

3:12

08/12/2020

Data-Efficient Paraphrase Generation to Bootstrap Intent Classification and Slot Labeling for New Features in Task-Oriented Dialog Systems

Shailza Jolly, Tobias Falke, Caglar Tirkaz, Daniil Sorokin

11:38

02/02/2021

Meta-Transfer Learning for Low-Resource Abstractive Summarization

Yi-Syuan Chen, Hong-Han Shuai

19:10

06/12/2020

Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search

Houwen Peng, Hao Du, Hongyuan Yu, QI LI, Jing Liao, Jianlong Fu

3:12

26/04/2020

Effect of Activation Functions on the Training of Overparametrized Neural Nets

Abhishek Panigrahi, Abhishek Shetty, Navin Goyal

Keywords: activation functions, deep learning theory, neural networks

5:13

18/07/2021

Neural Architecture Search without Training

Joe Mellor, Jack Turner, Amos Storkey, Elliot Crowley

Keywords: Deep Learning, Architectures

20:37

06/12/2021

Structured Reordering for Modeling Latent Alignments in Sequence Transduction

bailin wang, Mirella Lapata, Ivan Titov

Keywords: language

15:00

14/07/2020

On the limits of parallelizing convolutional neural networks on GPUs

Behnam Pourghassemi, Chenghao Zhang, Joo Hwan Lee, Aparna Chandramowlishwaran

Keywords: GPU, non-linear networks, convolutional neural networks (CNNs), resource utilization, parallelization

7:42

19/08/2021

Hardware-Aware Neural Architecture Search: Survey and Taxonomy

Hadjer Benmeziane, Kaoutar El Maghraoui, Hamza Ouarnoughi, Smail Niar, Martin Wistuba, Naigang Wang

Keywords: Machine learning, General

14:12