Understanding and Robustifying Differentiable Architecture Search

26/04/2020

Arber Zela, Thomas Elsken, Tonmoy Saikia, Yassine Marrakchi, Thomas Brox, Frank Hutter

Keywords: Neural Architecture Search, AutoML, AutoDL, Deep Learning, Computer Vision

Abstract: Differentiable Architecture Search (DARTS) has attracted a lot of attention due to its simplicity and small search costs achieved by a continuous relaxation and an approximation of the resulting bi-level optimization problem. However, DARTS does not work robustly for new problems: we identify a wide range of search spaces for which DARTS yields degenerate architectures with very poor test performance. We study this failure mode and show that, while DARTS successfully minimizes validation loss, the found solutions generalize poorly when they coincide with high validation loss curvature in the architecture space. We show that by adding one of various types of regularization we can robustify DARTS to find solutions with less curvature and better generalization properties. Based on these observations, we propose several simple variations of DARTS that perform substantially more robustly in practice. Our observations are robust across five search spaces on three image classification tasks and also hold for the very different domains of disparity estimation (a dense regression task) and language modelling.
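As a rough illustration of the continuous relaxation mentioned in the abstract, the sketch below mixes a few candidate operations on one edge with softmax weights and then discretizes by keeping the operation with the largest architecture parameter. The toy scalar operations and the names `CANDIDATE_OPS`, `mixed_op`, and `discretize` are assumptions made for illustration only; they are not taken from the paper or its code, and the bi-level optimization and the proposed regularization are not shown.

```python
import math

def softmax(alphas):
    """Numerically stable softmax over a list of architecture parameters."""
    m = max(alphas)
    exps = [math.exp(a - m) for a in alphas]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical candidate operations on one edge (toy scalar functions
# standing in for e.g. convolution, pooling, and skip connection).
CANDIDATE_OPS = {
    "double": lambda x: 2.0 * x,
    "square": lambda x: x * x,
    "identity": lambda x: x,
}

def mixed_op(x, alphas):
    """Continuous relaxation: softmax-weighted sum of all candidate ops."""
    weights = softmax(alphas)
    return sum(w * op(x) for w, op in zip(weights, CANDIDATE_OPS.values()))

def discretize(alphas):
    """After search, keep only the operation with the largest parameter."""
    names = list(CANDIDATE_OPS)
    best = max(range(len(alphas)), key=lambda i: alphas[i])
    return names[best]
```

With equal parameters every operation receives weight 1/3, so the mixed output is simply the average of the candidate outputs; the discretization step is where a relaxed solution is turned into a single architecture.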

The talk and the corresponding paper were published at the ICLR 2020 virtual conference.

Similar Papers

12/07/2020

Stabilizing Differentiable Architecture Search via Perturbation-based Regularization

Xiangning Chen, Cho-Jui Hsieh

Keywords: Transfer, Multitask and Meta-learning

15:13

22/11/2021

Variance-stationary Differentiable NAS

Hyeokjun Choe, Byunggook Na, Jisoo Mok, Sungroh Yoon

Keywords: darts, differentiable nas, one-shot nas, neural architecture search, nas, architecture parameter, automl, vs-darts, vsdarts, variance-stationary

3:08

18/07/2021

iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients

Miao Zhang, Steven Su, Shirui Pan, Xiaojun Chang, Mohammad Abbasnejad, Reza Haffari

Keywords: Algorithms, AutoML

5:19

03/05/2021

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators

Xiangxiang Chu, Victor Wang, Bo Zhang, Shun Lu, Xiaolin Wei, Junchi Yan

Keywords: neural architecture search, DARTS stability

5:01

06/12/2020

Theory-Inspired Path-Regularized Differential Network Architecture Search

Pan Zhou, Caiming Xiong, Richard Socher, Steven Hoi

3:18

03/05/2021

Geometry-Aware Gradient Algorithms for Neural Architecture Search

Liam Li, Misha Khodak, Nina Balcan, Ameet Talwalkar

Keywords: weight-sharing, neural architecture search, optimization, automated machine learning

12:16

22/11/2021

Noisy Differentiable Architecture Search

Xiangxiang Chu, Bo Zhang

Keywords: Neural architecture search, AutoML

2:30

26/04/2020

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, Hongkai Xiong

Keywords: Neural Architecture Search, DARTS, Regularization, Normalization

4:40

14/06/2020

Towards Causal VQA: Revealing and Reducing Spurious Correlations by Invariant and Covariant Semantic Editing

Vedika Agarwal, Rakshith Shetty, Mario Fritz

Keywords: robustness, vqa, causality, gan, dataset, evaluation, automated semantic scene editing, data augmentation, invariance, covariance

1:00

14/06/2020

Quasi-Newton Solver for Robust Non-Rigid Registration

Yuxin Yao, Bailin Deng, Weiwei Xu, Juyong Zhang

Keywords: non-rigid registration, robust estimator, quasi-newton, welsch's function, mm algorithm, l-bfgs, deformation graph

4:56

22/11/2021

DU-DARTS: Decreasing the Uncertainty of Differentiable Architecture Search

Shun Lu, Yu Hu, Longxing Yang, Zihao Sun, Jilin Mei, Yiming Zeng, Xiaowei Li

Keywords: neural architecture search, differentiable NAS, decrease uncertainty

3:01

26/04/2020

Double Neural Counterfactual Regret Minimization

Hui Li, Kailiang Hu, Shaohua Zhang, Yuan Qi, Le Song

Keywords: Counterfactual Regret Minimization, Imperfect Information game, Neural Strategy, Deep Learning, Robust Sampling

4:49

14/06/2020

A Graduated Filter Method for Large Scale Robust Estimation

Huu Le, Christopher Zach

Keywords: robust fitting, bundle adjustment, non-convex, poor local minima, non-linear least squares, graduated non-convexity

1:01

26/04/2020

NAS evaluation is frustratingly hard

Antoine Yang, Pedro M. Esperança, Fabio M. Carlucci

Keywords: neural architecture search, nas, benchmark, reproducibility, harking

4:56

26/04/2020

Short and Sparse Deconvolution --- A Geometric Approach

Yenson Lau, Qing Qu, Han-Wen Kuo, Pengcheng Zhou, Yuqian Zhang, John Wright

7:18

26/08/2020

Robust Optimisation Monte Carlo

Borislav Ikonomov, Michael U. Gutmann

14:13

03/05/2021

Rethinking Architecture Selection in Differentiable NAS

Ruochen Wang, Minhao Cheng, Xiangning Chen, Xiaocheng Tang, Cho-Jui Hsieh

17:22

14/06/2020

Rethinking Differentiable Search for Mixed-Precision Neural Networks

Zhaowei Cai, Nuno Vasconcelos

Keywords: mixed-precision network, bit allocation, differentiable, architecture search

1:01

04/08/2021

Group testing and local search: is there a computational-statistical gap?

Fotis Iliopoulos, Ilias Zadik

17:50

22/11/2021

Neighborhood-Aware Neural Architecture Search

Xiaofang Wang, Shengcao Cao, Mengtian Li, Kris Kitani

Keywords: Neural Architecture Search, Generalization, Flat Minima

2:45

19/08/2021

CIMON: Towards High-quality Hash Codes

Xiao Luo, Daqing Wu, Zeyu Ma, Chong Chen, Minghua Deng, Jinwen Ma, Zhongming Jin, Jianqiang Huang, Xian-Sheng Hua

Keywords: Computer Vision, Recognition, Information Retrieval

14:20

02/02/2021

Learning Precise Temporal Point Event Detection with Misaligned Labels

Julien Schroeter, Kirill Sidorov, David Marshall

21:24

02/02/2021

EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet Allocation

Qi Zhou, Haipeng Chen, Yitao Zheng, Zhen Wang

19:28

03/05/2021

DrNAS: Dirichlet Neural Architecture Search

Xiangning Chen, Ruochen Wang, Minhao Cheng, Xiaocheng Tang, Cho-Jui Hsieh

5:00

06/12/2021

Smooth Bilevel Programming for Sparse Regularization

Clarice Poon, Gabriel Peyré

Keywords: machine learning

13:06

13/04/2021

Scalable gaussian process variational autoencoders

Metod Jazbec, Matt Ashman, Vincent Fortuin, Michael Pearce, Stephan Mandt, Gunnar Rätsch

3:10

06/12/2020

Tight First- and Second-Order Regret Bounds for Adversarial Linear Bandits

Shinji Ito, Shuichi Hirahara, Tasuku Soma, Yuichi Yoshida

3:24

14/06/2020

Transferring and Regularizing Prediction for Semantic Segmentation

Yiheng Zhang, Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Dong Liu, Tao Mei

Keywords: semantic segmentation, domain adaptation, adversarial learning

0:58

14/06/2020

Equalization Loss for Long-Tailed Object Recognition

Jingru Tan, Changbao Wang, Buyu Li, Quanquan Li, Wanli Ouyang, Changqing Yin, Junjie Yan

Keywords: long tail, object detection, lvis, object recognition

1:00

20/08/2020

The Simple Essence of Algebraic Subtyping: Principal Type Inference with Subtyping Made Easy (Functional Pearl)

Lionel Parreaux

Keywords: subtyping, principal types, type inference

14:39

06/12/2021

Closing the Gap: Tighter Analysis of Alternating Stochastic Gradient Methods for Bilevel Problems

Tianyi Chen, Yuejiao Sun, Wotao Yin

Keywords: theory, optimization, reinforcement learning and planning, machine learning

7:27

03/05/2021

Contemplating Real-World Object Classification

Ali Borji

Keywords: Robustness, object recognition, deep learning, ObjectNet

5:12

02/02/2021

Rethinking Bi-Level Optimization in Neural Architecture Search: A Gibbs Sampling Perspective

Chao Xue, Xiaoxing Wang, Junchi Yan, Yonggang Hu, Xiaokang Yang, Kewei Sun

18:56

19/08/2021

Accelerating Neural Architecture Search via Proxy Data

Byunggook Na, Jisoo Mok, Hyeokjun Choe, Sungroh Yoon

Keywords: Machine Learning, Deep Learning, Classification

11:25

03/05/2021

Deciphering and Optimizing Multi-Task Learning: a Random Matrix Approach

Malik Tiomoko, Hafiz Tiomoko Ali, Romain Couillet

Keywords: Transfer Learning, Random Matrix Theory, Multi Task Learning

11:15

06/12/2021

Iteratively Reweighted Least Squares for Basis Pursuit with Global Linear Convergence Rate

Christian Kümmerle, Claudio Mayrink Verdun, Dominik Stöger

Keywords: theory, optimization, machine learning

14:17

02/02/2021

Frequency Consistent Adaptation for Real World Super Resolution

Xiaozhong Ji, Guangpin Tao, Yun Cao, Ying Tai, Tong Lu, Chengjie Wang, Jilin Li, Feiyue Huang

14:32

26/08/2020

Linearly Convergent Frank-Wolfe without Line-Search

Fabian Pedregosa, Geoffrey Negiar, Armin Askari, Martin Jaggi

10:14

02/02/2021

On Generating Plausible Counterfactual and Semi-Factual Explanations for Deep Learning

Eoin M. Kenny, Mark T Keane

17:38

13/04/2021

Principal component regression with semirandom observations via matrix completion

Aditya Bhaskara, Aravinda Kanchana Ruwanpathirana, Maheshakya Wijewardena

2:48