05/04/2021

Accounting for Variance in Machine Learning Benchmarks

Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Nazanin Mohammadi Sepahvand, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Tal Arbel, Chris Pal, Gael Varoquaux, Pascal Vincent

Keywords:

Abstract: Strong empirical evidence that one machine-learning algorithm A outperforms another one B ideally calls for multiple trials optimizing the learning pipeline over sources of variation such as data sampling, data augmentation, parameter initialization, and hyperparameter choices. This is prohibitively expensive, and corners are cut to reach conclusions. We model the whole benchmarking process and all sources of variation, revealing that variance due to data sampling, parameter initialization, and hyperparameter choice markedly impacts machine-learning benchmarks. We analyze the predominant comparison methods used today in light of this variance. We show the counter-intuitive result that a biased estimator with more sources of variation gives results closer to the ideal estimator, at a 51× reduction in compute cost. Using this, we perform a detailed study of the error rate of detecting improvements on five different deep-learning tasks/architectures. This study leads us to propose recommendations for future performance comparisons.
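
To illustrate the abstract's central point, here is a minimal, hypothetical sketch (not the authors' code): it compares the spread of benchmark scores when only the parameter-initialization seed is varied against the spread when several sources of variation (data sampling, initialization, hyperparameter choice) are randomized together. The pipeline, noise magnitudes, and seed scheme are stand-ins chosen purely for illustration.

import numpy as np


def train_and_evaluate(data_seed, init_seed, hp_seed):
    """Toy pipeline: each source of variation perturbs the score a little.

    The base score and noise scales are made-up numbers standing in for a
    real training run.
    """
    data_noise = np.random.default_rng(data_seed).normal(0, 0.010)  # data sampling
    init_noise = np.random.default_rng(init_seed).normal(0, 0.005)  # weight initialization
    hp_noise = np.random.default_rng(hp_seed).normal(0, 0.008)      # hyperparameter choice
    return 0.90 + data_noise + init_noise + hp_noise                # "accuracy"


def scores(n_trials, vary_all):
    out = []
    for t in range(n_trials):
        if vary_all:
            # Randomize every source of variation on each trial.
            s = train_and_evaluate(data_seed=t, init_seed=1000 + t, hp_seed=2000 + t)
        else:
            # Only the initialization seed changes; data split and
            # hyperparameters stay fixed, so the variance of the
            # benchmark score is underestimated.
            s = train_and_evaluate(data_seed=0, init_seed=1000 + t, hp_seed=0)
        out.append(s)
    return np.array(out)


fixed = scores(30, vary_all=False)
varied = scores(30, vary_all=True)
print(f"std (init seed only)     : {fixed.std(ddof=1):.4f}")
print(f"std (all sources varied) : {varied.std(ddof=1):.4f}")

Running the sketch shows a visibly larger standard deviation when all sources are varied, which is the kind of underestimated variance the paper argues can lead to unreliable conclusions about whether algorithm A truly outperforms algorithm B.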

The video of this talk cannot be embedded. You can watch it here:
https://slideslive.com/38952709
The talk and the respective paper are published at the MLSYS 2021 virtual conference.
