Model Selection for Production System via Automated Online Experiments

06/12/2020

Model Selection for Production System via Automated Online Experiments

Zhenwen Dai, Praveen Chandar, Ghazal Fazelnia, Benjamin Carterette, Mounia Lalmas

Keywords: Algorithms -> Relational Learning; Algorithms -> Representation Learning, Deep Learning

Abstract Paper Similar Papers

Abstract: A challenge that machine learning practitioners in the industry face is the task of selecting the best model to deploy in production. As a model is often an intermediate component of a production system, online controlled experiments such as A/B tests yield the most reliable estimation of the effectiveness of the whole system, but can only compare two or a few models due to budget constraints. We propose an automated online experimentation mechanism that can efficiently perform model selection from a large pool of models with a small number of online experiments. We derive the probability distribution of the metric of interest that contains the model uncertainty from our Bayesian surrogate model trained using historical logs. Our method efficiently identifies the best model by sequentially selecting and deploying a list of models from the candidate set that balance exploration-exploitation. Using simulations based on real data, we demonstrate the effectiveness of our method on two different tasks.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/09/2020

Model Bridging: Connection between Simulation Model and Neural Network

Keiichi Kisamori, Keisuke Yamazaki, Yuto Komori, Hiroshi Tokieda

Keywords Paper

interpretability, simulation model, kernel mean embedding, data assimilation

0

0

0

0

14:20

19/10/2020

Autonomous predictive modeling via reinforcement learning

Udayan Khurana, Horst Samulowitz

Keywords Paper

reinforcement learning, data science automation, automated machine learning

0

0

0

0

4:21

06/12/2021

An Information-theoretic Approach to Distribution Shifts

Marco Federici, Ryota Tomioka, Patrick Forré

Keywords Paper

theory, deep learning, machine learning, graph learning, domain adaptation, representation learning

0

0

0

0

9:50

18/07/2021

Meta-learning Hyperparameter Performance Prediction with Neural Processes

Ying WEI, Peilin Zhao, Junzhou Huang

Keywords Paper

Algorithms, Supervised Learning

0

0

0

0

5:07

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng , Ben Letham, Hongzi Mao, Eytan Bakshy

Keywords Paper

0

0

0

0

3:29

26/10/2020

Solving the Test Laboratory Scheduling Problem with Variable Task Grouping

Philipp Danzinger, Tobias Geibinger, Florian Mischek, Nysret Musliu

Keywords Paper

Project scheduling, Constraint programming, Large neighborhood search, Real world

0

0

0

0

10:52

06/12/2020

Online Bayesian Goal Inference for Boundedly Rational Planning Agents

Tan Zhi-Xuan, Jordyn Mann, Tom Silver and
Josh Tenenbaum, Vikash Mansinghka

Keywords Paper

0

0

0

0

3:23

16/11/2020

Model-Based Inverse Reinforcement Learning from Visual Demonstrations

Neha Das, Sarah Bechtle, Todor Davchev and
Dinesh Jayaraman, Akshara Rai, Franziska Meier

Keywords Paper

0

0

0

0

5:03

12/07/2020

Learning to Rank Learning Curves

Martin Wistuba, Tejaswini Pedapati

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

15:34

06/12/2021

Near-Optimal Multi-Perturbation Experimental Design for Causal Structure Learning

Scott Sussex, Caroline Uhler, Andreas Krause

Keywords Paper

causality

0

0

0

0

14:14

06/12/2020

Learning Augmented Energy Minimization via Speed Scaling

Etienne Bamas, Andreas Maggiori, Lars Rohwedder, Ola Svensson

Keywords Paper

0

0

0

0

3:05

03/08/2020

Semi-bandit Optimization in the Dispersed Setting

Travis Dick, Wesley Pegden, Maria-Florina Balcan

Keywords Paper

0

0

0

0

8:04

06/12/2021

Robust Generalization despite Distribution Shift via Minimum Discriminating Information

Tobias Sutter, Andreas Krause, Daniel Kuhn

Keywords Paper

optimization, machine learning

0

0

0

0

15:05

06/12/2020

Influence-Augmented Online Planning for Complex Environments

Jinke He, Miguel Suau de Castro, Frans Oliehoek

Keywords Paper

Algorithms -> AutoML; Optimization -> Non-Convex Optimization; Probabilistic Methods; Probabilistic Methods -> Bayesian Theory, Probabilistic Methods -> Gaussian Processes

0

0

0

0

3:19

26/10/2020

Generating and Exploiting Cost Predictions in Heuristic State-Space Planning

Francesco Percassi, Alfonso E. Gerevini, Enrico Scala and
Ivan Serina, Mauro Vallati

Keywords Paper

Predicting Plan's Cost, Learning for Domain-Independent Planning, Improving Best-First Search Schema

0

0

0

0

9:52

12/07/2020

A distributional view on multi objective policy optimization

Abbas Abdolmaleki, Sandy Huang, Leonard Hasenclever and
Michael Neunert, Martina Zambelli, Murilo Martins, Francis Song, Nicolas Heess, Raia Hadsell, Martin Riedmiller

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:04

16/11/2020

A User’s Guide to Calibrating Robotic Simulators

Bhairav Mehta, Ankur Handa, Dieter Fox, Fabio Ramos

Keywords Paper

0

0

0

0

4:33

06/12/2021

Design of Experiments for Stochastic Contextual Linear Bandits

Andrea Zanette, Kefan Dong, Jonathan N Lee, Emma Brunskill

Keywords Paper

reinforcement learning and planning, bandits

0

0

0

0

13:58

23/08/2020

Towards automated neural interaction discovery for click-through rate prediction

Qingquan Song, Dehua Cheng, Hanning Zhou and
Jiyan Yang, Yuandong Tian, Xia Hu

Keywords Paper

neural architecture search, evolutionary algorithm, CTR prediction

0

0

0

0

18:00

06/12/2021

Efficient Online Estimation of Causal Effects by Deciding What to Observe

Shantanu Gupta, Zachary Lipton, David Childers

Keywords Paper

reinforcement learning and planning, graph learning, causality

0

0

0

0

14:18

02/02/2021

Deeplite NeutrinoTM: A BlackBox Framework for Constrained Deep Learning Model Optimization

Anush Sankaran, Olivier Mastropietro, Ehsan Saboori and
Yasser Idris, Davis Sawyer, MohammadHossein AskariHemmat, Ghouthi Boukli Hacene

Keywords Paper

0

0

0

0

18:29

06/12/2020

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte and
Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

Keywords Paper

0

0

0

0

3:20

12/07/2020

Online metric algorithms with untrusted predictions

Antonios Antoniadis, Christian Coester, Marek Elias and
Adam Polak, Bertrand Simon

Keywords Paper

Trustworthy Machine Learning

0

0

0

0

15:15

06/12/2020

Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes

Yuval Emek, Ron Lavi, Rad Niazadeh, Yangguang Shi

Keywords Paper

0

0

0

0

3:10

03/05/2021

Few-Shot Bayesian Optimization with Deep Kernel Surrogates

Martin Wistuba, Josif Grabocka

Keywords Paper

automl, bayesian optimization, metalearning, few-shot learning

0

0

0

0

5:18

02/02/2021

Learning Prediction Intervals for Model Performance

Benjamin Elder, Matthew Arnold, Anupama Murthi, Jiří Navrátil

Keywords Paper

0

0

0

0

20:12

14/06/2020

RoboTHOR: An Open Simulation-to-Real Embodied AI Platform

Matt Deitke, Winson Han, Alvaro Herrasti and
Aniruddha Kembhavi, Eric Kolve, Roozbeh Mottaghi, Jordi Salvador, Dustin Schwenk, Eli VanderBilt, Matthew Wallingford, Luca Weihs, Mark Yatskar, Ali Farhadi

Keywords Paper

embodied ai, visual navigation, sim to real transfer, reinforcement learning

0

0

0

0

1:01

19/08/2021

A Comparative Survey: Benchmarking for Pool-based Active Learning

Xueying Zhan, Huan Liu, Qing Li, Antoni B. Chan

Keywords Paper

Machine learning, General

0

0

0

0

15:13

19/08/2021

Online Learning of Action Models for PDDL Planning

Leonardo Lamanna, Alessandro Saetti, Luciano Serafini and
Alfonso Gerevini, Paolo Traverso

Keywords Paper

Planning and Scheduling, Model-Based Reasoning, Planning Algorithms, Planning and Scheduling, Action, Change and Causality

0

0

0

0

13:38

25/04/2020

Understanding and Visualizing Data Iteration in Machine Learning

Fred Hohman, Kanit Wongsuphasawat, Mary Beth Kery, Kayur Patel

Keywords Paper

data iteration, evolving datasets, machine learning iteration, visual analytics, interactive interfaces

0

0

0

0

15:02

06/12/2020

Learning Causal Effects via Weighted Empirical Risk Minimization

Yonghan Jung, Jin Tian, Elias Bareinboim

Keywords Paper

0

0

0

0

3:18

06/12/2021

Efficient Training of Retrieval Models using Negative Cache

Erik Lindgren, Sashank Reddi, Ruiqi Guo, Sanjiv Kumar

Keywords Paper

deep learning, machine learning

0

0

0

0

10:41

18/11/2020

Deep-n-cheap: An automated search framework for low complexity deep learning

Sourya Dey, Saikrishna C. Kanala, Keith M. Chugg, Peter A. Beerel

Keywords Paper

0

0

0

0

11:59

26/10/2020

Exploring Context-Free Languages via Planning: The Case for Automating Machine Learning

Michael Katz, Parikshit Ram, Shirin Sohrabi, Octavian Udrea

Keywords Paper

Context-Free Grammar, HTN Planning, Classical Planning, AutoML

0

0

0

0

9:25

15/06/2020

Daydream: Accurately Estimating the Efficacy of Optimizations for DNN Training

Hongyu Zhu, Amar Phanishayee, Gennady Pekhimenko

Keywords Paper

0

0

0

0

25:03

03/05/2021

Learning Mesh-Based Simulation with Graph Networks

Tobias Pfaff, Meire Fortunato, Alvaro Sanchez Gonzalez, Peter Battaglia

Keywords Paper

mesh, graph networks, simulation, physics

0

0

0

0

10:41

06/12/2020

Probabilistic Active Meta-Learning

Jean Kaddour, Steindor Saemundsson, Marc Deisenroth

Keywords Paper

0

0

0

0

3:17

06/12/2021

Adversarial Regression with Doubly Non-negative Weighting Matrices

Tam Le, Truyen Nguyen, Makoto Yamada and
Jose Blanchet, Viet Anh Nguyen

Keywords Paper

machine learning

0

0

0

0

7:27

02/02/2021

Physarum Powered Differentiable Linear Programming Layers and Applications

Zihang Meng, Sathya N. Ravi, Vikas Singh

Keywords Paper

0

0

0

0

16:57

13/04/2021

Competing AI: How does competition feedback affect machine learning?

Tony Ginart, Eva Zhang, Yongchan Kwon, James Zou

Keywords Paper

0

0

0

0

3:10