A New Theoretical Framework for Fast and Accurate Online Decision-Making

Abstract: We introduce a novel theoretical framework for Return On Investment (ROI) maximization in repeated decision-making. Our setting is motivated by the use case of companies that regularly receive proposals for technological innovations and want to quickly decide whether they are worth implementing. We design an algorithm for learning ROI-maximizing decision-making policies over a sequence of innovation proposals. Our algorithm provably converges to an optimal policy in class $\Pi$ at a rate of order $\min\big\{1/(N\Delta^2),N^{-1/3}\}$, where $N$ is the number of innovations and $\Delta$ is the suboptimality gap in $\Pi$. A significant hurdle of our formulation, which sets it aside from other online learning problems such as bandits, is that running a policy does not provide an unbiased estimate of its performance.

06/12/2020

Heuristic Search and Game Playing, Combinatorial Search and Optimisation, Heuristic Search, Meta-Reasoning and Meta-Heuristics

13:54

03/05/2021

A New Theoretical Framework for Fast and Accurate Online Decision-Making

Nicolò Cesa-Bianchi, Tom Cesari, Yishay Mansour, Vianney Perchet

Comments

Similar Papers

Learning Augmented Energy Minimization via Speed Scaling

Etienne Bamas, Andreas Maggiori, Lars Rohwedder, Ola Svensson

Keywords Abstract Paper

Faster Matchings via Learned Duals

Michael Dinitz, Sungjin Im, Thomas Lavastida and Benjamin Moseley, Sergei Vassilvitskii

Keywords Abstract Paper

theory, optimization

Adaptivity in Adaptive Submodularity

Hossein Esfandiari, Amin Karbasi, Vahab Mirrokni

Keywords Abstract Paper

Hardware-Aware Neural Architecture Search: Survey and Taxonomy

Hadjer Benmeziane, Kaoutar El Maghraoui, Hamza Ouarnoughi and Smail Niar, Martin Wistuba, Naigang Wang

Keywords Abstract Paper

Machine learning, General, General, General

Generating and Exploiting Cost Predictions in Heuristic State-Space Planning

Francesco Percassi, Alfonso E. Gerevini, Enrico Scala and Ivan Serina, Mauro Vallati

Keywords Abstract Paper

Predicting Plan's Cost, Learning for Domain-Independent Planning, Improving Best-First Search Schema

Choosing the Right Algorithm With Hints From Complexity Theory

Shouda Wang, Weijie Zheng, Benjamin Doerr

Keywords Abstract Paper

Heuristic Search and Game Playing, Combinatorial Search and Optimisation, Heuristic Search, Meta-Reasoning and Meta-Heuristics

Fidelity-based Deep Adiabatic Scheduling

Eli Ovits, Lior Wolf

Keywords Abstract Paper

Provably Efficient Reinforcement Learning with Linear Function Approximation

Chi Jin, Zhuoran Yang, Zhaoran Wang, Michael Jordan

Keywords Abstract Paper

Reinforcement learning,

High Dimensional Level Set Estimation with Bayesian Neural Network

Huong Ha, Sunil Gupta, Santu Rana, Svetha Venkatesh

Keywords Abstract Paper

Generalization in Portfolio-Based Algorithm Selection

Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

Keywords Abstract Paper

Opening the Blackbox: Accelerating Neural Differential Equations by Regularizing Internal Solver Heuristics

Avik Pal, Yingbo Ma, Viral Shah, Christopher Rackauckas

Keywords Abstract Paper

Deep Learning

Pruning neural networks without any data by iteratively conserving synaptic flow

Hidenori Tanaka, Daniel Kunin, Daniel Yamins, Surya Ganguli

Keywords Abstract Paper

Deep Learning -> Optimization for Deep Networks; Optimization -> Non-Convex Optimization, Theory

Learning to Plan in High Dimensions via Neural Exploration-Exploitation Trees

Binghong Chen, Bo Dai, Qinjie Lin and Guo Ye, Han Liu, Le Song

Keywords Abstract Paper

learning to plan, representation learning, learning to design algorithm, reinforcement learning, meta learning

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Quentin Cappart, Thierry Moisan, Louis-Martin Rousseau and Isabeau Prémont-Schwarz, Andre A. Cire

Keywords Abstract Paper

Autonomous predictive modeling via reinforcement learning

Udayan Khurana, Horst Samulowitz

Keywords Abstract Paper

reinforcement learning, data science automation, automated machine learning

Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

Jie Xu, Yunsheng Tian, Pingchuan Ma and Daniela Rus, Shinjiro Sueda, Wojciech Matusik

Keywords Abstract Paper

Reinforcement Learning - Deep RL

Minimizing Polarization and Disagreement in Social Networks via Link Recommendation

Liwang Zhu, Qi Bao, Zhongzhi Zhang

Keywords Abstract Paper

optimization, graph learning

A Novel Method to Solve Neural Knapsack Problems

Duanshun Li, Jing Liu, Dongeun Lee and Ali S. Mazloom, Giridhar Kaushik , Kookjin Lee, Noseong Park

Keywords Abstract Paper

Deep Learning

Learning Augmented Methods for Matching: Improving Invasive Species Management and Urban Mobility

Johan Bjorck, Qinru Shi, Carrie Brown-Lima and Jennifer Dean, Angela Fuller, Carla Gomes

Keywords Abstract Paper

Evolutionary product description generation: A dynamic fine-tuning approach leveraging user click behavior

Yongzhen Wang, Jian Wang, Heng Huang and Hongsong Li, Xiaozhong Liu

Keywords Abstract Paper

product description generation, neural network, sequence-to-sequence, click-through rate, reinforcement learning

How to 0wn the NAS in Your Spare Time

Sanghyun Hong, Michael Davinroy, Yiǧitcan Kaya and Dana Dachman-Soled, Tudor Dumitraş

Keywords Abstract Paper

Keywords Paper

Michael Dinitz, Sungjin Im, Thomas Lavastida and
Benjamin Moseley, Sergei Vassilvitskii

Keywords Paper

Keywords Paper

Hadjer Benmeziane, Kaoutar El Maghraoui, Hamza Ouarnoughi and
Smail Niar, Martin Wistuba, Naigang Wang

Keywords Paper

Francesco Percassi, Alfonso E. Gerevini, Enrico Scala and
Ivan Serina, Mauro Vallati

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Binghong Chen, Bo Dai, Qinjie Lin and
Guo Ye, Han Liu, Le Song

Keywords Paper

Quentin Cappart, Thierry Moisan, Louis-Martin Rousseau and
Isabeau Prémont-Schwarz, Andre A. Cire

Keywords Paper

Keywords Paper

Jie Xu, Yunsheng Tian, Pingchuan Ma and
Daniela Rus, Shinjiro Sueda, Wojciech Matusik

Keywords Paper

Keywords Paper

Duanshun Li, Jing Liu, Dongeun Lee and
Ali S. Mazloom, Giridhar Kaushik , Kookjin Lee, Noseong Park

Keywords Paper

Johan Bjorck, Qinru Shi, Carrie Brown-Lima and
Jennifer Dean, Angela Fuller, Carla Gomes

Keywords Paper

Yongzhen Wang, Jian Wang, Heng Huang and
Hongsong Li, Xiaozhong Liu

Keywords Paper

Sanghyun Hong, Michael Davinroy, Yiǧitcan Kaya and
Dana Dachman-Soled, Tudor Dumitraş

Keywords Paper

Keywords Paper

Yang Zhang, Bo Tang, Qingyu Yang and
Dou An, Hongyin Tang, Chenyang Xi, Xueying LI, Feiyu Xiong

Keywords Paper

Taoran Ji, Nathan Self, Kaiqun Fu and
Zhiqian Chen, Naren Ramakrishnan, Chang-Tien Lu

Keywords Paper

Vitchyr Pong, Murtaza Dalal, Steven Lin and
Ashvin Nair, Shikhar Bahl, Sergey Levine

Keywords Paper

Keywords Paper

Deepak Narayanan, Keshav Santhanam, Fiodar Kazhamiaka and
Amar Phanishayee, Matei Zaharia

Keywords Paper

Sahil Manchanda, Akash MITTAL, Anuj Dhawan and
Sourav Medya, Sayan Ranu, Ambuj K Singh

Keywords Paper

Keywords Paper

Zhuoran Yang, Chi Jin, Zhaoran Wang and
Mengdi Wang, Michael Jordan

Keywords Paper

Keywords Paper

Keywords Paper

Panteha Naderian, Gabriel Loaiza-Ganem, Harry Braviner and
Anthony Caterini, Jesse C Cresswell, Tong Li, Animesh Garg

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper