Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting

Abstract: We design and analyze CascadeBAI, an algorithm for finding the best set of K items, also called an arm, within the framework of cascading bandits. An upper bound on the time complexity of CascadeBAI is derived by overcoming a crucial analytical challenge, namely, that of probabilistically estimating the amount of available feedback at each step. To do so, we define a new class of random variables (r.v.'s) which we term as left-sided sub-Gaussian r.v.'s; these are r.v.'s whose cumulant generating functions (CGFs) can be bounded by a quadratic only for non-positive arguments of the CGFs. This enables the application of a sufficiently tight Bernstein-type concentration inequality. We show, through the derivation of a lower bound on the time complexity, that the performance of CascadeBAI is optimal in some practical regimes. Finally, extensive numerical simulations corroborate the efficacy of CascadeBAI as well as the tightness of our upper bound on its time complexity.

06/12/2020

Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting

Zixin Zhong, Wang Chi Cheung, Vincent Tan

Comments

Similar Papers

Extrapolation Towards Imaginary 0-Nearest Neighbour and Its Improved Convergence Rate

Akifumi Okuno, Hidetoshi Shimodaira

Keywords Abstract Paper

Efficiently learning structured distributions from untrusted batches

Sitan Chen, Jerry Li, Ankur Moitra

Keywords Abstract Paper

sum-of-squares, federated learning, VC complexity, Robust statistics

Bayesian experimental design using regularized determinantal point processes

Michal Derezinski, Feynman Liang, Michael Mahoney

Keywords Abstract Paper

Randomized Algorithms for Submodular Function Maximization with a $k$-System Constraint

Shuang Cui, Kai Han, Tianshuai Zhu and Jing Tang, Benwei Wu, He Huang

Keywords Abstract Paper

Hamiltonian Monte Carlo using an adjoint-differentiated Laplace approximation: Bayesian inference for latent Gaussian models and beyond

Charles Margossian, Aki Vehtari, Daniel Simpson, Raj Agrawal

Keywords Abstract Paper

Streaming k-Submodular Maximization under Noise subject to Size Constraint

Lan N. Nguyen, My T. Thai

Keywords Abstract Paper

Simple and sharp analysis of k-means||

Keywords Abstract Paper

Best of Both Worlds: Practical and Theoretically Optimal Submodular Maximization in Parallel

Yixin Chen, Tonmoy Dey, Alan Kuhnle

Keywords Abstract Paper

Fast Noise Removal for k-Means Clustering

Sungjin Im, Mahshid Montazer Qaem, Benjamin Moseley and Xiaorui Sun, Rudy Zhou

Keywords Abstract Paper

Exploration by Optimisation in Partial Monitoring

Tor Lattimore, Csaba Szepesvari

Keywords Abstract Paper

Bandit problems, Online learning

Finite Time Analysis of Linear Two-timescale Stochastic Approximation with Markovian Noise

Maksim Kaledin, Eric Moulines, Alexey Naumov and Vladislav Tadic, Hoi-To Wai

Keywords Abstract Paper

Stochastic optimization, Reinforcement learning

Sample-Optimal PAC Learning of Halfspaces with Malicious Noise

Keywords Abstract Paper

Theory, Computational Learning Theory

Stability and risk bounds of iterative hard thresholding

Xiaotong Yuan, Ping Li

Keywords Abstract Paper

Explicit Mean-Square Error Bounds for Monte-Carlo and Linear Stochastic Approximation

Shuhang Chen, Adithya Devraj, Ana Busic, Sean Meyn

Keywords Abstract Paper

First-Order Methods for Wasserstein Distributionally Robust MDP

Julien Grand-Clement, Christian Kroer

Keywords Abstract Paper

Theory, RL, Decisions and Control Theory

Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination

Dylan J Foster, Akshay Krishnamurthy

Keywords Abstract Paper

theory, reinforcement learning and planning, bandits, online learning

A Fast Spectral Algorithm for Mean Estimation with Sub-Gaussian Rates

Zhixian Lei, Kyle Luh, Prayaag Venkat, Fred Zhang

Keywords Abstract Paper

High-dimensional statistics, Adversarial learning and robustness

On Convergence of Nearest Neighbor Classifiers over Feature Transformations

Luka Rimanic, Cedric Renggli, Bo Li, Ce Zhang

Keywords Abstract Paper

Quick streaming algorithms for maximization of monotone submodular functions in linear time

Keywords Abstract Paper

Exact Optimization of Conformal Predictors via Incremental and Decremental Learning

Giovanni Cherubin, Konstantinos Chatzikokolakis, Martin Jaggi

Keywords Abstract Paper

Submodular Maximization Through Barrier Functions

Ashwinkumar Badanidiyuru, Amin Karbasi, Ehsan Kazemi, Jan Vondrak

Keywords Abstract Paper

Estimating Principal Components under Adversarial Perturbations

Pranjal Awasthi, Xue Chen, Aravindan Vijayaraghavan

Keywords Abstract Paper

Unsupervised and semi-supervised learning, Adversarial learning and robustness

Sequential Density Ratio Estimation for Simultaneous Optimization of Speed and Accuracy

Akinori Ebihara, Taiki Miyagawa, Kazuyuki Sakurai, Hitoshi Imaoka

Keywords Abstract Paper

Density ratio estimation, Early classification, Sequential probability ratio test

Fast Algorithms for $L_\infty$-constrained S-rectangular Robust MDPs

Keywords Paper

Keywords Paper

Keywords Paper

Shuang Cui, Kai Han, Tianshuai Zhu and
Jing Tang, Benwei Wu, He Huang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Sungjin Im, Mahshid Montazer Qaem, Benjamin Moseley and
Xiaorui Sun, Rudy Zhou

Keywords Paper

Keywords Paper

Maksim Kaledin, Eric Moulines, Alexey Naumov and
Vladislav Tadic, Hoi-To Wai

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zhuoran Yang, Chi Jin, Zhaoran Wang and
Mengdi Wang, Michael Jordan

Keywords Paper

Silviu-Marian Udrescu, Andrew Tan, Jiahai Feng and
Orisvaldo Neto, Tailin Wu, Max Tegmark

Keywords Paper

Ilias Diakonikolas, Daniel Kane, Daniel Kongsgaard and
Jerry Li, Kevin Tian

Keywords Paper

Dheeraj Nagaraj, Xian Wu, Guy Bresler and
Prateek Jain, Praneeth Netrapalli

Keywords Paper

Keywords Paper

Keywords Paper