Quantile Bandits for Best Arms Identification

18/07/2021

Quantile Bandits for Best Arms Identification

Mengyan Zhang, Cheng Soon Ong

Keywords: Algorithms, AutoML, Applications, Object Detection; Deep Learning, CNN Architectures, Reinforcement Learning and Planning, Bandits

Abstract Paper Similar Papers

Abstract: We consider a variant of the best arm identification task in stochastic multi-armed bandits. Motivated by risk-averse decision-making problems, our goal is to identify a set of $m$ arms with the highest $\tau$-quantile values within a fixed budget. We prove asymmetric two-sided concentration inequalities for order statistics and quantiles of random variables that have non-decreasing hazard rate, which may be of independent interest. With these inequalities, we analyse a quantile version of Successive Accepts and Rejects (Q-SAR). We derive an upper bound for the probability of arm misidentification, the first justification of a quantile based algorithm for fixed budget multiple best arms identification. We show illustrative experiments for best arm identification.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Mean-based Best Arm Identification in Stochastic Bandits under Reward Contamination

Arpan Mukherjee, Ali Tajer, Pin-Yu Chen, Payel Das

Keywords Paper

theory, bandits

0

0

0

0

15:07

06/12/2021

Multi-Armed Bandits with Bounded Arm-Memory: Near-Optimal Guarantees for Best-Arm Identification and Regret Minimization

Arnab Maiti, Vishakha Patil, Arindam Khan

Keywords Paper

theory, bandits

0

0

0

0

14:33

19/08/2021

Optimal Algorithms for Range Searching over Multi-Armed Bandits

Siddharth Barman, Ramakrishnan Krishnamurthy, Saladi Rahul

Keywords Paper

Machine Learning, Online Learning

0

0

0

0

14:43

06/12/2020

Optimal Best-arm Identification in Linear Bandits

Yassir Jedra, Alexandre Proutiere

Keywords Paper

0

0

0

0

3:21

06/12/2021

A unified framework for bandit multiple testing

Ziyu Xu, Ruodu Wang, Aaditya Ramdas

Keywords Paper

theory, reinforcement learning and planning, bandits

0

0

0

0

13:39

12/07/2020

Robust Outlier Arm Identification

Yinglun Zhu, Sumeet Katariya, Robert Nowak

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

12:22

18/07/2021

Combinatorial Blocking Bandits with Stochastic Delays

Alexia Atsidakou, Orestis Papadigenopoulos, Soumya Basu and
Constantine Caramanis, Sanjay Shakkottai

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

5:12

06/12/2020

Batched Coarse Ranking in Multi-Armed Bandits

Nikolai Karpov, Qin Zhang

Keywords Paper

0

0

0

0

3:20

18/07/2021

Problem Dependent View on Structured Thresholding Bandit Problems

James Cheshire, Pierre MENARD, Alexandra Carpentier

Keywords Paper

Algorithms, Online Learning, Algorithms, Bandit Algorithms, Reinforcement Learning and Planning, Bandits

0

0

0

0

4:49

06/12/2021

Bandits with many optimal arms

Rianne de Heide, James Cheshire, Pierre Ménard, Alexandra Carpentier

Keywords Paper

bandits

0

0

0

0

12:23

06/12/2021

Online Multi-Armed Bandits with Adaptive Inference

Maria Dimakopoulou, Zhimei Ren, Zhengyuan Zhou

Keywords Paper

theory, reinforcement learning and planning, bandits, online learning, causality

0

0

0

0

17:11

09/07/2020

The Influence of Shape Constraints on the Thresholding Bandit Problem

James Cheshire, Pierre Menard, Alexandra Carpentier

Keywords Paper

Bandit problems, Convex optimization

0

0

0

0

14:51

26/08/2020

Budget-Constrained Bandits over General Cost and Reward Distributions

Semih Cayci, Atilla Eryilmaz, R Srikant

Keywords Paper

0

0

0

0

10:40

18/07/2021

Resource Allocation in Multi-armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism

Brijen Thananjeyan, Kirthevasan Kandasamy, Ion Stoica and
Michael Jordan, Ken Goldberg, Joseph E Gonzalez

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

20:41

18/07/2021

Probabilistic Sequential Shrinking: A Best Arm Identification Algorithm for Stochastic Bandits with Corruptions

Zixin Zhong, Wang Chi Cheung, Vincent Tan

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

4:54

12/07/2020

On conditional versus marginal bias in multi-armed bandits

Jaehyeok Shin, Aaditya Ramdas, Alessandro Rinaldo

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

14:10

06/12/2020

Online Algorithm for Unsupervised Sequential Selection with Contextual Information

Arun Verma, Manjesh Kumar Hanawal, Csaba Szepesvari, Venkatesh Saligrama

Keywords Paper

0

0

0

0

3:21

13/04/2021

Contextual blocking bandits

Soumya Basu, Orestis Papadigenopoulos, Constantine Caramanis, Sanjay Shakkottai

Keywords Paper

0

0

0

0

2:47

18/07/2021

Top-k eXtreme Contextual Bandits with Arm Hierarchy

Rajat Sen, Alexander Rakhlin, Lexing Ying and
Rahul Kidambi, Dean Foster, Daniel Hill, Inderjit Dhillon

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

5:23

18/07/2021

The Symmetry between Arms and Knapsacks: A Primal-Dual Approach for Bandits with Knapsacks

Xiaocheng Li, Chunlin Sun, Yinyu Ye

Keywords Paper

Algorithms, Online Learning, Algorithms, Bandit Algorithms, Reinforcement Learning and Planning, Bandits

0

0

0

0

18:17

12/07/2020

The Intrinsic Robustness of Stochastic Bandits to Strategic Manipulation

Zhe Feng, David Parkes, Haifeng Xu

Keywords Paper

Learning Theory

0

0

0

0

12:47

18/07/2021

Dynamic Planning and Learning under Recovering Rewards

David Simchi-Levi, Zeyu Zheng, Feng Zhu

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

4:53

06/12/2020

Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms

Mohsen Bayati, Nima Hamidi, Ramesh Johari, Khashayar Khosravi

Keywords Paper

0

0

0

0

3:23

26/08/2020

A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option

P Sharoff, Nishant Mehta, Ravi Ganti

Keywords Paper

0

0

0

0

15:01

19/08/2021

State-Aware Value Function Approximation with Attention Mechanism for Restless Multi-armed Bandits

Shuang Wu, Jingyu Zhao, Guangjian Tian, Jun Wang

Keywords Paper

Agent-based and Multi-agent Systems, Multi-agent Planning, Resource Allocation, Planning and Scheduling, Markov Decisions Processes

0

0

0

0

14:56

26/08/2020

A Novel Confidence-Based Algorithm for Structured Bandits

Andrea Tirinzoni, Alessandro Lazaric, Marcello Restelli

Keywords Paper

0

0

0

0

12:17

06/12/2021

Recurrent Submodular Welfare and Matroid Blocking Semi-Bandits

Orestis Papadigenopoulos, Constantine Caramanis

Keywords Paper

bandits

0

0

0

0

12:28

06/12/2021

Stochastic bandits with groups of similar arms.

Fabien Pesquerel, Hassan SABER, Odalric-Ambrym Maillard

Keywords Paper

optimization, generative model, bandits

0

0

0

0

13:22

18/11/2020

Thompson sampling for unsupervised sequential selection

Arun Verma, Manjesh K Hanawal, Nandyala Hemachandra

Keywords Paper

0

0

0

0

9:22

06/12/2021

Optimal Best-Arm Identification Methods for Tail-Risk Measures

Shubhada Agrawal, Wouter Koolen, Sandeep Juneja

Keywords Paper

optimization, reinforcement learning and planning, bandits

0

0

0

0

12:59

06/12/2020

Finite-Time Analysis of Round-Robin Kullback-Leibler Upper Confidence Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian Rewards

Vrettos Moulos

Keywords Paper

0

0

0

0

3:10

26/08/2020

The True Sample Complexity of Identifying Good Arms

Julian Katz-Samuels, Kevin Jamieson

Keywords Paper

0

0

0

0

12:16

06/12/2021

Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification

Clémence Réda, Andrea Tirinzoni, Rémy Degenne

Keywords Paper

theory, reinforcement learning and planning, bandits

0

0

0

0

14:14

06/12/2021

Fair Algorithms for Multi-Agent Multi-Armed Bandits

Safwan Hossain, Evi Micha, Nisarg Shah

Keywords Paper

bandits, fairness

0

0

0

0

13:32

06/12/2021

A/B/n Testing with Control in the Presence of Subpopulations

Yoan Russac, Christina Katsimerou, Dennis Bohle and
Olivier Cappé, Aurélien Garivier, Wouter Koolen

Keywords Paper

reinforcement learning and planning, bandits

0

0

0

0

12:04

02/02/2021

Disposable Linear Bandits for Online Recommendations

Melda Korkut, Andrew Li

Keywords Paper

0

0

0

0

17:20

18/07/2021

Optimal Streaming Algorithms for Multi-Armed Bandits

Tianyuan Jin, Keke Huang, Jing Tang, Xiaokui Xiao

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

5:06

06/12/2020

On Regret with Multiple Best Arms

Yinglun Zhu, Robert Nowak

Keywords Paper

0

0

0

0

3:22

26/08/2020

Old Dog Learns New Tricks: Randomized UCB for Bandit Problems

Sharan Vaswani, Abbas Mehrabian, Audrey Durand, Branislav Kveton

Keywords Paper

0

0

0

0

15:10

26/08/2020

Fixed-confidence guarantees for Bayesian best-arm identification

Xuedong Shang, Rianne de Heide, Pierre Menard and
Emilie Kaufmann, Michal Valko

Keywords Paper

0

0

0

0

14:59