Optimal Algorithms for Multiplayer Multi-Armed Bandits

26/08/2020

Optimal Algorithms for Multiplayer Multi-Armed Bandits

Po-An Wang, Alexandre Proutiere, Kaito Ariu, Yassir Jedra, Alessio Russo

Keywords:

Abstract Paper Similar Papers

Abstract: The paper addresses various Multiplayer Multi-Armed Bandit (MMAB) problems, where M decision-makers, or players, collaborate to maximize their cumulative reward. We first investigate the MMAB problem where players selecting the same arms experience a collision (and are aware of it) and do not collect any reward. For this problem, we present DPE1 (Decentralized Parsimonious Exploration), a decentralized algorithm that achieves the same asymptotic regret as that obtained by an optimal centralized algorithm. DPE1 is simpler than the state-of-the-art algorithm SIC-MMAB Boursier and Perchet (2019), and yet offers better performance guarantees. We then study the MMAB problem without collision, where players may select the same arm. Players sit on vertices of a graph, and in each round, they are able to send a message to their neighbours in the graph. We present DPE2, a simple and asymptotically optimal algorithm that outperforms the state-of-the-art algorithm DD- UCB Mart ́ınez-Rubio et al. (2019). Besides, under DPE2, the expected number of bits transmitted by the players in the graph is finite.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AISTATS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

26/08/2020

A Practical Algorithm for Multiplayer Bandits when Arm Means Vary Among Players

Abbas Mehrabian, Etienne Boursier, Emilie Kaufmann, Vianney Perchet

Keywords Paper

0

0

0

0

15:32

09/07/2020

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Qiaomin Xie, Yudong Chen, Zhaoran Wang, Zhuoran Yang

Keywords Paper

Reinforcement learning, Planning and control

0

0

0

0

15:16

02/02/2021

Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback

Siwei Wang, Haoyun Wang, Longbo Huang

Keywords Paper

0

0

0

0

19:29

26/04/2020

Double Neural Counterfactual Regret Minimization

Hui Li, Kailiang Hu, Shaohua Zhang and
Yuan Qi, Le Song

Keywords Paper

Counterfactual Regret Minimization, Imperfect Information game, Neural Strategy, Deep Learning, Robust Sampling

0

0

0

0

4:49

04/08/2021

Cooperative and Stochastic Multi-Player Multi-Armed Bandit: Optimal Regret With Neither Communication Nor Collisions

Mark Sellke, Sebastien Bubeck, Thomas Budzinski

Keywords Paper

0

0

0

0

9:25

12/07/2020

My Fair Bandit: Distributed Learning of Max-Min Fairness with Multi-player Bandits

Ilai Bistritz, Tavor Baharav, Amir Leshem, Nicholas Bambos

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

15:04

06/12/2020

Joint Policy Search for Multi-agent Collaboration with Imperfect Information

Yuandong Tian, Qucheng Gong, Yu Jiang

Keywords Paper

0

0

0

0

3:32

12/07/2020

Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games

Youzhi Zhang, Bo An

Keywords Paper

Learning Theory

0

0

0

0

15:52

06/12/2021

Recurrent Submodular Welfare and Matroid Blocking Semi-Bandits

Orestis Papadigenopoulos, Constantine Caramanis

Keywords Paper

bandits

0

0

0

0

12:28

06/12/2020

Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for Smooth Minimax Games

Arun Suggala, Praneeth Netrapalli

Keywords Paper

1

1

0

0

3:29

09/07/2020

Selfish Robustness and Equilibria in Multi-Player Bandits

Etienne Boursier, Vianney Perchet

Keywords Paper

Bandit problems, Economics, game theory, and incentives

0

0

0

0

15:07

06/12/2020

Towards More Practical Adversarial Attacks on Graph Neural Networks

Jiaqi Ma, Shuangrui Ding, Qiaozhu Mei

Keywords Paper

0

0

0

0

3:19

02/02/2021

Decentralized Multi-Agent Linear Bandits with Safety Constraints

Sanae Amani, Christos Thrampoulidis

Keywords Paper

0

0

0

0

19:13

06/12/2020

Contextual Games: Multi-Agent Learning with Side Information

Pier Giuseppe Sessa, Ilija Bogunovic, Andreas Krause, Maryam Kamgarpour

Keywords Paper

0

0

0

0

3:30

26/08/2020

The Gossiping Insert-Eliminate Algorithm for Multi-Agent Bandits

Ronshee Chawla, Abishek Sankararaman, Ayalvadi Ganesh, Sanjay Shakkottai

Keywords Paper

0

0

0

0

15:59

06/12/2021

Better Algorithms for Individually Fair $k$-Clustering

Maryam Negahbani, Deeparnab Chakrabarty

Keywords Paper

theory, self-supervised learning, clustering, fairness

0

0

0

0

14:02

06/12/2020

On Regret with Multiple Best Arms

Yinglun Zhu, Robert Nowak

Keywords Paper

0

0

0

0

3:22

04/08/2021

Adaptive Learning in Continuous Games: Optimal Regret Bounds and Convergence to Nash Equilibrium

Yu-Guan Hsieh, Kimon Antonakopoulos, Panayotis Mertikopoulos

Keywords Paper

0

0

0

0

16:09

03/05/2021

Impact of Representation Learning in Linear Bandits

Jiaqi Yang, Wei Hu, Jason Lee, Simon Du

Keywords Paper

multi-task learning, representation learning, linear bandits

0

0

0

0

5:40

13/04/2021

Multitask bandit learning through heterogeneous feedback aggregation

Zhi Wang, Chicheng Zhang, Manish Kumar Singh and
Laurel Riek, Kamalika Chaudhuri

Keywords Paper

0

0

0

0

3:07

09/07/2020

Non-Stochastic Multi-Player Multi-Armed Bandits: Optimal Rate With Collision Information, Sublinear Without

Sebastien Bubeck, Yuanzhi Li, Yuval Peres, Mark Sellke

Keywords Paper

Bandit problems,

0

0

0

0

10:13

06/12/2020

Cooperative Multi-player Bandit Optimization

Ilai Bistritz, Nicholas Bambos

Keywords Paper

0

0

0

0

3:13

26/08/2020

Thresholding Graph Bandits with GrAPL

Daniel LeJeune, Gautam Dasarathy, Richard Baraniuk

Keywords Paper

0

0

0

0

6:05

02/02/2021

Signaling in Bayesian Network Congestion Games: the Subtle Power of Symmetry

Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Nicola Gatti

Keywords Paper

0

0

0

0

15:03

06/12/2020

Online Influence Maximization under Linear Threshold Model

Shuai Li, Fang Kong, Kejie Tang and
Qizhi Li, Wei Chen

Keywords Paper

0

0

0

0

3:15

03/08/2020

Brief announcement: Deterministic lower bound for dynamic balanced graph partitioning

Maciej Pacut, Mahmoud Parham, Stefan Schmid

Keywords Paper

online algorithms, graph partitioning, self-adjusting networks

0

0

0

0

10:22

18/07/2021

Parametric Graph for Unimodal Ranking Bandit

CamilleS GAUTHIER, Romaric Gaudel, Elisa Fromont, Boammani Aser Lompo

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

4:49

02/02/2021

On the Approximation of Nash Equilibria in Sparse Win-Lose Multi-player Games

Zhengyang Liu, Jiawei Li, Xiaotie Deng

Keywords Paper

0

0

0

0

16:33

12/07/2020

Provable Self-Play Algorithms for Competitive Reinforcement Learning

Yu Bai, Chi Jin

Keywords Paper

Reinforcement Learning - Theory

0

0

0

0

14:28

18/07/2021

Modelling Behavioural Diversity for Learning in Open-Ended Games

Nicolas Perez-Nieves, Yaodong Yang, Oliver Slumbers and
David Mguni, Ying Wen, Jun Wang

Keywords Paper

Theory, Game Theory and Computational Economics

0

0

0

0

17:06

18/07/2021

Online Learning in Unknown Markov Games

Yi Tian, Yuanhao Wang, Tiancheng Yu, Suvrit Sra

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

5:13

06/12/2021

Cooperative Stochastic Bandits with Asynchronous Agents and Constrained Feedback

Lin Yang, Yu-Zhen Janice Chen, Stephen Pasteris and
Mohammad Hajiesmaili, John C. S. Lui, Don Towsley

Keywords Paper

bandits

0

0

0

0

12:07

04/08/2021

Online Learning with Simple Predictors and a Combinatorial Characterization of Minimax in 0/1 Games

Steve Hanneke, Roi Livni, Shay Moran

Keywords Paper

0

0

0

0

18:07

06/12/2020

Theory-Inspired Path-Regularized Differential Network Architecture Search

Pan Zhou, Caiming Xiong, Richard Socher, Steven Hoi

Keywords Paper

0

0

0

0

3:18

06/12/2021

Subgame solving without common knowledge

Brian Zhang, Tuomas Sandholm

Keywords Paper

0

0

0

0

14:40

18/07/2021

Incentivized Bandit Learning with Self-Reinforcing User Preferences

Tianchen Zhou, Jia Liu, Chaosheng Dong, jingyuan deng

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

4:46

06/12/2021

Conic Blackwell Algorithm: Parameter-Free Convex-Concave Saddle-Point Solving

Julien Grand-Clément, Christian Kroer

Keywords Paper

optimization

0

0

0

0

14:29

26/08/2020

Decentralized Multi-player Multi-armed Bandits with No Collision Information

Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Keywords Paper

0

0

0

0

14:18

12/07/2020

(Locally) Differentially Private Combinatorial Semi-Bandits

Xiaoyu Chen, Kai Zheng, Zixin Zhou and
Yunchang Yang, Wei Chen, Liwei Wang

Keywords Paper

Privacy-preserving Statistics and Machine Learning

0

0

0

0

12:39

18/07/2021

A Sharp Analysis of Model-based Reinforcement Learning with Self-Play

Qinghua Liu, Tiancheng Yu, Yu Bai, Chi Jin

Keywords Paper

Algorithms, Multitask and Transfer Learning, Algorithms, Meta-Learning; Applications, Object Recognition; Data, Challenges, Implementations, and Software, Benchmarks;, Theory, RL, Decisions and Control Theory

0

0

0

0

4:49