Cooperative and Stochastic Multi-Player Multi-Armed Bandit: Optimal Regret With Neither Communication Nor Collisions

04/08/2021

Mark Sellke, Sebastien Bubeck, Thomas Budzinski

Abstract: We consider the cooperative multi-player version of the stochastic multi-armed bandit problem. We study the regime where the players cannot communicate but have access to shared randomness. In prior work by Bubeck and Budzinski, a strategy for this regime was constructed for two players and three arms, with regret \tilde{O}(\sqrt{T}) and with no collisions at all between the players (with very high probability). In this paper we show that these properties (near-optimal regret and no collisions at all) are achievable for any number of players and arms. At a high level, the previous strategy relied heavily on a 2-dimensional geometric intuition that was difficult to generalize to higher dimensions, while here we take a more combinatorial route to build the new strategy.

Talk and the respective paper are published at the COLT 2021 virtual conference.

Similar Papers

09/07/2020

Coordination without communication: optimal regret in two players multi-armed bandits

Sebastien Bubeck, Thomas Budzinski

Keywords: Bandit problems

14:56

26/08/2020

Optimal Algorithms for Multiplayer Multi-Armed Bandits

Po-An Wang, Alexandre Proutiere, Kaito Ariu, Yassir Jedra, Alessio Russo

11:56

09/07/2020

Non-Stochastic Multi-Player Multi-Armed Bandits: Optimal Rate With Collision Information, Sublinear Without

Sebastien Bubeck, Yuanzhi Li, Yuval Peres, Mark Sellke

Keywords: Bandit problems

10:13

02/02/2021

On the Approximation of Nash Equilibria in Sparse Win-Lose Multi-player Games

Zhengyang Liu, Jiawei Li, Xiaotie Deng

16:33

12/07/2020

My Fair Bandit: Distributed Learning of Max-Min Fairness with Multi-player Bandits

Ilai Bistritz, Tavor Baharav, Amir Leshem, Nicholas Bambos

Keywords: Online Learning, Active Learning, and Bandits

15:04

26/08/2020

A Practical Algorithm for Multiplayer Bandits when Arm Means Vary Among Players

Abbas Mehrabian, Etienne Boursier, Emilie Kaufmann, Vianney Perchet

15:32

06/12/2020

Joint Policy Search for Multi-agent Collaboration with Imperfect Information

Yuandong Tian, Qucheng Gong, Yu Jiang

3:32

09/07/2020

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Qiaomin Xie, Yudong Chen, Zhaoran Wang, Zhuoran Yang

Keywords: Reinforcement learning, Planning and control

15:16

26/04/2020

Double Neural Counterfactual Regret Minimization

Hui Li, Kailiang Hu, Shaohua Zhang, Yuan Qi, Le Song

Keywords: Counterfactual Regret Minimization, Imperfect Information game, Neural Strategy, Deep Learning, Robust Sampling

4:49

06/12/2021

Equilibrium Refinement for the Age of Machines: The One-Sided Quasi-Perfect Equilibrium

Gabriele Farina, Tuomas Sandholm

16:07

03/05/2021

Chaos of Learning Beyond Zero-sum and Coordination via Game Decompositions

Yun Kuen Cheung, Yixin Tao

Keywords: Dynamical Systems, Volume Analysis, Follow-the-Regularized-Leader, Multiplicative Weights Update, Game Decomposition, Lyapunov Chaos, Learning in Games

3:53

12/07/2020

Randomization matters. How to defend against strong adversarial attacks

Rafael Pinot, Raphael Ettedgui, Geovani Rizk, Yann Chevaleyre, Jamal Atif

Keywords: Adversarial Examples

15:20

18/07/2021

Modelling Behavioural Diversity for Learning in Open-Ended Games

Nicolas Perez-Nieves, Yaodong Yang, Oliver Slumbers, David Mguni, Ying Wen, Jun Wang

Keywords: Theory, Game Theory and Computational Economics

17:06

06/12/2021

Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning

Yiqin Yang, Xiaoteng Ma, Li Chenghao, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang, Qianchuan Zhao

Keywords: reinforcement learning and planning

13:36

18/07/2021

Mixed Nash Equilibria in the Adversarial Examples Game

Laurent Meunier, Meyer Scetbon, Rafael Pinot, Jamal Atif, Yann Chevaleyre

Keywords: Algorithms, Adversarial Examples

5:30

06/12/2020

Learning to Play No-Press Diplomacy with Best Response Policy Iteration

Thomas Anthony, Tom Eccles, Andrea Tacchetti, János Kramár, Ian Gemp, Thomas Hudson, Nicolas Porcel, Marc Lanctot, Julien Perolat, Richard Everett, Satinder Singh, Thore Graepel, Yoram Bachrach

3:23

26/04/2020

Mixup Inference: Better Exploiting Mixup to Defend Adversarial Attacks

Tianyu Pang, Kun Xu, Jun Zhu

Keywords: Trustworthy Machine Learning, Adversarial Robustness, Inference Principle, Mixup

4:59

06/12/2021

A Winning Hand: Compressing Deep Networks Can Improve Out-of-Distribution Robustness

James Diffenderfer, Brian Bartoldson, Shreya Chaganti, Jize Zhang, Bhavya Kailkhura

Keywords: deep learning, robustness

10:43

19/08/2021

Combining Tree Search and Action Prediction for State-of-the-Art Performance in DouDiZhu

Yunsheng Zhang, Dong Yan, Bei Shi, Haobo Fu, Qiang Fu, Hang Su, Jun Zhu, Ning Chen

Keywords: Machine Learning, Reinforcement Learning, Game Playing and Machine Learning

12:03

12/07/2020

Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games

Youzhi Zhang, Bo An

Keywords: Learning Theory

15:52

06/12/2020

No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

Andrea Celli, Alberto Marchesi, Gabriele Farina, Nicola Gatti

2:56

26/08/2020

The Gossiping Insert-Eliminate Algorithm for Multi-Agent Bandits

Ronshee Chawla, Abishek Sankararaman, Ayalvadi Ganesh, Sanjay Shakkottai

15:59

04/08/2021

Adaptive Learning in Continuous Games: Optimal Regret Bounds and Convergence to Nash Equilibrium

Yu-Guan Hsieh, Kimon Antonakopoulos, Panayotis Mertikopoulos

16:09

06/12/2020

From Finite to Countable-Armed Bandits

Anand Kalvit, Assaf Zeevi

Keywords: Theory -> Control Theory

3:15

18/07/2021

Trajectory Diversity for Zero-Shot Coordination

Andrei Lupu, Brandon Cui, Hengyuan Hu, Jakob Foerster

Keywords: Reinforcement Learning and Planning, Multi-Agent RL

5:16

06/12/2020

No-Regret Learning and Mixed Nash Equilibria: They Do Not Mix

Manolis Vlatakis-Gkaragkounis, Lampros Flokas, Thanasis Lianeas, Panayotis Mertikopoulos, Georgios Piliouras

Keywords: Algorithms -> Semi-Supervised Learning; Applications -> Computer Vision; Deep Learning, Applications -> Computational Photography

3:10

06/12/2021

Subgame solving without common knowledge

Brian Zhang, Tuomas Sandholm

14:40

02/02/2021

Convergence Analysis of No-Regret Bidding Algorithms in Repeated Auctions

Zhe Feng, Guru Guruganesh, Christopher Liaw, Aranyak Mehta, Abhishek Sethi

20:14

06/12/2021

Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games

Yu Bai, Chi Jin, Huan Wang, Caiming Xiong

Keywords: theory, reinforcement learning and planning, bandits

12:14

02/02/2021

Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback

Siwei Wang, Haoyun Wang, Longbo Huang

19:29

18/07/2021

Connecting Optimal Ex-Ante Collusion in Teams to Extensive-Form Correlation: Faster Algorithms and Positive Complexity Results

Gabriele Farina, Andrea Celli, Nicola Gatti, Tuomas Sandholm

Keywords: Theory, Game Theory and Computational Economics

5:13

06/12/2021

IQ-Learn: Inverse soft-Q Learning for Imitation

Divyansh Garg, Shuvam Chakraborty, Chris Cundy, Jiaming Song, Stefano Ermon

Keywords: optimization, reinforcement learning and planning, adversarial robustness and security

8:25

18/07/2021

Learning in Nonzero-Sum Stochastic Games with Potentials

David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, M. Li, Ying Wen, Joel Jennings, Jun Wang

Keywords: Theory, Game Theory and Computational Economics

5:36

22/06/2020

Extractors for adversarial sources via extremal hypergraphs

Eshan Chattopadhyay, Jesse Goodman, Vipul Goyal, Xin Li

Keywords: randomness extractors, non-malleable extractors, extremal hypergraphs, explicit constructions, cap sets, Ramsey graphs

28:16

02/02/2021

Computing Quantal Stackelberg Equilibrium in Extensive-Form Games

Jakub Černý, Viliam Lisý, Branislav Bošanský, Bo An

15:01

18/07/2021

A Sharp Analysis of Model-based Reinforcement Learning with Self-Play

Qinghua Liu, Tiancheng Yu, Yu Bai, Chi Jin

Keywords: Algorithms, Multitask and Transfer Learning, Algorithms, Meta-Learning; Applications, Object Recognition; Data, Challenges, Implementations, and Software, Benchmarks; Theory, RL, Decisions and Control Theory

4:49

18/07/2021

Sparsity-Agnostic Lasso Bandit

Min-hwan Oh, Garud Iyengar, Assaf Zeevi

Keywords: Reinforcement Learning and Planning, Bandits

5:02

02/02/2021

Signaling in Bayesian Network Congestion Games: the Subtle Power of Symmetry

Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Nicola Gatti

15:03

06/12/2021

One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

Udari Madhushani, Abhimanyu Dubey, Naomi Leonard, Alex Pentland

Keywords: bandits

15:01

06/12/2021

Decentralized Q-learning in Zero-sum Markov Games

Muhammed Sayin, Kaiqing Zhang, David Leslie, Tamer Basar, Asuman Ozdaglar

Keywords: reinforcement learning and planning

15:07