Learning to Play No-Press Diplomacy with Best Response Policy Iteration

06/12/2020

Thomas Anthony, Tom Eccles, Andrea Tacchetti, János Kramár, Ian Gemp, Thomas Hudson, Nicolas Porcel, Marc Lanctot, Julien Perolat, Richard Everett, Satinder Singh, Thore Graepel, Yoram Bachrach

Abstract: Recent advances in deep reinforcement learning (RL) have led to considerable progress in many 2-player zero-sum games, such as Go, Poker and StarCraft. The purely adversarial nature of such games allows for conceptually simple and principled application of RL methods. However, real-world settings are many-agent, and agent interactions are complex mixtures of common-interest and competitive aspects. We consider Diplomacy, a 7-player board game designed to accentuate dilemmas resulting from many-agent interactions. It also features a large combinatorial action space and simultaneous moves, which are challenging for RL algorithms. We propose a simple yet effective approximate best response operator, designed to handle large combinatorial action spaces and simultaneous moves. We also introduce a family of policy iteration methods that approximate fictitious play. With these methods, we successfully apply RL to Diplomacy: we show that our agents convincingly outperform the previous state-of-the-art, and game-theoretic equilibrium analysis shows that the new process yields consistent improvements.
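For readers unfamiliar with fictitious play, the idea the abstract says the policy iteration methods approximate, the following is a minimal sketch of the classic tabular version on a two-player zero-sum matrix game. It is not the paper's algorithm: the function name `fictitious_play`, the constant `ROCK_PAPER_SCISSORS`, the initial action choice, and the tie-breaking rule are all assumptions made for illustration, and exact best responses like these are not available in a game as large as Diplomacy.

```python
# Row player's payoffs for rock-paper-scissors (illustrative game, not from the paper).
# Action order: rock, paper, scissors. The column player receives the negation.
ROCK_PAPER_SCISSORS = [
    [0, -1, 1],
    [1, 0, -1],
    [-1, 1, 0],
]


def fictitious_play(payoff, iterations):
    """Classic fictitious play on a two-player zero-sum matrix game.

    Each round, every player plays an exact best response to the
    opponent's empirical action counts so far. Returns the empirical
    action frequencies of the row and column players.
    """
    n_rows, n_cols = len(payoff), len(payoff[0])
    row_counts = [0] * n_rows
    col_counts = [0] * n_cols
    # Assumption of this sketch: both players start by playing action 0.
    row_counts[0] += 1
    col_counts[0] += 1

    for _ in range(iterations):
        # Payoff of each row action against the column player's history.
        row_values = [
            sum(payoff[i][j] * col_counts[j] for j in range(n_cols))
            for i in range(n_rows)
        ]
        # Payoff of each column action against the row player's history.
        col_values = [
            -sum(payoff[i][j] * row_counts[i] for i in range(n_rows))
            for j in range(n_cols)
        ]
        # Ties are broken in favour of the lowest action index.
        best_row = max(range(n_rows), key=row_values.__getitem__)
        best_col = max(range(n_cols), key=col_values.__getitem__)
        row_counts[best_row] += 1
        col_counts[best_col] += 1

    total = iterations + 1  # one initial action plus one per iteration
    return [c / total for c in row_counts], [c / total for c in col_counts]


if __name__ == "__main__":
    row_freq, col_freq = fictitious_play(ROCK_PAPER_SCISSORS, 10000)
    print(row_freq, col_freq)
```

In this small game the empirical frequencies should drift toward the uniform equilibrium strategy. The abstract's approximate best response operator can be read as a replacement for the exact `max` step above, needed because Diplomacy's combinatorial action space makes exhaustive enumeration impractical.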

The talk and the accompanying paper were published at the NeurIPS 2020 virtual conference.

Similar Papers

12/07/2020

From Chaos to Order: Symmetry and Conservation Laws in Game Dynamics

Sai Ganesh Nagarajan, David Balduzzi, Georgios Piliouras

Keywords: Learning Theory

15:35

18/07/2021

Learning in Nonzero-Sum Stochastic Games with Potentials

David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, M. Li, Ying Wen, Joel Jennings, Jun Wang

Keywords: Theory, Game Theory and Computational Economics

5:36

04/08/2021

Learning in Matrix Games can be Arbitrarily Complex

Gabriel P Andrade, Rafael Frongillo, Georgios Piliouras

14:59

19/08/2021

Combining Tree Search and Action Prediction for State-of-the-Art Performance in DouDiZhu

Yunsheng Zhang, Dong Yan, Bei Shi, Haobo Fu, Qiang Fu, Hang Su, Jun Zhu, Ning Chen

Keywords: Machine Learning, Reinforcement Learning, Game Playing and Machine Learning

12:03

06/12/2021

Equilibrium Refinement for the Age of Machines: The One-Sided Quasi-Perfect Equilibrium

Gabriele Farina, Tuomas Sandholm

16:07

03/05/2021

Human-Level Performance in No-Press Diplomacy via Equilibrium Search

Jonathan Gray, Adam Lerer, Anton Bakhtin, Noam Brown

Keywords: reinforcement learning, game theory, no-regret learning, regret minimization, multi-agent systems

15:10

18/07/2021

Modelling Behavioural Diversity for Learning in Open-Ended Games

Nicolas Perez-Nieves, Yaodong Yang, Oliver Slumbers, David Mguni, Ying Wen, Jun Wang

Keywords: Theory, Game Theory and Computational Economics

17:06

06/12/2021

Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games

Yu Bai, Chi Jin, Huan Wang, Caiming Xiong

Keywords: theory, reinforcement learning and planning, bandits

12:14

26/04/2020

Double Neural Counterfactual Regret Minimization

Hui Li, Kailiang Hu, Shaohua Zhang, Yuan Qi, Le Song

Keywords: Counterfactual Regret Minimization, Imperfect Information game, Neural Strategy, Deep Learning, Robust Sampling

4:49

06/12/2020

Joint Policy Search for Multi-agent Collaboration with Imperfect Information

Yuandong Tian, Qucheng Gong, Yu Jiang

3:32

02/02/2021

Safe Search for Stackelberg Equilibria in Extensive-Form Games

Chun Kai Ling, Noam Brown

18:49

12/09/2020

Nondeterministic Strategies and their Refinement in Strategy Logic

Giuseppe De Giacomo, Bastien Maubert, Aniello Murano

Keywords: Reasoning about actions and change, action languages-General

15:05

03/05/2021

Chaos of Learning Beyond Zero-sum and Coordination via Game Decompositions

Yun Kuen Cheung, Yixin Tao

Keywords: Dynamical Systems, Volume Analysis, Follow-the-Regularized-Leader, Multiplicative Weights Update, Game Decomposition, Lyapunov Chaos, Learning in Games

3:53

02/02/2021

Evolutionary Game Theory Squared: Evolving Agents in Endogenously Evolving Zero-Sum Games

Stratis Skoulakis, Tanner Fiez, Ryann Sim, Georgios Piliouras, Lillian Ratliff

20:14

02/02/2021

On the Approximation of Nash Equilibria in Sparse Win-Lose Multi-player Games

Zhengyang Liu, Jiawei Li, Xiaotie Deng

16:33

12/07/2020

OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning

Alexander Vezhnevets, Yuhuai Wu, Maria Eckstein, Rémi Leblond, Joel Z Leibo

Keywords: Reinforcement Learning - Deep RL

14:17

18/07/2021

Estimating $\alpha$-Rank from A Few Entries with Low Rank Matrix Completion

Yali Du, Xue Yan, Xu Chen, Jun Wang, Haifeng Zhang

Keywords: Optimization, Probabilistic Methods, Distributed Inference, Algorithms, Algorithms Evaluation

4:52

09/07/2020

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Qiaomin Xie, Yudong Chen, Zhaoran Wang, Zhuoran Yang

Keywords: Reinforcement learning, Planning and control

15:16

02/02/2021

Hindsight and Sequential Rationality of Correlated Play

Dustin Morrill, Ryan D'Orazio, Reca Sarfati, Marc Lanctot, James R Wright, Amy R Greenwald, Michael Bowling

18:34

26/04/2020

A Generalized Training Approach for Multiagent Learning

Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Perolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Remi Munos

Keywords: multiagent learning, game theory, training, games

14:11

26/04/2020

Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning

Hengyuan Hu, Jakob N Foerster

Keywords: multi-agent RL, theory of mind

5:20

06/12/2021

Neural Auto-Curricula in Two-Player Zero-Sum Games

Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang

Keywords: deep learning, optimization, reinforcement learning and planning, meta learning

14:46

06/12/2021

No-Press Diplomacy from Scratch

Anton Bakhtin, David Wu, Adam Lerer, Noam Brown

Keywords: theory, reinforcement learning and planning

12:37

02/02/2021

Coalition Formation in Multi-defender Security Games

Dolev Mutzari, Jiarui Gan, Sarit Kraus

18:11

08/07/2020

Refinement-Based Game Semantics for Certified Abstraction Layers

Jérémie Koenig, Zhong Shao

Keywords: algebraic effects, interaction specification, certified abstraction layers, game semantics, strategy specification, program refinement, dual nondeterminism

25:15

06/12/2020

Learning Strategy-Aware Linear Classifiers

Yiling Chen, Yang Liu, Chara Podimata

3:15

12/07/2020

Provable Self-Play Algorithms for Competitive Reinforcement Learning

Yu Bai, Chi Jin

Keywords: Reinforcement Learning - Theory

14:28

03/05/2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Du, Yu Wang, Yi Wu

Keywords: reward randomization, strategic behavior, diverse strategies, multi-agent reinforcement learning

2:40

02/02/2021

Convergence Analysis of No-Regret Bidding Algorithms in Repeated Auctions

Zhe Feng, Guru Guruganesh, Christopher Liaw, Aranyak Mehta, Abhishek Sethi

20:14

18/07/2021

A Sharp Analysis of Model-based Reinforcement Learning with Self-Play

Qinghua Liu, Tiancheng Yu, Yu Bai, Chi Jin

Keywords: Algorithms, Multitask and Transfer Learning; Algorithms, Meta-Learning; Applications, Object Recognition; Data, Challenges, Implementations, and Software, Benchmarks; Theory, RL, Decisions and Control Theory

4:49

18/07/2021

Provably Efficient Algorithms for Multi-Objective Competitive RL

Tiancheng Yu, Yi Tian, Jingzhao Zhang, Suvrit Sra

Keywords: Theory, RL, Decisions and Control Theory

17:04

06/12/2020

A Game Theoretic Analysis of Additive Adversarial Attacks and Defenses

Ambar Pal, Rene Vidal

3:19

18/07/2021

SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II

Xiangjun Wang, Junxiao SONG, Penghui Qi, Peng Peng, Zhenkun Tang, Wei Zhang, Weimin Li, Xiongjun Pi, Jujie He, Chao Gao, Haitao Long, Quan Yuan

Keywords: Reinforcement Learning and Planning, Deep RL

5:11

06/12/2020

Calibration of Shared Equilibria in General Sum Partially Observable Markov Games

Nelson Vadori, Sumitra Ganesh, Prashant Reddy, Manuela Veloso

3:18

06/12/2021

Stateful Strategic Regression

Keegan Harris, Hoda Heidari, Steven Wu

Keywords: optimization

14:02

06/12/2021

Exploiting Opponents Under Utility Constraints in Sequential Games

Martino Bernasconi-de-Luca, Federico Cacciamani, Simone Fioravanti, Nicola Gatti, Alberto Marchesi, Francesco Trovò

Keywords: online learning

12:59

18/07/2021

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Tarun Gupta, Anuj Mahajan, Bei Peng, Wendelin Boehmer, Shimon Whiteson

Keywords: Reinforcement Learning and Planning, Multi-Agent RL

5:23

13/04/2021

Reinforcement learning for mean field games with strategic complementarities

Kiyeob Lee, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai

2:57

04/08/2021

Cooperative and Stochastic Multi-Player Multi-Armed Bandit: Optimal Regret With Neither Communication Nor Collisions

Mark Sellke, Sebastien Bubeck, Thomas Budzinski

9:25

06/12/2021

Scalable Online Planning via Reinforcement Learning Fine-Tuning

Arnaud Fickinger, Hengyuan Hu, Brandon Amos, Stuart Russell, Noam Brown

Keywords: deep learning, reinforcement learning and planning

8:04