Small Nash Equilibrium Certificates in Very Large Games

06/12/2020

Small Nash Equilibrium Certificates in Very Large Games

Brian Zhang, Tuomas Sandholm

Keywords:

Abstract Paper Similar Papers

Abstract: In many game settings, the game is not explicitly given but is only accessible by playing it. While there have been impressive demonstrations in such settings, prior techniques have not offered safety guarantees, that is, guarantees on the game-theoretic exploitability of the computed strategies. In this paper we introduce an approach that shows that it is possible to provide exploitability guarantees in such settings without ever exploring the entire game. We introduce a notion of a certificatae of an extensive-form approximate Nash equilibrium. For verifying a certificate, we give an algorithm that runs in time linear in the size of the certificate rather than the size of the whole game. In zero-sum games, we further show that an optimal certificate---given the exploration so far---can be computed with any standard game-solving algorithm (e.g., using a linear program or counterfactual regret minimization). However, unlike in the cases of normal form or perfect information, we show that certain families of extensive-form games do not have small approximate certificates, even after making extremely nice assumptions on the structure of the game. Despite this difficulty, we find experimentally that very small certificates, even exact ones, often exist in large and even in infinite games. Overall, our approach enables one to try one's favorite exploration strategies while offering exploitability guarantees, thereby decoupling the exploration strategy from the equilibrium-finding process.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Finding and Certifying (Near-)Optimal Strategies in Black-Box Extensive-Form Games

Brian Hu Zhang, Tuomas Sandholm

Keywords Paper

0

0

0

0

15:00

06/12/2021

Solving Min-Max Optimization with Hidden Structure via Gradient Descent Ascent

Emmanouil-Vasileios Vlatakis-Gkaragkounis, Lampros Flokas, Georgios Piliouras

Keywords Paper

optimization, generative model

0

0

0

0

8:51

19/08/2021

Temporal Induced Self-Play for Stochastic Bayesian Games

Weizhe Chen, Zihan Zhou, Yi Wu, Fei Fang

Keywords Paper

Agent-based and Multi-agent Systems, Multi-agent Learning, Applications of Reinforcement Learning

0

0

0

0

11:52

09/07/2020

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Qiaomin Xie, Yudong Chen, Zhaoran Wang, Zhuoran Yang

Keywords Paper

Reinforcement learning, Planning and control

0

0

0

0

15:16

04/08/2021

Learning in Matrix Games can be Arbitrarily Complex

Gabriel P Andrade, Rafael Frongillo, Georgios Piliouras

Keywords Paper

0

0

0

0

14:59

02/02/2021

If You Like Shapley Then You’ll Love the Core

Tom Yan, Ariel D. Procaccia

Keywords Paper

0

0

0

0

16:01

06/12/2021

Subgame solving without common knowledge

Brian Zhang, Tuomas Sandholm

Keywords Paper

0

0

0

0

14:40

18/07/2021

Robust Learning-Augmented Caching: An Experimental Study

Jakub Chłędowski, Adam Polak, Bartosz Szabucki, Konrad Zolna

Keywords Paper

Applications

0

0

0

0

4:52

12/07/2020

Randomization matters How to defend against strong adversarial attacks

Rafael Pinot, Raphael Ettedgui, Geovani Rizk and
Yann Chevaleyre, Jamal Atif

Keywords Paper

Adversarial Examples

0

0

0

0

15:20

13/04/2021

A limited-capacity minimax theorem for non-convex games or: How i learned to stop worrying about mixed-nash and love neural nets

Gauthier Gidel, David Balduzzi, Wojciech Czarnecki and
Marta Garnelo, Yoram Bachrach

Keywords Paper

0

0

0

0

2:51

14/09/2020

Escaping Saddle Points of Empirical Risk Privately and Scalably via DP-Trust Region Method

Di Wang, Jinhui Xu

Keywords Paper

differential privacy, empirical risk minimization, private machine learning

0

0

0

0

15:13

12/07/2020

A Distributional Framework For Data Valuation

Amirata Ghorbani, Michael Kim, James Zou

Keywords Paper

Learning Theory

0

0

0

0

14:15

12/07/2020

Invariant Risk Minimization Games

Kartik Ahuja, Karthikeyan Shanmugam, Kush Varshney, Amit Dhurandhar

Keywords Paper

Causality

0

0

0

0

14:57

19/08/2021

Learning Implicitly with Noisy Data in Linear Arithmetic

Alexander Rader, Ionela G Mocanu, Vaishak Belle, Brendan Juba

Keywords Paper

Constraints and SAT, Constraints and Data Mining; Constraints and Machine Learning

0

0

0

0

15:43

18/07/2021

Learning in Nonzero-Sum Stochastic Games with Potentials

David Mguni, Yutong Wu, Yali Du and
Yaodong Yang, Ziyi Wang, M. Li, Ying Wen, Joel Jennings, Jun Wang

Keywords Paper

Theory, Game Theory and Computational Economics

0

0

0

0

5:36

06/12/2020

Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration

Andrea Zanette, Alessandro Lazaric, Mykel J Kochenderfer, Emma Brunskill

Keywords Paper

0

0

0

0

3:11

06/12/2020

Causal Shapley Values: Exploiting Causal Knowledge to Explain Individual Predictions of Complex Models

Tom Heskes, Evi Sijben, Ioan Gabriel Bucur, Tom Claassen

Keywords Paper

0

0

0

0

3:07

04/08/2021

Online Learning with Simple Predictors and a Combinatorial Characterization of Minimax in 0/1 Games

Steve Hanneke, Roi Livni, Shay Moran

Keywords Paper

0

0

0

0

18:07

12/07/2020

Provable Self-Play Algorithms for Competitive Reinforcement Learning

Yu Bai, Chi Jin

Keywords Paper

Reinforcement Learning - Theory

0

0

0

0

14:28

02/02/2021

Computing Quantal Stackelberg Equilibrium in Extensive-Form Games

Jakub Černý, Viliam Lisý, Branislav Bošanský, Bo An

Keywords Paper

0

0

0

0

15:01

12/07/2020

Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games

Youzhi Zhang, Bo An

Keywords Paper

Learning Theory

0

0

0

0

15:52

26/10/2020

Certified Unsolvability for SAT Planning with Property Directed Reachability

Salomé Eriksson, Malte Helmert

Keywords Paper

classical planning, unsolvability, satisfiability, SAT, certifying algorithms, certificate

0

0

0

0

10:20

18/07/2021

A Sharp Analysis of Model-based Reinforcement Learning with Self-Play

Qinghua Liu, Tiancheng Yu, Yu Bai, Chi Jin

Keywords Paper

Algorithms, Multitask and Transfer Learning, Algorithms, Meta-Learning; Applications, Object Recognition; Data, Challenges, Implementations, and Software, Benchmarks;, Theory, RL, Decisions and Control Theory

0

0

0

0

4:49

26/04/2020

Double Neural Counterfactual Regret Minimization

Hui Li, Kailiang Hu, Shaohua Zhang and
Yuan Qi, Le Song

Keywords Paper

Counterfactual Regret Minimization, Imperfect Information game, Neural Strategy, Deep Learning, Robust Sampling

0

0

0

0

4:49

06/12/2021

Equilibrium Refinement for the Age of Machines: The One-Sided Quasi-Perfect Equilibrium

Gabriele Farina, Tuomas Sandholm

Keywords Paper

0

0

0

0

16:07

06/12/2020

Certifying Strategyproof Auction Networks

Michael Curry, Ping-yeh Chiang, Tom Goldstein, John Dickerson

Keywords Paper

0

0

0

0

3:22

06/12/2020

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs

Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei, Mengxiao Zhang

Keywords Paper

0

0

0

0

3:16

06/12/2020

No-Regret Learning and Mixed Nash Equilibria: They Do Not Mix

Manolis Vlatakis-Gkaragkounis, Lampros Flokas, Thanasis Lianeas and
Panayotis Mertikopoulos, Georgios Piliouras

Keywords Paper

Algorithms -> Semi-Supervised Learning; Applications -> Computer Vision; Deep Learning, Applications -> Computational Photography

0

0

0

0

3:10

04/08/2021

Adversarially Robust Low Dimensional Representations

Pranjal Awasthi, Vaggos Chatziafratis, Xue Chen, Aravindan Vijayaraghavan

Keywords Paper

0

0

0

0

20:19

12/08/2020

SmartVerif: Push the Limit of Automation Capability of Verifying Security Protocols by Dynamic Strategies

Yan Xiong, Cheng Su, Wenchao Huang and
Fuyou Miao, Wansen Wang, Hengyi Ouyang

Keywords Paper

0

0

0

0

11:18

03/05/2021

On the Impossibility of Global Convergence in Multi-Loss Optimization

Alistair Letcher

Keywords Paper

convergence, descent, gradient, multi-player, global, impossibility, multi-loss, optimization, multi-agent

0

0

0

0

5:23

06/12/2020

Secretary and Online Matching Problems with Machine Learned Advice

Antonios Antoniadis, Themis Gouleakis, Pieter Kleer, Pavel Kolev

Keywords Paper

0

0

0

0

3:27

02/02/2021

Programmatic Strategies for Real-Time Strategy Games

Julian R. H. Mariño, Rubens O. Moraes, Tassiana C. Oliveira and
Claudio Toledo, Levi H. S. Lelis

Keywords Paper

0

0

0

0

19:22

06/12/2021

Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality

Stefanos Leonardos, Georgios Piliouras, Kelly Spendlove

Keywords Paper

reinforcement learning and planning

0

0

0

0

14:11

02/02/2021

Why Adversarial Interaction Creates Non-Homogeneous Patterns: A Pseudo-Reaction-Diffusion Model for Turing Instability

Litu Rout

Keywords Paper

0

0

0

0

18:23

26/04/2020

The Gambler's Problem and Beyond

Baoxiang Wang, Shuai Li, Jiajin Li, Siu On Chan

Keywords Paper

the gambler's problem, reinforcement learning, fractal, self-similarity, Bellman equation

0

0

0

0

5:23

06/12/2021

Bellman-consistent Pessimism for Offline Reinforcement Learning

Tengyang Xie, Ching-An Cheng, Nan Jiang and
Paul Mineiro, Alekh Agarwal

Keywords Paper

theory, reinforcement learning and planning, robustness

0

0

0

0

17:42

06/12/2021

Online Learning in Periodic Zero-Sum Games

Tanner Fiez, Ryann Sim, Stratis Skoulakis and
Georgios Piliouras, Lillian Ratliff

Keywords Paper

theory, robustness, online learning

0

0

0

0

12:44

12/07/2020

Training Binary Neural Networks using the Bayesian Learning Rule

Xiangming Meng, Roman Bachmann, Mohammad Emtiyaz Khan

Keywords Paper

Deep Learning - General

0

0

0

0

10:27

12/07/2020

From Chaos to Order: Symmetry and Conservation Laws in Game Dynamics

Sai Ganesh Nagarajan, David Balduzzi, Georgios Piliouras

Keywords Paper

Learning Theory

0

0

0

0

15:35