Estimating α-Rank by Maximizing Information Gain

02/02/2021

Tabish Rashid, Cheng Zhang, Kamil Ciosek

Abstract: Game theory has been increasingly applied in settings where the game is not known outright, but has to be estimated by sampling. For example, meta-games that arise in multi-agent evaluation can only be accessed by running a succession of expensive experiments that may involve simultaneous deployment of several agents. In this paper, we focus on α-rank, a popular game-theoretic solution concept designed to perform well in such scenarios. We aim to estimate the α-rank of the game using as few samples as possible. Our algorithm maximizes information gain between an epistemic belief over the α-ranks and the observed payoff. This approach has two main benefits. First, it allows us to focus our sampling on the entries that matter the most for identifying the α-rank. Second, the Bayesian formulation provides a facility to build in modeling assumptions by using a prior over game payoffs. We show the benefits of using information gain as compared to the confidence interval criterion of ResponseGraphUCB, and provide theoretical results justifying our method.
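As a rough illustration of the information-gain criterion described in the abstract, the following toy sketch picks which payoff entry to sample next. It is not the authors' algorithm: it assumes a finite set of hypothetical payoff hypotheses, Bernoulli payoff observations, and a discrete label per hypothesis that stands in for the α-rank. All function and variable names are illustrative assumptions.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def label_distribution(weights, labels):
    """Marginal belief over labels, given belief weights over hypotheses."""
    dist = {}
    for w, lab in zip(weights, labels):
        dist[lab] = dist.get(lab, 0.0) + w
    return list(dist.values())


def information_gain(weights, labels, win_probs):
    """Mutual information between the label and one Bernoulli observation.

    win_probs[h] is P(outcome = 1 | hypothesis h) for the entry considered.
    """
    prior_entropy = entropy(label_distribution(weights, labels))
    expected_posterior_entropy = 0.0
    for outcome in (0, 1):
        likelihoods = [p if outcome == 1 else 1.0 - p for p in win_probs]
        joint = [w * lik for w, lik in zip(weights, likelihoods)]
        p_outcome = sum(joint)
        if p_outcome == 0:
            continue  # this outcome cannot occur under the current belief
        posterior = [j / p_outcome for j in joint]
        expected_posterior_entropy += p_outcome * entropy(
            label_distribution(posterior, labels)
        )
    return prior_entropy - expected_posterior_entropy


def best_entry(weights, labels, entry_win_probs):
    """Return the entry with the largest information gain, plus all gains."""
    gains = {
        entry: information_gain(weights, labels, probs)
        for entry, probs in entry_win_probs.items()
    }
    return max(gains, key=gains.get), gains


# Hypothetical setup: two equally likely hypotheses with different labels.
weights = [0.5, 0.5]
labels = ["ranking_A", "ranking_B"]
entry_win_probs = {
    "x": [0.9, 0.1],  # the hypotheses disagree strongly about this entry
    "y": [0.5, 0.5],  # the hypotheses agree, so sampling it teaches nothing
}
best, gains = best_entry(weights, labels, entry_win_probs)
print(best, gains)
```

In this toy setup, entry "x" is selected because observing it separates the two hypotheses, while entry "y" has zero information gain. This mirrors the abstract's point that sampling should focus on the entries that matter most for identifying the ranking.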

The video of this talk is available at:

https://slideslive.com/38949265

The talk and the corresponding paper were published at the AAAI 2021 virtual conference.

Similar Papers

12/07/2020

A Distributional Framework For Data Valuation

Amirata Ghorbani, Michael Kim, James Zou

Keywords: Learning Theory

14:15

06/12/2021

Stateful Strategic Regression

Keegan Harris, Hoda Heidari, Steven Wu

Keywords: optimization

14:02

06/12/2021

Adaptive Online Packing-guided Search for POMDPs

Chenyang Wu, Guoyu Yang, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu, Jianye Hao

13:30

18/07/2021

Provably Efficient Algorithms for Multi-Objective Competitive RL

Tiancheng Yu, Yi Tian, Jingzhao Zhang, Suvrit Sra

Keywords: Theory, RL, Decisions and Control Theory

17:04

02/02/2021

From Behavioral Theories to Econometrics: Inferring Preferences of Human Agents from Data on Repeated Interactions

Gali Noti

20:02

06/12/2020

Evaluating and Rewarding Teamwork Using Cooperative Game Abstractions

Tom Yan, Christian Kroer, Alexander Peysakhovich

3:09

18/07/2021

Estimating α-Rank from A Few Entries with Low Rank Matrix Completion

Yali Du, Xue Yan, Xu Chen, Jun Wang, Haifeng Zhang

Keywords: Optimization, Probabilistic Methods, Distributed Inference, Algorithms, Algorithms Evaluation

4:52

13/04/2021

Efficient computation and analysis of distributional Shapley values

Yongchan Kwon, Manuel A. Rivas, James Zou

2:43

03/05/2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Du, Yu Wang, Yi Wu

Keywords: reward randomization, strategic behavior, diverse strategies, multi-agent reinforcement learning

2:40

06/12/2021

EDGE: Explaining Deep Reinforcement Learning Policies

Wenbo Guo, Xian Wu, Usmann Khan, Xinyu Xing

Keywords: reinforcement learning and planning, adversarial robustness and security, generative model, kernel methods, interpretability

12:16

19/10/2020

Match tracing: A unified framework for real-time win prediction and quantifiable performance evaluation

Kai Wang, Hao Li, Linxia Gong, Jianrong Tao, Runze Wu, Changjie Fan, Liang Chen, Peng Cui

Keywords: machine learning, performance evaluation, win prediction, sports analytics, online games

9:55

12/07/2020

Learning From Strategic Agents: Accuracy, Improvement, and Causality

Yonadav Shavit, Benjamin Edelman, Brian Axelrod

Keywords: Accountability, Transparency and Interpretability

12:37

19/08/2021

Approximating the Shapley Value Using Stratified Empirical Bernstein Sampling

Mark A. Burgess, Archie C. Chapman

Keywords: Agent-based and Multi-agent Systems, Cooperative Games, Uncertainty Representations

11:46

02/02/2021

Convergence Analysis of No-Regret Bidding Algorithms in Repeated Auctions

Zhe Feng, Guru Guruganesh, Christopher Liaw, Aranyak Mehta, Abhishek Sethi

20:14

18/07/2021

Understanding and Mitigating Accuracy Disparity in Regression

Jianfeng Chi, Yuan Tian, Geoff Gordon, Han Zhao

Keywords: Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

5:17

12/07/2020

Predicting deliberative outcomes

Vikas Garg, Tommi Jaakkola

Keywords: Sequential, Network, and Time-Series Modeling

10:06

06/12/2021

Test-time Collective Prediction

Celestine Mendler-Dünner, Wenshuo Guo, Stephen Bates, Michael Jordan

Keywords: machine learning, federated learning

14:10

18/07/2021

Mixed Nash Equilibria in the Adversarial Examples Game

Laurent Meunier, Meyer Scetbon, Rafael Pinot, Jamal Atif, Yann Chevaleyre

Keywords: Algorithms, Adversarial Examples

5:30

06/12/2021

Continuous Mean-Covariance Bandits

Yihan Du, Siwei Wang, Zhixuan Fang, Longbo Huang

Keywords: bandits

11:33

06/12/2021

Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games

Yu Bai, Chi Jin, Huan Wang, Caiming Xiong

Keywords: theory, reinforcement learning and planning, bandits

12:14

18/07/2021

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation

Scott Fujimoto, David Meger, Doina Precup

Keywords: Reinforcement Learning and Planning, Deep RL

3:50

18/07/2021

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games

Dustin Morrill, Ryan D'Orazio, Marc Lanctot, James Wright, Michael Bowling, Amy Greenwald

Keywords: Theory, Game Theory and Computational Economics

5:12

06/12/2020

Contextual Games: Multi-Agent Learning with Side Information

Pier Giuseppe Sessa, Ilija Bogunovic, Andreas Krause, Maryam Kamgarpour

3:30

12/07/2020

Off-Policy Actor-Critic with Shared Experience Replay

Simon Schmitt, Matteo Hessel, Karen Simonyan

Keywords: Reinforcement Learning - Deep RL

14:38

26/08/2020

Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity

Aaron Sidford, Mengdi Wang, Lin Yang, Yinyu Ye

14:51

02/02/2021

Bounded Risk-Sensitive Markov Games: Forward Policy Design and Inverse Reward Learning with Iterative Reasoning and Cumulative Prospect Theory

Ran Tian, Liting Sun, Masayoshi Tomizuka

16:28

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng, Ben Letham, Hongzi Mao, Eytan Bakshy

3:29

13/04/2021

Linear models are robust optimal under strategic behavior

Wei Tang, Chien-Ju Ho, Yang Liu

3:32

12/07/2020

Balancing Competing Objectives with Noisy Data: Score-Based Classifiers for Welfare-Aware Machine Learning

Esther Rolf, Max Simchowitz, Sarah Dean, Lydia T. Liu, Daniel Bjorkegren, Moritz Hardt, Joshua Blumenstock

Keywords: Fairness, Equity, Justice, and Safety

14:28

06/12/2021

Neural Auto-Curricula in Two-Player Zero-Sum Games

Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang

Keywords: deep learning, optimization, reinforcement learning and planning, meta learning

14:46

26/04/2020

A Generalized Training Approach for Multiagent Learning

Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Perolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Remi Munos

Keywords: multiagent learning, game theory, training, games

14:11

13/04/2021

Improving KernelSHAP: Practical Shapley value estimation using linear regression

Ian Covert, Su-In Lee

2:52

02/02/2021

Sequential Generative Exploration Model for Partially Observable Reinforcement Learning

Haiyan Yin, Jianda Chen, Sinno Jialin Pan, Sebastian Tschiatschek

14:40

06/12/2021

Shapley Residuals: Quantifying the limits of the Shapley value for explanations

Indra Kumar, Carlos Scheidegger, Suresh Venkatasubramanian, Sorelle Friedler

Keywords: interpretability

11:42

19/04/2021

An empirical study on the generalization power of neural representations learned via visual guessing games

Alessandro Suglia, Yonatan Bisk, Ioannis Konstas, Antonio Vergari, Emanuele Bastianelli, Andrea Vanzo, Oliver Lemon

7:16

03/05/2021

Auction Learning as a Two-Player Game

Jad Rahme, Samy Jelassi, S. M Weinberg

Keywords: Game Theory, Auction Theory, Mechanism Design, Deep Learning

4:54

06/12/2020

Learning Strategy-Aware Linear Classifiers

Yiling Chen, Yang Liu, Chara Podimata

3:15

18/07/2021

DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning

Wei-Fang Sun, Cheng-Kuang Lee, Chun-Yi Lee

Keywords: Reinforcement Learning and Planning, Multi-Agent RL

5:43

06/12/2020

A Game Theoretic Analysis of Additive Adversarial Attacks and Defenses

Ambar Pal, Rene Vidal

3:19

06/12/2021

The Many Faces of Adversarial Risk

Muni Sreenivas Pydi, Varun Jog

Keywords: theory, machine learning, robustness, adversarial robustness and security, optimal transport

10:33