OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning

12/07/2020

OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning

Alexander Vezhnevets, Yuhuai Wu, Maria Eckstein, Rémi Leblond, Joel Z Leibo

Keywords: Reinforcement Learning - Deep RL

Abstract Paper Similar Papers

Abstract: This paper investigates generalisation in multi-agent games, where the generality of the agent can be evaluated by playing against opponents it hasn't seen during training. We propose two new games with concealed information and complex, non-transitive reward structure (think rock-paper-scissors). It turns out that most current deep reinforcement learning methods fail to efficiently explore the strategy space, thus learning policies that generalise poorly to unseen opponents. We then propose a novel hierarchical agent architecture, where the hierarchy is grounded in the game-theoretic structure of the game -- the top level chooses strategic responses to opponents, while the low level implements them into policy over primitive actions. This grounding facilitates credit assignment across the levels of hierarchy. Our experiments show that the proposed hierarchical agent is capable of generalisation to unseen opponents, while conventional baselines fail to generalise whatsoever.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

Provable Self-Play Algorithms for Competitive Reinforcement Learning

Yu Bai, Chi Jin

Keywords Paper

Reinforcement Learning - Theory

0

0

0

0

14:28

18/07/2021

Learning in Nonzero-Sum Stochastic Games with Potentials

David Mguni, Yutong Wu, Yali Du and
Yaodong Yang, Ziyi Wang, M. Li, Ying Wen, Joel Jennings, Jun Wang

Keywords Paper

Theory, Game Theory and Computational Economics

0

0

0

0

5:36

06/12/2020

Preference-based Reinforcement Learning with Finite-Time Guarantees

Yichong Xu, Ruosong Wang, Lin Yang and
Aarti Singh, Artur Dubrawski

Keywords Paper

0

0

0

0

3:04

26/04/2020

Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning

Hengyuan Hu, Jakob N Foerster

Keywords Paper

multi-agent RL, theory of mind

0

0

0

0

5:20

06/12/2020

Instance-based Generalization in Reinforcement Learning

Martin Bertran, Natalia Martinez, Mariano Phielipp, Guillermo Sapiro

Keywords Paper

0

0

0

0

3:26

06/12/2020

Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

Stephen Mcaleer, J.B. Lanier, Roy Fox, Pierre Baldi

Keywords Paper

0

0

0

0

3:12

03/05/2021

Iterative Empirical Game Solving via Single Policy Best Response

Max Smith, Thomas Anthony, Michael Wellman

Keywords Paper

Reinforcement Learning, Multiagent Learning, Empirical Game Theory

0

0

0

0

8:49

06/12/2021

Continuous Mean-Covariance Bandits

Yihan Du, Siwei Wang, Zhixuan Fang, Longbo Huang

Keywords Paper

bandits

0

0

0

0

11:33

02/02/2021

Hindsight and Sequential Rationality of Correlated Play

Dustin Morrill, Ryan D'Orazio, Reca Sarfati and
Marc Lanctot, James R Wright, Amy R Greenwald, Michael Bowling

Keywords Paper

0

0

0

0

18:34

19/08/2021

Boosting Offline Reinforcement Learning with Residual Generative Modeling

Hua Wei, Deheng Ye, Zhao Liu and
Hao Wu, Bo Yuan, Qiang Fu, Wei Yang, Zhenhui Li

Keywords Paper

Machine Learning Applications, Applications of Reinforcement Learning, Game Playing, Reinforcement Learning

0

0

0

0

11:32

18/07/2021

Trajectory Diversity for Zero-Shot Coordination

Andrei Lupu, Brandon Cui, Hengyuan Hu, Jakob Foerster

Keywords Paper

Reinforcement Learning and Planning, Multi-Agent RL

0

0

0

0

5:16

06/12/2020

Learning Strategy-Aware Linear Classifiers

Yiling Chen, Yang Liu, Chara Podimata

Keywords Paper

0

0

0

0

3:15

06/12/2020

Online Bayesian Persuasion

Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Nicola Gatti

Keywords Paper

0

0

0

0

3:00

06/12/2020

Learning to Play Sequential Games versus Unknown Opponents

Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

Keywords Paper

0

0

0

0

3:04

12/07/2020

“Other-Play” for Zero-Shot Coordination

Hengyuan Hu, Alexander Peysakhovich, Adam Lerer, Jakob Foerster

Keywords Paper

Planning, Control, and Multiagent Learning

0

0

0

0

13:45

06/12/2021

Exploiting Opponents Under Utility Constraints in Sequential Games

Martino Bernasconi-de-Luca, Federico Cacciamani, Simone Fioravanti and
Nicola Gatti, Alberto Marchesi, Francesco Trovò

Keywords Paper

online learning

0

0

0

0

12:59

03/05/2021

Adversarially Guided Actor-Critic

Yannis Flet-Berliac, Johan Ferret, Olivier Pietquin and
philippe preux, Matthieu Geist

Keywords Paper

0

0

0

0

4:22

02/02/2021

Solving Common-Payoff Games with Approximate Policy Iteration

Samuel Sokota, Edward Lockhart, Finbarr Timbers and
Elnaz Davoodi, Ryan D'Orazio, Neil Burch, Martin Schmid, Michael Bowling, Marc Lanctot

Keywords Paper

0

0

0

0

18:14

18/07/2021

Adversarial Policy Learning in Two-player Competitive Games

Wenbo Guo, Xian Wu, Sui Huang, Xinyu Xing

Keywords Paper

Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

5:14

06/12/2021

Which Mutual-Information Representation Learning Objectives are Sufficient for Control?

Kate Rakelly, Abhishek Gupta, Carlos Florensa, Sergey Levine

Keywords Paper

reinforcement learning and planning, representation learning

1

0

0

0

10:44

06/12/2021

Decentralized Q-learning in Zero-sum Markov Games

Muhammed Sayin, Kaiqing Zhang, David Leslie and
Tamer Basar, Asuman Ozdaglar

Keywords Paper

reinforcement learning and planning

0

0

0

0

15:07

06/12/2021

Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games

Yu Bai, Chi Jin, Huan Wang, Caiming Xiong

Keywords Paper

theory, reinforcement learning and planning, bandits

0

0

0

0

12:14

06/12/2021

Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

Xiangyu Liu, Hangtian Jia, Ying Wen and
Yaodong Yang, Yujing Hu, Yingfeng Chen, Changjie Fan, ZHIPENG HU

Keywords Paper

0

0

0

0

13:43

06/12/2020

Near-Optimal Reinforcement Learning with Self-Play

Yu Bai, Chi Jin, Tiancheng Yu

Keywords Paper

Theory -> Regularization, Applications -> Fairness, Accountability, and Transparency

0

0

0

0

3:33

06/12/2021

Neural Auto-Curricula in Two-Player Zero-Sum Games

Xidong Feng, Oliver Slumbers, Ziyu Wan and
Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang

Keywords Paper

deep learning, optimization, reinforcement learning and planning, meta learning

0

0

0

0

14:46

06/12/2021

Learning Diverse Policies in MOBA Games via Macro-Goals

Yiming Gao, Bei Shi, Xueying Du and
Liang Wang, Guangwei Chen, Zhenjie Lian, Fuhao Qiu, GUOAN HAN, Weixuan Wang, Deheng Ye, Qiang Fu, Wei Yang, Lanxiao Huang

Keywords Paper

reinforcement learning and planning

0

0

0

0

6:49

02/02/2021

GaussianPath:A Bayesian Multi-Hop Reasoning Framework for Knowledge Graph Reasoning

Guojia Wan, Bo Du

Keywords Paper

0

0

0

0

13:52

02/02/2021

Meta-Learning Effective Exploration Strategies for Contextual Bandits

Amr Sharaf, Hal Daumé III

Keywords Paper

0

0

0

0

13:56

16/11/2020

Generalization Guarantees for Imitation Learning

Allen Ren, Sushant Veer, Anirudha Majumdar

Keywords Paper

0

0

0

0

4:58

03/05/2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Zhenggang Tang, Chao Yu, Boyuan Chen and
Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Du, Yu Wang, Yi Wu

Keywords Paper

reward randomization, strategic behavior, diverse strategies, multi-agent reinforcement learning

0

0

0

0

2:40

06/12/2021

When Is Generalizable Reinforcement Learning Tractable?

Dhruv Malik, Yuanzhi Li, Pradeep Ravikumar

Keywords Paper

reinforcement learning and planning, generative model, representation learning

0

0

0

0

12:38

03/05/2021

Chaos of Learning Beyond Zero-sum and Coordination via Game Decompositions

Yun Kuen Cheung, Yixin Tao

Keywords Paper

Dynamical Systems, Volume Analysis, Follow-the-Regularized-Leader, Multiplicative Weights Update, Game Decomposition, Lyapunov Chaos, Learning in Games

0

0

0

0

3:53

02/02/2021

Defending against Backdoors in Federated Learning with Robust Learning Rate

Mustafa Safa Ozdayi, Murat Kantarcioglu, Yulia R. Gel

Keywords Paper

0

0

0

0

16:19

06/12/2020

A Game Theoretic Analysis of Additive Adversarial Attacks and Defenses

Ambar Pal, Rene Vidal

Keywords Paper

0

0

0

0

3:19

12/07/2020

Dual-Path Distillation: A Unified Framework to Improve Black-Box Attacks

Yonggang Zhang, Ya Li, Tongliang Liu, Xinmei Tian

Keywords Paper

Adversarial Examples

0

0

0

0

11:33

02/02/2021

Estimating α-Rank by Maximizing Information Gain

Tabish Rashid, Cheng Zhang, Kamil Ciosek

Keywords Paper

0

0

0

0

14:52

06/12/2021

EDGE: Explaining Deep Reinforcement Learning Policies

Wenbo Guo, Xian Wu, Usmann Khan, Xinyu Xing

Keywords Paper

reinforcement learning and planning, adversarial robustness and security, generative model, kernel methods, interpretability

0

0

0

0

12:16

18/07/2021

Estimating $\alpha$-Rank from A Few Entries with Low Rank Matrix Completion

Yali Du, Xue Yan, Xu Chen and
Jun Wang, Haifeng Zhang

Keywords Paper

Optimization, Probabilistic Methods, Distributed Inference, Algorithms, Algorithms Evaluation

0

0

0

0

4:52

18/07/2021

SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II

Xiangjun Wang, Junxiao SONG, Penghui Qi and
Peng Peng, Zhenkun Tang, Wei Zhang, Weimin Li, Xiongjun Pi, Jujie He, Chao Gao, Haitao Long, Quan Yuan

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:11

06/12/2021

Deep Synoptic Monte-Carlo Planning in Reconnaissance Blind Chess

Gregory Clark

Keywords Paper

deep learning, reinforcement learning and planning, bandits

0

0

0

0

8:29