Implicit Learning Dynamics in Stackelberg Games: Equilibria Characterization, Convergence Analysis, and Empirical Study

Abstract: Contemporary work on learning in continuous games has commonly overlooked the hierarchical decision-making structure present in machine learning problems formulated as games, instead treating them as simultaneous play games and adopting the Nash equilibrium solution concept. We deviate from this paradigm and provide a comprehensive study of learning in Stackelberg games. This work provides insights into the optimization landscape of zero-sum games by establishing connections between Nash and Stackelberg equilibria along with the limit points of simultaneous gradient descent. We derive novel gradient-based learning dynamics emulating the natural structure of a Stackelberg game using the Implicit Function Theorem and provide convergence analysis for deterministic and stochastic updates for zero-sum and general-sum games. Notably, in zero-sum games using deterministic updates, we show the only critical points the dynamics converge to are Stackelberg equilibria and provide a local convergence rate. Empirically, the proposed learning dynamics mitigate rotational behavior and exhibit benefits for training Generative Adversarial Networks compared to gradient play.
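
The abstract does not spell out the update rule, so the following is only a minimal sketch of what an implicit-function-theorem-based leader update could look like, not the authors' implementation. It assumes a hypothetical scalar zero-sum cost f(x, y) = 0.5x^2 + 2xy - 0.5y^2, where the leader x minimizes f and the follower y maximizes it. The leader descends the total derivative D_x f - D_xy f (D_yy f)^{-1} D_y f, which accounts for the follower's best response, and the follower ascends its own gradient. The names `f_grads`, `stackelberg_step`, and `run`, the step size, and the toy cost are all illustrative assumptions.

```python
def f_grads(x, y):
    """Analytic derivatives of the assumed toy cost
    f(x, y) = 0.5*x**2 + 2*x*y - 0.5*y**2."""
    d_x = x + 2.0 * y    # partial derivative of f in x
    d_y = 2.0 * x - y    # partial derivative of f in y
    d_xy = 2.0           # mixed second derivative
    d_yy = -1.0          # follower curvature (negative: f is concave in y)
    return d_x, d_y, d_xy, d_yy


def stackelberg_step(x, y, lr=0.1):
    """One deterministic update of leader x and follower y."""
    d_x, d_y, d_xy, d_yy = f_grads(x, y)
    # Implicit function theorem on D_y f(x, y*(x)) = 0 gives
    # dy*/dx = -d_xy / d_yy, so the leader's total derivative is:
    total = d_x - d_xy * d_y / d_yy
    x_new = x - lr * total   # leader: descent on the total derivative
    y_new = y + lr * d_y     # follower: ascent on its own gradient
    return x_new, y_new


def run(x0=1.0, y0=1.0, steps=200, lr=0.1):
    """Iterate the update from (x0, y0) and return the final point."""
    x, y = x0, y0
    for _ in range(steps):
        x, y = stackelberg_step(x, y, lr)
    return x, y


if __name__ == "__main__":
    print(run())
```

For this particular cost the total derivative simplifies to 5x, so the leader's iterate contracts toward 0 regardless of y, and the follower then tracks its best response y = 2x. In higher dimensions the division by `d_yy` would become a linear solve against the follower's Hessian block.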

09/07/2020

Algorithms -> Semi-Supervised Learning; Applications -> Computer Vision; Deep Learning, Applications -> Computational Photography

02/02/2021

Tanner Fiez, Benjamin Chasnov, Lillian Ratliff

Similar Papers

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Qiaomin Xie, Yudong Chen, Zhaoran Wang, Zhuoran Yang

Reinforcement learning, Planning and control

Learning in two-player zero-sum partially observable Markov games with perfect recall

Tadashi Kozuno, Pierre Ménard, Remi Munos, Michal Valko

reinforcement learning and planning, bandits, online learning

Reinforcement learning for mean field games with strategic complementarities

Kiyeob Lee, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai

Decentralized Q-learning in Zero-sum Markov Games

Muhammed Sayin, Kaiqing Zhang, David Leslie, Tamer Basar, Asuman Ozdaglar

reinforcement learning and planning

Global Convergence to Local Minmax Equilibrium in Classes of Nonconvex Zero-Sum Games

Tanner Fiez, Lillian Ratliff, Eric Mazumdar, Evan Faulkner, Adhyyan Narang

theory, optimization

No-Regret Learning and Mixed Nash Equilibria: They Do Not Mix

Manolis Vlatakis-Gkaragkounis, Lampros Flokas, Thanasis Lianeas, Panayotis Mertikopoulos, Georgios Piliouras

Algorithms -> Semi-Supervised Learning; Applications -> Computer Vision; Deep Learning, Applications -> Computational Photography

Newton Optimization on Helmholtz Decomposition for Continuous Games

Giorgia Ramponi, Marcello Restelli

Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for Smooth Minimax Games

Arun Suggala, Praneeth Netrapalli

Information Theoretic Regret Bounds for Online Nonlinear Control

Sham Kakade, Akshay Krishnamurthy, Kendall Lowrey, Motoya Ohnishi, Wen Sun

Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincaré Recurrence

Yun Kuen Cheung, Georgios Piliouras

Theory, RL, Decisions and Control Theory

Invariant Risk Minimization Games

Kartik Ahuja, Karthikeyan Shanmugam, Kush Varshney, Amit Dhurandhar

Causality

Neural Active Learning with Performance Guarantees

Zhilei Wang, Pranjal Awasthi, Christoph Dann, Ayush Sekhari, Claudio Gentile

deep learning, active learning

Provable Self-Play Algorithms for Competitive Reinforcement Learning

Yu Bai, Chi Jin

Reinforcement Learning - Theory

Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality

Stefanos Leonardos, Georgios Piliouras, Kelly Spendlove

reinforcement learning and planning

A limited-capacity minimax theorem for non-convex games or: How I learned to stop worrying about mixed-Nash and love neural nets

Gauthier Gidel, David Balduzzi, Wojciech Czarnecki, Marta Garnelo, Yoram Bachrach

Learning in Matrix Games can be Arbitrarily Complex

Gabriel P Andrade, Rafael Frongillo, Georgios Piliouras

Near-Optimal Reinforcement Learning with Self-Play

Yu Bai, Chi Jin, Tiancheng Yu

Theory -> Regularization, Applications -> Fairness, Accountability, and Transparency

On the generalization properties of adversarial training

Yue Xing, Qifan Song, Guang Cheng

Hindsight and Sequential Rationality of Correlated Play

Dustin Morrill, Ryan D'Orazio, Reca Sarfati, Marc Lanctot, James R Wright, Amy R Greenwald, Michael Bowling

Temporal Induced Self-Play for Stochastic Bayesian Games

Weizhe Chen, Zihan Zhou, Yi Wu, Fei Fang

Agent-based and Multi-agent Systems, Multi-agent Learning, Applications of Reinforcement Learning

Online Markov Decision Processes with Aggregate Bandit Feedback

Alon Cohen, Haim Kaplan, Tomer Koren, Yishay Mansour

POMDPs in Continuous Time and Discrete Spaces
