Discretization Drift in Two-Player Games

18/07/2021

Mihaela Rosca, Yan Wu, Benoit Dherin, David GT Barrett

Keywords: Theory, Deep learning Theory

Abstract: Gradient-based methods for two-player games produce rich dynamics that can solve challenging problems, yet they can be difficult to stabilize and understand. Part of this complexity originates from the discrete update steps of simultaneous or alternating gradient descent, which cause each player to drift away from the continuous gradient flow, a phenomenon we call discretization drift. Using backward error analysis, we derive modified continuous dynamical systems that closely follow the discrete dynamics. These modified dynamics provide insight into the notorious challenges associated with zero-sum games, including Generative Adversarial Networks. In particular, we identify distinct components of the discretization drift that can alter performance and, in some cases, destabilize the game. Finally, quantifying discretization drift allows us to identify regularizers that explicitly cancel harmful forms of drift or strengthen beneficial forms of drift, and thus improve the performance of GAN training.
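
As a rough illustration of the drift the abstract describes, the sketch below runs simultaneous and alternating gradient updates on the toy bilinear game f(x, y) = x * y, where x minimizes and y maximizes. The toy game, the step size, and all function names are assumptions made for illustration; they are not taken from the paper. The continuous gradient flow of this game keeps x^2 + y^2 constant, so any change in the radius comes from the discrete updates.

```python
import math


def simultaneous_step(x, y, h):
    # Both players update from the same (old) iterate.
    # x descends on f(x, y) = x * y, y ascends on it.
    return x - h * y, y + h * x


def alternating_step(x, y, h):
    # x updates first; y then reacts to the new x.
    x_new = x - h * y
    y_new = y + h * x_new
    return x_new, y_new


def final_radius(step, x0, y0, h, n_steps):
    # Distance from the equilibrium (0, 0) after n_steps updates.
    x, y = x0, y0
    for _ in range(n_steps):
        x, y = step(x, y, h)
    return math.hypot(x, y)


h = 0.1          # illustrative step size (assumption)
n_steps = 1000   # illustrative number of updates (assumption)

sim_radius = final_radius(simultaneous_step, 1.0, 0.0, h, n_steps)
alt_radius = final_radius(alternating_step, 1.0, 0.0, h, n_steps)

print("simultaneous:", sim_radius)
print("alternating: ", alt_radius)
```

Under these assumptions the simultaneous updates spiral outward, since each step multiplies the squared radius by 1 + h^2, while the alternating updates stay close to the unit circle on which the continuous flow would remain.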

The talk and the corresponding paper were published at the ICML 2021 virtual conference.

Similar Papers

13/04/2021

Improving KernelSHAP: Practical shapley value estimation using linear regression

Ian Covert, Su-In Lee

06/12/2020

Chaos, Extremism and Optimism: Volume Analysis of Learning in Games

Yun Kuen Cheung, Georgios Piliouras

03/05/2021

Chaos of Learning Beyond Zero-sum and Coordination via Game Decompositions

Yun Kuen Cheung, Yixin Tao

Keywords: Dynamical Systems, Volume Analysis, Follow-the-Regularized-Leader, Multiplicative Weights Update, Game Decomposition, Lyapunov Chaos, Learning in Games

06/12/2020

Training Generative Adversarial Networks by Solving Ordinary Differential Equations

Chongli Qin, Yan Wu, Jost Tobias Springenberg, Andy Brock, Jeff Donahue, Timothy Lillicrap, Pushmeet Kohli

06/12/2021

Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin, Danil Karpushkin, Dmitry Vetrov

Keywords: deep learning, optimization

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords: reinforcement learning and planning, robustness, representation learning

12/07/2020

From Chaos to Order: Symmetry and Conservation Laws in Game Dynamics

Sai Ganesh Nagarajan, David Balduzzi, Georgios Piliouras

Keywords: Learning Theory

06/12/2020

Softmax Deep Double Deterministic Policy Gradients

Ling Pan, Qingpeng Cai, Longbo Huang

26/08/2020

Discrete Action On-Policy Learning with Action-Value Critic

Yuguang Yue, Yunhao Tang, Mingzhang Yin, Mingyuan Zhou

12/07/2020

Representations for Stable Off-Policy Reinforcement Learning

Dibya Ghosh, Marc Bellemare

Keywords: Reinforcement Learning - General

25/04/2020

Enemy Within: Long-term Motivation Effects of Deep Player Behavior Models for Dynamic Difficulty Adjustment

Johannes Pfau, Jan Smeddinck, Rainer Malaka

Keywords: dynamic difficulty adjustment, player modeling, neural networks, deep learning, mmorpgs, games

25/07/2020

DVGAN: A minimax game for search result diversification combining explicit and implicit features

Jiongnan Liu, Zhicheng Dou, Xiaojie Wang, Shuqi Lu, Ji-Rong Wen

Keywords: generative adversarial network, search result diversification

14/06/2020

Gradually Vanishing Bridge for Adversarial Domain Adaptation

Shuhao Cui, Shuhui Wang, Junbao Zhuo, Chi Su, Qingming Huang, Qi Tian

Keywords: bridge, domain adaptation, adversarial learning

26/04/2020

CAQL: Continuous Action Q-Learning

Moonkyung Ryu, Yinlam Chow, Ross Anderson, Christian Tjandraatmadja, Craig Boutilier

Keywords: Reinforcement learning (RL), DQN, Continuous control, Mixed-Integer Programming (MIP)

18/07/2021

DriftSurf: Stable-State / Reactive-State Learning under Concept Drift

Ashraf Tahmasbi, Ellango Jothimurugesan, Srikanta Tirthapura, Phil Gibbons

Keywords: Algorithms, Online Learning Algorithms

17/08/2020

Learning temporal coherence via self-supervision for GAN-based video generation

Mengyu Chu, You Xie, Jonas Mayer, Laura Leal-Taixé, Nils Thuerey

Keywords: self-supervision, temporal cycle-consistency, video super-resolution, generative adversarial network, unpaired video translation

12/07/2020

Extrapolation for Large-batch Training in Deep Learning

Tao LIN, Lingjing Kong, Sebastian Stich, Martin Jaggi

Keywords: Deep Learning - Algorithms

02/02/2021

Right for Better Reasons: Training Differentiable Models by Constraining their Influence Functions

Xiaoting Shao, Arseny Skryagin, Wolfgang Stammer, Patrick Schramowski, Kristian Kersting

02/02/2021

Maximum Roaming Multi-Task Learning

Lucas Pascal, Pietro Michiardi, Xavier Bost, Benoit Huet, Maria A. Zuluaga

18/11/2020

CCA-flow: Deep multi-view subspace learning with inverse autoregressive flow

Jia He, Feiyang Pan, Fuzhen Zhuang, Qing He

14/06/2020

Closed-Loop Matters: Dual Regression Networks for Single Image Super-Resolution

Yong Guo, Jian Chen, Jingdong Wang, Qi Chen, Jiezhang Cao, Zeshuai Deng, Yanwu Xu, Mingkui Tan

Keywords: computer vision, image super-resolution, dual regression scheme, closed-loop

18/07/2021

Towards Better Robust Generalization with Shift Consistency Regularization

Shufei Zhang, Zhuang Qian, Kaizhu Huang, Qiufeng Wang, Rui Zhang, Xinping Yi

Keywords: Algorithms, Adversarial Examples

18/07/2021

Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction

Afsaneh Mastouri, Yuchen Zhu, Limor Gultchin, Anna Korba, Ricardo Silva, Matt J. Kusner, Arthur Gretton, Krikamol Muandet

Keywords: Algorithms, Kernel Methods

06/12/2020

Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation

Yogesh Balaji, Rama Chellappa, Soheil Feizi

02/02/2021

Improving Sample Efficiency in Model-Free Reinforcement Learning from Images

Denis Yarats, Amy Zhang, Ilya Kostrikov, Brandon Amos, Joelle Pineau, Rob Fergus

13/04/2021

Self-concordant analysis of generalized linear bandits with forgetting

Yoan Russac, Louis Faury, Olivier Cappé, Aurélien Garivier

09/07/2020

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Qiaomin Xie, Yudong Chen, Zhaoran Wang, Zhuoran Yang

Keywords: Reinforcement learning, Planning and control

19/08/2021

Relaxed Core Stability in Fractional Hedonic Games

Angelo Fanelli, Gianpiero Monaco, Luca Moscardelli

Keywords: Agent-based and Multi-agent Systems, Algorithmic Game Theory, Noncooperative Games, Coordination and Cooperation

14/06/2020

Cogradient Descent for Bilinear Optimization

Li'an Zhuo, Baochang Zhang, Linlin Yang, Hanlin Chen, Qixiang Ye, David Doermann, Rongrong Ji, Guodong Guo

Keywords: bilinear optimization, gradient descent algorithm, convolutional sparse coding, network pruning

06/12/2021

Reliable Estimation of KL Divergence using a Discriminator in Reproducing Kernel Hilbert Space

Sandesh Ghimire, Aria Masoomi, Jennifer Dy

Keywords: theory, deep learning, machine learning, kernel methods

06/12/2021

Adversarial Robustness with Semi-Infinite Constrained Learning

Alexander Robey, Luiz Chamon, George J. Pappas, Hamed Hassani, Alejandro Ribeiro

Keywords: theory, deep learning, optimization, robustness, adversarial robustness and security

04/08/2021

Learning in Matrix Games can be Arbitrarily Complex

Gabriel P Andrade, Rafael Frongillo, Georgios Piliouras

06/12/2020

Interior Point Solving for LP-based prediction+optimisation

Jayanta Mandi, Tias Guns

12/07/2020

Understanding and Stabilizing GANs' Training Dynamics Using Control Theory

Kun Xu, Chongxuan Li, Jun Zhu, Bo Zhang

Keywords: Deep Learning - General

12/07/2020

Implicit competitive regularization in GANs

Florian Schaefer, Hongkai Zheng, Anima Anandkumar

Keywords: Deep Learning - Theory

12/07/2020

Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Jang-Hyun Kim, Wonho Choo, Hyun Oh Song

Keywords: Deep Learning - Algorithms

18/07/2021

Fundamental Tradeoffs in Distributionally Adversarial Training

Mohammad Mehrabi, Adel Javanmard, Ryan A. Rossi, Anup Rao, Tung Mai

Keywords: Theory

06/12/2021

Time-series Generation by Contrastive Imitation

Daniel Jarrett, Ioana Bica, Mihaela van der Schaar

Keywords: generative model

02/02/2021

Hindsight and Sequential Rationality of Correlated Play

Dustin Morrill, Ryan D'Orazio, Reca Sarfati, Marc Lanctot, James R Wright, Amy R Greenwald, Michael Bowling

06/12/2021

COMBO: Conservative Offline Model-Based Policy Optimization

Tianhe Yu, Aviral Kumar, Rafael Rafailov, Aravind Rajeswaran, Sergey Levine, Chelsea Finn

Keywords: deep learning, optimization, reinforcement learning and planning