The Mean-Squared Error of Double Q-Learning

06/12/2020

The Mean-Squared Error of Double Q-Learning

Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant

Keywords:

Abstract Paper Similar Papers

Abstract: In this paper, we establish a theoretical comparison between the asymptotic mean square errors of double Q-learning and Q-learning. Our result builds upon an analysis for linear stochastic approximation based on Lyapunov equations and applies to both tabular setting or with linear function approximation, provided that the optimal policy is unique and the algorithms converge. We show that the asymptotic mean-square error of Double Q-learning is exactly equal to that of Q-learning if Double Q-learning uses twice the learning rate of Q-learning and the output of Double Q-learning is the average of its two estimators. We also present some practical implications of this theoretical observation using simulations.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

13/04/2021

Alternating direction method of multipliers for quantization

Tianjian Huang, Prajwal Singhania, Maziar Sanjabi and
Pabitra Mitra, Meisam Razaviyayn

Keywords Paper

1

0

0

0

2:43

13/04/2021

Reinforcement learning for constrained markov decision processes

Ather Gattami, Qinbo Bai, Vaneet Aggarwal

Keywords Paper

0

0

0

0

3:08

12/07/2020

Sequential Transfer in Reinforcement Learning with a Generative Model

Andrea Tirinzoni, Riccardo Poiani, Marcello Restelli

Keywords Paper

Reinforcement Learning - General

0

0

0

0

10:54

06/12/2021

A Mathematical Framework for Quantifying Transferability in Multi-source Transfer Learning

Xinyi Tong, Xiangxiang Xu, Shao-Lun Huang, Lizhong Zheng

Keywords Paper

theory, deep learning, machine learning, vision, transfer learning

2

1

0

0

13:27

26/08/2020

A Rule for Gradient Estimator Selection, with an Application to Variational Inference

Tomas Geffner, Justin Domke

Keywords Paper

0

0

0

0

8:36

03/05/2021

Entropic gradient descent algorithms and wide flat minima

Fabrizio Pittorino, Carlo Lucibello, Christoph Feinauer and
Gabriele Perugini, Carlo Baldassi, Elizaveta Demyanenko, Riccardo Zecchina

Keywords Paper

flat minima, belief-propagation, statistical physics, entropic algorithms

0

0

0

0

5:38

26/08/2020

A Wasserstein Minimum Velocity Approach to Learning Unnormalized Models

Ziyu Wang, Shuyu Cheng, Li Yueru and
Jun Zhu, Bo Zhang

Keywords Paper

0

0

0

0

9:58

14/09/2020

End-to-End Learning for Prediction and Optimization with Gradient Boosting

Takuya Konishi, Takuro Fukunaga

Keywords Paper

combinatorial optimization, boosting/ensemble methods

0

0

0

0

15:14

12/07/2020

Expert Learning through Generalized Inverse Multiobjective Optimization: Models, Insights and Algorithms

Chaosheng Dong, Bo Zeng

Keywords Paper

Learning Theory

0

0

0

0

12:11

13/04/2021

SONIA: A symmetric blockwise truncated optimization algorithm

Majid Jahani, MohammadReza Nazari, Rachael Tappenden and
Albert Berahas, Martin Takac

Keywords Paper

0

0

0

0

2:55

06/12/2020

Learning by Minimizing the Sum of Ranked Range

Shu Hu, Yiming Ying, xin wang, Siwei Lyu

Keywords Paper

Algorithms -> Sparsity and Compressed Sensing, Theory -> Frequentist Statistics

0

0

0

0

3:12

06/12/2021

Generalization Guarantee of SGD for Pairwise Learning

Yunwen Lei, Mingrui Liu, Yiming Ying

Keywords Paper

optimization, machine learning

0

0

0

0

14:30

12/07/2020

Few-shot Relation Extraction via Bayesian Meta-learning on Task Graphs

Meng Qu, Tianyu Gao, Louis-Pascal Xhonneux, Jian Tang

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

6:45

06/12/2021

Generalization of Model-Agnostic Meta-Learning Algorithms: Recurring and Unseen Tasks

Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

Keywords Paper

theory, optimization, meta learning

0

0

0

0

14:42

06/12/2021

Particle Dual Averaging: Optimization of Mean Field Neural Network with Global Convergence Rate Analysis

Atsushi Nitanda, Denny Wu, Taiji Suzuki

Keywords Paper

theory, deep learning, optimization

0

0

0

0

12:59

06/12/2020

A Contour Stochastic Gradient Langevin Dynamics Algorithm for Simulations of Multi-modal Distributions

Wei Deng, Guang Lin, Faming Liang

Keywords Paper

0

0

0

0

3:26

18/07/2021

Stability and Generalization of Stochastic Gradient Methods for Minimax Problems

Yunwen Lei, Zhenhuan Yang, Tianbao Yang, Yiming Ying

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

16:24

26/08/2020

Ordered SGD: A New Stochastic Optimization Framework for Empirical Risk Minimization

Kenji Kawaguchi, Haihao Lu

Keywords Paper

0

0

0

0

14:10

06/12/2021

On learning sparse vectors from mixture of responses

Nikita Polyanskii

Keywords Paper

generative model

0

0

0

0

10:55

06/12/2021

Analyzing the Generalization Capability of SGLD Using Properties of Gaussian Channels

Hao Wang, Yizhe Huang, Rui Gao, Flavio Calmon

Keywords Paper

theory, optimization, machine learning

0

0

0

0

12:27

06/12/2020

Modeling and Optimization Trade-off in Meta-learning

Katelyn Gao, Ozan Sener

Keywords Paper

0

0

0

0

3:21

06/12/2021

Lower Bounds and Optimal Algorithms for Smooth and Strongly Convex Decentralized Optimization Over Time-Varying Networks

Dmitry Kovalev, Elnur Gasanov, Alexander Gasnikov, Peter Richtarik

Keywords Paper

optimization

0

0

0

0

15:02

06/12/2020

A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms

Donghwan Lee, Niao He

Keywords Paper

0

0

0

0

3:56

26/04/2020

Gradient Descent Maximizes the Margin of Homogeneous Neural Networks

Kaifeng Lyu, Jian Li

Keywords Paper

margin, homogeneous, gradient descent

0

0

0

0

15:02

12/07/2020

Learning Selection Strategies in Buchberger’s Algorithm

Dylan Peifer, Michael Stillman, Daniel Halpern-Leistner

Keywords Paper

Applications - Other

0

0

0

0

14:50

06/12/2020

Lower Bounds and Optimal Algorithms for Personalized Federated Learning

Filip Hanzely, Slavomír Hanzely, Samuel Horváth, Peter Richtarik

Keywords Paper

, Theory -> Learning Theory

0

0

0

0

3:24

03/05/2021

Fast convergence of stochastic subgradient method under interpolation

Huang Fang, Zhenan Fan, Michael Friedlander

Keywords Paper

interpolation, stochastic subgradient method, convergence analysis, Optimization

0

0

0

0

4:42

06/12/2021

Robustness between the worst and average case

Leslie Rice, Anna Bair, Huan Zhang, J. Zico Kolter

Keywords Paper

machine learning, robustness, adversarial robustness and security, generative model

0

0

0

0

10:46

18/07/2021

Learning While Playing in Mean-Field Games: Convergence and Optimality

Qiaomin Xie, Zhuoran Yang, Zhaoran Wang, Andreea Minca

Keywords Paper

Applications, Privacy, Anonymity, and Security, Algorithms, Components Analysis (e.g., CCA, ICA, LDA, PCA), Reinforcement Learning and Planning, Multi-Agent RL

0

0

0

0

5:24

18/07/2021

Implicit rate-constrained optimization of non-decomposable objectives

Abhishek Kumar, Harikrishna Narasimhan, Andrew Cotter

Keywords Paper

Algorithms, Supervised Learning

0

0

0

0

3:48

06/12/2021

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

Keywords Paper

machine learning, online learning

0

0

0

0

14:00

06/12/2020

Exact expressions for double descent and implicit regularization via surrogate random design

Michal Derezinski, Feynman Liang, Michael W Mahoney

Keywords Paper

0

0

0

0

3:24

06/12/2021

Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach

Fan Yang, Kai He, Linxiao Yang and
Hongxia Du, Jingbang Yang, Bo Yang, Liang Sun

Keywords Paper

optimization

0

0

0

0

4:43

04/08/2021

Outlier-Robust Learning of Ising Models Under Dobrushin's Condition

Ilias Diakonikolas, Daniel M. Kane, Alistair Stewart, Yuxin Sun

Keywords Paper

0

0

0

0

16:22

18/07/2021

Bilevel Optimization: Convergence Analysis and Enhanced Design

Kaiyi Ji, Junjie Yang, Yingbin LIANG

Keywords Paper

Optimization, Non-Convex Optimization

0

0

0

0

5:02

13/04/2021

Regularized ERM on random subspaces

Andrea Della Vecchia, Jaouad Mourtada, Ernesto De Vito, Lorenzo Rosasco

Keywords Paper

0

0

0

0

2:57

06/12/2020

Cross-validation Confidence Intervals for Test Error

Pierre Bayle, Alexandre Bayle, Lucas Janson, Lester Mackey

Keywords Paper

Deep Learning; Deep Learning -> Optimization for Deep Networks; Theory -> Regularization, Theory

0

0

0

0

3:24

03/05/2021

Optimal Regularization can Mitigate Double Descent

Preetum Nakkiran, Prayaag Venkat, Sham M Kakade, Tengyu Ma

Keywords Paper

regression, double descent, regularization, generalization, monotonicity

0

0

0

0

5:05

12/07/2020

Quadratically Regularized Subgradient Methods for Weakly Convex Optimization with Weakly Convex Constraints

Runchao Ma, Qihang Lin, Tianbao Yang

Keywords Paper

Optimization - Non-convex

0

0

0

0

12:52

06/12/2021

Fractal Structure and Generalization Properties of Stochastic Optimization Algorithms

Alexander Camuto, George Deligiannidis, Murat Erdogdu and
Mert Gurbuzbalaban, Umut Simsekli, Lingjiong Zhu

Keywords Paper

theory, deep learning, optimization

0

0

0

0

14:36