Moreau-Yosida $f$-divergences

Abstract: Variational representations of $f$-divergences are central to many machine learning algorithms, with Lipschitz constrained variants recently gaining attention. Inspired by this, we define the Moreau-Yosida approximation of $f$-divergences with respect to the Wasserstein-$1$ metric. The corresponding variational formulas provide a generalization of a number of recent results, novel special cases of interest and a relaxation of the hard Lipschitz constraint. Additionally, we prove that the so-called tight variational representation of $f$-divergences can be to be taken over the quotient space of Lipschitz functions, and give a characterization of functions achieving the supremum in the variational representation. On the practical side, we propose an algorithm to calculate the tight convex conjugate of $f$-divergences compatible with automatic differentiation frameworks. As an application of our results, we propose the Moreau-Yosida $f$-GAN, providing an implementation of the variational formulas for the Kullback-Leibler, reverse Kullback-Leibler, $\chi^2$, reverse $\chi^2$, squared Hellinger, Jensen-Shannon, Jeffreys, triangular discrimination and total variation divergences as GANs trained on CIFAR-10, leading to competitive results and a simple solution to the problem of uniqueness of the optimal critic.

12/07/2020

Moreau-Yosida $f$-divergences

Comments

Similar Papers

Linear Convergence of Randomized Primal-Dual Coordinate Method for Large-scale Linear Constrained Convex Programming

Daoli Zhu, Lei Zhao

Keywords Abstract Paper

Optimization - Large Scale, Parallel and Distributed

Kernel distributionally robust optimization: Generalized duality theorem and stochastic approximation

Jia-Jie Zhu, Wittawat Jitkrittum, Moritz Diehl, Bernhard Schölkopf

Keywords Abstract Paper

Fine-grained Generalization Analysis of Vector-Valued Learning

Liang Wu, Antoine Ledent, Yunwen Lei, Marius Kloft

Keywords Abstract Paper

Differentiable Segmentation of Sequences

Erik Scharwächter, Jonathan Lennartz, Emmanuel Müller

Keywords Abstract Paper

warping functions, concept drift, change point detection, segmented models, segmentation, gradient descent

An online passive-aggressive algorithm for difference-of-squares classification

Keywords Abstract Paper

machine learning, online learning

Analyzing the Generalization Capability of SGLD Using Properties of Gaussian Channels

Hao Wang, Yizhe Huang, Rui Gao, Flavio Calmon

Keywords Abstract Paper

theory, optimization, machine learning

Bilevel Optimization: Convergence Analysis and Enhanced Design

Kaiyi Ji, Junjie Yang, Yingbin LIANG

Keywords Abstract Paper

Optimization, Non-Convex Optimization

Expert Learning through Generalized Inverse Multiobjective Optimization: Models, Insights and Algorithms

Chaosheng Dong, Bo Zeng

Keywords Abstract Paper

The Last-Iterate Convergence Rate of Optimistic Mirror Descent in Stochastic Variational Inequalities

Waïss Azizian, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

Keywords Abstract Paper

Optimistic bounds for multi-output learning

Henry Reeve, Ata Kaban

Keywords Abstract Paper

One-sided Frank-Wolfe algorithms for saddle problems

Vladimir Kolmogorov, Thomas Pock

Keywords Abstract Paper

Optimization, Convex Optimization

Alternating direction method of multipliers for quantization

Tianjian Huang, Prajwal Singhania, Maziar Sanjabi and Pabitra Mitra, Meisam Razaviyayn

Keywords Abstract Paper

Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation

Devavrat Shah, Dogyoon Song, Zhi Xu, Yuzhe Yang

Keywords Abstract Paper

Dynamic Submodular Maximization

Keywords Abstract Paper

Convergence of adaptive algorithms for constrained weakly convex optimization

Ahmet Alacaoglu, Yura Malitsky, Volkan Cevher

Keywords Abstract Paper

Learning Gaussian Mixtures with Generalized Linear Models: Precise Asymptotics in High-dimensions

Bruno Loureiro, Gabriele Sicuro, Cedric Gerbelot and Alessandro Pacco, Florent Krzakala, Lenka Zdeborová

Keywords Abstract Paper

theory, machine learning

Efficient methods for structured nonconvex-nonconcave min-max optimization

Jelena Diakonikolas, Constantinos Daskalakis, Michael Jordan

Keywords Abstract Paper

Automatic and Harmless Regularization with Constrained and Lexicographic Optimization: A Dynamic Barrier Approach

Chengyue Gong, Xingchao Liu, Qiang Liu

Keywords Abstract Paper

optimization, machine learning, graph learning

Accelerated Message Passing for Entropy-Regularized MAP Inference

Jonathan Lee, Aldo Pacchiano, Peter Bartlett, Michael Jordan

Keywords Abstract Paper

How Data Augmentation affects Optimization for Linear Regression

Boris Hanin, Yi Sun

Keywords Abstract Paper

optimization, machine learning

On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning

Alireza Fallah, Kristian Georgiev, Aryan Mokhtari, Asuman Ozdaglar

Keywords Abstract Paper

theory, optimization, reinforcement learning and planning, meta learning

Fast convergence of stochastic subgradient method under interpolation

Huang Fang, Zhenan Fan, Michael Friedlander

Keywords Abstract Paper

interpolation, stochastic subgradient method, convergence analysis, Optimization

Localization, Convexity, and Star Aggregation

Keywords Abstract Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Tianjian Huang, Prajwal Singhania, Maziar Sanjabi and
Pabitra Mitra, Meisam Razaviyayn

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Bruno Loureiro, Gabriele Sicuro, Cedric Gerbelot and
Alessandro Pacco, Florent Krzakala, Lenka Zdeborová

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Ba-Hien Tran, Simone Rossi, Dimitrios Milios and
Pietro Michiardi, Edwin Bonilla, Maurizio Filippone

Keywords Paper

Nutan Chen, Alexej Klushyn, Francesco Ferroni and
Justin Bayer, Patrick van der Smagt

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Long Yang, Gang Zheng, Yu Zhang and
Qian Zheng, Pengfei Li, Gang Pan

Keywords Paper

Xin Zhang, Zhuqing Liu, Jia Liu and
Zhengyuan Zhu, Songtao Lu

Keywords Paper

Keywords Paper

Keywords Paper

Jincheng Mei, Yue Gao, Bo Dai and
Csaba Szepesvari, Dale Schuurmans

Keywords Paper

Keywords Paper