Differentiating the value function by using convex duality

13/04/2021

Sheheryar Mehmood, Peter Ochs

Abstract: We consider the differentiation of the value function for parametric optimization problems. Such problems are ubiquitous in machine learning applications such as structured support vector machines, matrix factorization, and min-min or minimax problems in general. Existing approaches for computing the derivative rely on strong assumptions on the parametric function. Therefore, in several scenarios there is no theoretical guarantee that a given algorithmic differentiation strategy computes the true gradient of the value function. We leverage a well-known result from convex duality theory to relax these conditions and to derive convergence rates of the derivative approximation for several classes of parametric optimization problems in machine learning. We demonstrate the versatility of our approach in several experiments, including non-smooth parametric functions. Even in settings where other approaches are applicable, our duality-based strategy shows favorable performance.
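To make the idea of differentiating a value function through duality concrete, here is a minimal sketch on a toy problem. It is an illustration only, not the authors' algorithm. It assumes the one-dimensional value function p(u) = min_x 0.5*(x - u)^2 + lam*|x|, which is non-smooth in x. Its Fenchel dual is max over |y| <= lam of u*y - 0.5*y^2, and the dual maximizer equals the derivative p'(u). All function names below are hypothetical and chosen for this example.

```python
def soft_threshold(u, lam):
    """Primal minimizer of 0.5*(x - u)**2 + lam*abs(x) over x."""
    if u > lam:
        return u - lam
    if u < -lam:
        return u + lam
    return 0.0


def value(u, lam):
    """Value function p(u): the objective evaluated at the primal minimizer."""
    x = soft_threshold(u, lam)
    return 0.5 * (x - u) ** 2 + lam * abs(x)


def dual_solution(u, lam):
    """Dual maximizer of u*y - 0.5*y**2 subject to |y| <= lam (a clipping)."""
    return max(-lam, min(lam, u))


def value_grad_via_dual(u, lam):
    """Derivative of p at u, read off from the dual solution (toy assumption)."""
    return dual_solution(u, lam)


def finite_difference(u, lam, h=1e-6):
    """Central finite difference of p, used only as a sanity check."""
    return (value(u + h, lam) - value(u - h, lam)) / (2 * h)


if __name__ == "__main__":
    lam = 1.0
    for u in (-2.0, -0.3, 0.4, 3.0):
        print(u, value_grad_via_dual(u, lam), finite_difference(u, lam))
```

In this toy case the dual solution is available in closed form. In general it would have to be approximated by an iterative solver, which is where convergence rates of the derivative approximation become relevant.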

The talk and the corresponding paper were published at the AISTATS 2021 virtual conference.

Similar Papers

06/12/2020

Non-parametric Models for Non-negative Functions

Ulysse Marteau-Ferey, Francis Bach, Alessandro Rudi

3:11

06/12/2021

Closing the Gap: Tighter Analysis of Alternating Stochastic Gradient Methods for Bilevel Problems

Tianyi Chen, Yuejiao Sun, Wotao Yin

Keywords: theory, optimization, reinforcement learning and planning, machine learning

7:27

18/07/2021

Implicit rate-constrained optimization of non-decomposable objectives

Abhishek Kumar, Harikrishna Narasimhan, Andrew Cotter

Keywords: Algorithms, Supervised Learning

3:48

06/12/2021

Stability and Generalization of Bilevel Programming in Hyperparameter Optimization

Fan Bao, Guoqiang Wu, Chongxuan Li, Jun Zhu, Bo Zhang

Keywords: optimization

8:58

12/07/2020

Optimal approximation for unconstrained non-submodular minimization

Marwa El Halabi, Stefanie Jegelka

Keywords: Optimization - General

14:41

12/07/2020

An Accelerated DFO Algorithm for Finite-sum Convex Functions

Yuwen Chen, Antonio Orvieto, Aurelien Lucchi

Keywords: Optimization - Convex

13:01

06/12/2020

Parabolic Approximation Line Search for DNNs

Maximus Mutschler, Andreas Zell

3:19

26/08/2020

Uncertainty Quantification for Sparse Deep Learning

Yuexi Wang, Veronika Rockova

15:12

18/07/2021

Principal Component Hierarchy for Sparse Quadratic Programs

Robbie Vreugdenhil, Viet Anh Nguyen, Armin Eftekhari, Peyman Mohajerin Esfahani

Keywords: Deep Learning, Optimization, Convex Optimization, Applications, Natural Language Processing

5:14

12/07/2020

Convergence of a Stochastic Gradient Method with Momentum for Non-Smooth Non-Convex Optimization

Vien Mai, Mikael Johansson

Keywords: Optimization - Non-convex

15:49

26/04/2020

Transferring Optimality Across Data Distributions via Homotopy Methods

Matilde Gargiani, Andrea Zanelli, Quoc Tran Dinh, Moritz Diehl, Frank Hutter

Keywords: deep learning, numerical optimization, transfer learning

5:25

02/02/2021

Counterfactual Explanations for Oblique Decision Trees: Exact, Efficient Algorithms

Miguel Á. Carreira-Perpiñán, Suryabhan Singh Hada

16:16

06/12/2020

On Correctness of Automatic Differentiation for Non-Differentiable Functions

Wonyeol Lee, Hangyeol Yu, Xavier Rival, Hongseok Yang

3:15

30/11/2020

Progressive Batching for Efficient Non-linear Least Squares

Huu Le, Christopher Zach, Edward Rosten, Oliver J. Woodford

8:23

06/12/2020

A Catalyst Framework for Minimax Optimization

Junchi Yang, Siqi Zhang, Negar Kiyavash, Niao He

3:01

06/12/2020

Efficient Learning of Generative Models via Finite-Difference Score Matching

Tianyu Pang, Kun Xu, Chongxuan Li, Yang Song, Stefano Ermon, Jun Zhu

2:59

06/12/2020

Hard Shape-Constrained Kernel Machines

Pierre-Cyril Aubin-Frankowski, Zoltan Szabo

3:22

06/12/2021

Hessian Eigenspectra of More Realistic Nonlinear Models

Zhenyu Liao, Michael W Mahoney

Keywords: theory, optimization, machine learning

15:49

14/06/2020

Learning to Optimize on SPD Manifolds

Zhi Gao, Yuwei Wu, Yunde Jia, Mehrtash Harandi

Keywords: Riemannian optimization, symmetric positive definite (SPD) manifolds, optimization-based meta-learning, automatical SPD optimizer design, learning to optimize, gradient-based SPD optimization, optimization problems with SPD constraints

0:50

18/07/2021

ConvexVST: A Convex Optimization Approach to Variance-stabilizing Transformation

Mengfan Wang, Boyu Lyu, Guoqiang Yu

Keywords: Optimization, Convex Optimization

4:59

06/12/2021

A Continuous Mapping For Augmentation Design

Keyu Tian, Chen Lin, Ser Nam Lim, Wanli Ouyang, Puneet Dokania, Philip Torr

Keywords: optimization

9:23

12/07/2020

Quadratically Regularized Subgradient Methods for Weakly Convex Optimization with Weakly Convex Constraints

Runchao Ma, Qihang Lin, Tianbao Yang

Keywords: Optimization - Non-convex

12:52

19/08/2021

Stability and Generalization for Randomized Coordinate Descent

Puyu Wang, Liang Wu, Yunwen Lei

Keywords: Machine Learning, Learning Theory, Online Learning

13:18

12/07/2020

Convex Representation Learning for Generalized Invariance in Semi-Inner-Product Space

Yingyi Ma, Vignesh Ganapathiraman, Yaoliang Yu, Xinhua Zhang

Keywords: Representation Learning

14:18

06/12/2021

Smooth Bilevel Programming for Sparse Regularization

Clarice Poon, Gabriel Peyré

Keywords: machine learning

13:06

06/12/2021

Linear Convergence of Gradient Methods for Estimating Structured Transition Matrices in High-dimensional Vector Autoregressive Models

Xiao Lv, Wei Cui, Yulong Liu

Keywords: theory, optimization

13:56

18/07/2021

The Limits of Min-Max Optimization Algorithms: Convergence to Spurious Non-Critical Sets

Ya-Ping Hsieh, Panayotis Mertikopoulos, Volkan Cevher

Keywords: Theory

16:38

13/04/2021

The base measure problem and its solution

Alexey Radul, Boris Alexeev

3:30

04/08/2021

Convergence rates and approximation results for SGD and its continuous-time counterpart

Xavier Fontaine, Valentin De Bortoli, Alain Durmus

17:35

12/07/2020

A simpler approach to accelerated optimization: iterative averaging meets optimism

Pooria Joulani, Anant Raj, András György, Csaba Szepesvari

Keywords: Online Learning, Active Learning, and Bandits

16:17

13/04/2021

Revisiting the role of Euler numerical integration on acceleration and stability in convex optimization

Peiyuan Zhang, Antonio Orvieto, Hadi Daneshmand, Thomas Hofmann, Roy S. Smith

3:02

06/12/2021

USCO-Solver: Solving Undetermined Stochastic Combinatorial Optimization Problems

Guangmo Tong

Keywords: optimization

15:00

26/04/2020

Learning to Guide Random Search

Ozan Sener, Vladlen Koltun

Keywords: Random search, Derivative-free optimization, Learning continuous control

4:58

13/04/2021

Efficient methods for structured nonconvex-nonconcave min-max optimization

Jelena Diakonikolas, Constantinos Daskalakis, Michael Jordan

3:33

06/12/2020

Hybrid Variance-Reduced SGD Algorithms For Minimax Problems with Nonconvex-Linear Function

Quoc Tran Dinh, Deyi Liu, Lam Nguyen

3:07

26/04/2020

Kernelized Wasserstein Natural Gradient

M Arbel, A Gretton, W Li, G Montufar

Keywords: kernel methods, natural gradient, information geometry, Wasserstein metric

4:56

12/07/2020

Random extrapolation for primal-dual coordinate descent

Ahmet Alacaoglu, Olivier Fercoq, Volkan Cevher

Keywords: Optimization - Convex

14:34

06/12/2021

Unifying Width-Reduced Methods for Quasi-Self-Concordant Optimization

Deeksha Adil, Brian Bullins, Sushant Sachdeva

Keywords: optimization

12:14

18/07/2021

Distributed Second Order Methods with Fast Rates and Compressed Communication

Rustem Islamov, Xun Qian, Peter Richtarik

Keywords: Optimization

4:51

13/04/2021

Stochastic Polyak step-size for SGD: An adaptive learning rate for fast convergence

Nicolas Loizou, Sharan Vaswani, Issam Hadj Laradji, Simon Lacoste-Julien

3:30