Exploiting Higher Order Smoothness in Derivative-free Optimization and Continuous Bandits

06/12/2020

Arya Akhavan, Massimiliano Pontil, Alexandre Tsybakov

Keywords: Reinforcement Learning and Planning -> Reinforcement Learning, Applications -> Privacy, Anonymity, and Security

Abstract: We address the problem of zero-order optimization of a strongly convex function. The goal is to find the minimizer of the function by a sequential exploration of its function values, under measurement noise. We study the impact of higher order smoothness properties of the function on the optimization error and on the online regret. To solve this problem we consider a randomized approximation of the projected gradient descent algorithm. The gradient is estimated by a randomized procedure involving two function evaluations and a smoothing kernel. We derive upper bounds for this algorithm both in the constrained and unconstrained settings and prove minimax lower bounds for any sequential search method. Our results imply that the zero-order algorithm is nearly optimal in terms of sample complexity and the problem parameters. Based on this algorithm, we also propose an estimator of the minimum value of the function achieving almost sharp oracle behavior. We compare our results with the state-of-the-art, highlighting a number of key improvements.
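The procedure outlined in the abstract (projected gradient descent driven by a randomized gradient estimate built from two noisy function evaluations and a smoothing kernel) can be illustrated with a minimal sketch. This is not the authors' implementation. The kernel K(r) = 3r, the step size 1/(alpha * t), the perturbation size h_t = t^(-1/4), the Euclidean-ball constraint set, and all function names are assumptions chosen for illustration; the paper should be consulted for the exact estimator and parameter choices.

```python
import numpy as np


def kernel(r):
    # Assumed smoothing kernel K(r) = 3r. For r uniform on [-1, 1] it
    # satisfies E[K(r)] = 0 and E[r * K(r)] = 1.
    return 3.0 * r


def estimate_gradient(f_noisy, x, h, rng):
    """Randomized gradient estimate from two noisy function evaluations."""
    d = x.size
    zeta = rng.standard_normal(d)
    zeta /= np.linalg.norm(zeta)      # uniform direction on the unit sphere
    r = rng.uniform(-1.0, 1.0)        # random scale fed to the kernel
    y_plus = f_noisy(x + h * r * zeta)
    y_minus = f_noisy(x - h * r * zeta)
    return d / (2.0 * h) * (y_plus - y_minus) * kernel(r) * zeta


def project_ball(x, radius):
    """Euclidean projection onto the ball of the given radius (assumed constraint set)."""
    norm = np.linalg.norm(x)
    return x if norm <= radius else x * (radius / norm)


def zero_order_pgd(f_noisy, x0, steps, radius, alpha, rng):
    """Projected gradient descent using only noisy function values."""
    x = np.asarray(x0, dtype=float)
    for t in range(1, steps + 1):
        eta = 1.0 / (alpha * t)       # assumed step size for alpha-strong convexity
        h = t ** -0.25                # assumed perturbation-size schedule
        g = estimate_gradient(f_noisy, x, h, rng)
        x = project_ball(x - eta * g, radius)
    return x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x_star = np.array([1.0, -2.0, 0.5])   # hypothetical minimizer

    def f_noisy(x):
        # Strongly convex quadratic (alpha = 2) observed with Gaussian noise.
        return float(np.sum((x - x_star) ** 2)) + 0.1 * rng.standard_normal()

    x_hat = zero_order_pgd(f_noisy, np.zeros(3), steps=20000,
                           radius=5.0, alpha=2.0, rng=rng)
    print("estimated minimizer:", x_hat)
```

For a quadratic objective the symmetric difference removes all even-order terms, so with this kernel the estimate is unbiased for the gradient at any h; for general smooth functions the perturbation size trades bias against noise amplification, which is where higher order smoothness enters.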

The talk and the corresponding paper were published at the NeurIPS 2020 virtual conference.

Similar Papers

06/12/2021

Approximate optimization of convex functions with outlier noise

Anindya De, Sanjeev Khanna, Huan Li, MohammadHesam NikpeySalekde

Keywords: optimization

14:45

18/07/2021

Distributionally Robust Optimization with Markovian Data

Mengmeng Li, Tobias Sutter, Daniel Kuhn

Keywords: Optimization

5:18

02/02/2021

Smooth Convex Optimization Using Sub-Zeroth-Order Oracles

Mustafa O. Karabag, Cyrus Neary, Ufuk Topcu

19:42

18/07/2021

On Lower Bounds for Standard and Robust Gaussian Process Bandit Optimization

Xu Cai, Jonathan Scarlett

Keywords: Applications, Natural Language Processing, Applications, Network Analysis, Reinforcement Learning and Planning, Bandits

4:19

06/12/2020

Dynamic Regret of Convex and Smooth Functions

Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou

3:09

02/02/2021

Wasserstein Distributionally Robust Inverse Multiobjective Optimization

Chaosheng Dong, Bo Zeng

14:45

26/08/2020

Ordered SGD: A New Stochastic Optimization Framework for Empirical Risk Minimization

Kenji Kawaguchi, Haihao Lu

14:10

06/12/2021

On the Bias-Variance-Cost Tradeoff of Stochastic Optimization

Yifan Hu, Xin Chen, Niao He

Keywords: theory, optimization, machine learning

14:56

18/07/2021

Dueling Convex Optimization

Aadirupa Saha, Tomer Koren, Yishay Mansour

Keywords: Algorithms, Multitask and Transfer Learning, Algorithms, Ranking and Preference Learning, Algorithms, Classification

6:19

06/12/2021

Distributed Zero-Order Optimization under Adversarial Noise

Arya Akhavan, Massimiliano Pontil, Alexandre Tsybakov

Keywords: theory, optimization, online learning

8:12

13/04/2021

Rate-improved inexact augmented lagrangian method for constrained nonconvex optimization

Zichong Li, Pin-Yu Chen, Sijia Liu, Songtao Lu, Yangyang Xu

3:04

06/12/2021

Optimal Rates for Random Order Online Optimization

Uri Sherman, Tomer Koren, Yishay Mansour

Keywords: optimization, online learning

18:43

09/07/2020

The estimation error of general first order methods

Michael V Celentano, Andrea Montanari, Yuchen Wu

Keywords: High-dimensional statistics, Computational complexity, Matrix/tensor estimation, Regression

14:10

12/07/2020

Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks

Sijia Liu, Songtao Lu, Xiangyi Chen, Yao Feng, Kaidi Xu, Abdullah Al-Dujaili, Mingyi Hong, Una-May O'Reilly

Keywords: Optimization - Non-convex

11:59

18/07/2021

Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization

Wes Chung, Valentin Thomas, Marlos C. Machado, Nicolas Le Roux

Keywords: Reinforcement Learning and Planning

5:13

06/12/2021

Risk Bounds and Calibration for a Smart Predict-then-Optimize Method

Heyuan Liu, Paul Grigas

Keywords: theory, optimization, machine learning

14:56

18/07/2021

Regret Minimization in Stochastic Non-Convex Learning via a Proximal-Gradient Approach

Nadav Hallak, Panayotis Mertikopoulos, Volkan Cevher

Keywords: Optimization, Non-Convex Optimization

5:06

13/04/2021

Tracking regret bounds for online submodular optimization

Tatsuya Matsuoka, Shinji Ito, Naoto Ohsaka

2:10

06/12/2020

The Statistical Cost of Robust Kernel Hyperparameter Tuning

Raphael Meyer, Christopher Musco

3:22

18/07/2021

Agnostic Learning of Halfspaces with Gradient Descent via Soft Margins

Spencer Frei, Yuan Cao, Quanquan Gu

Keywords: Theory, Statistical Learning Theory

16:05

04/08/2021

Information-Theoretic Generalization Bounds for Stochastic Gradient Descent

Gergely Neu, Gintare Karolina Dziugaite, Mahdi Haghifam, Daniel M. Roy

18:01

12/07/2020

Lower Complexity Bounds for Finite-Sum Convex-Concave Minimax Optimization Problems

Guangzeng Xie, Luo Luo, Yijiang Lian, Zhihua Zhang

Keywords: Optimization - Convex

12:09

06/12/2020

Truncated Linear Regression in High Dimensions

Constantinos Daskalakis, Dhruv Rohatgi, Emmanouil Zampetakis

3:17

13/04/2021

SONIA: A symmetric blockwise truncated optimization algorithm

Majid Jahani, MohammadReza Nazari, Rachael Tappenden, Albert Berahas, Martin Takac

2:55

09/07/2020

Calibrated Surrogate Losses for Adversarially Robust Classification

Han Bao, Clayton Scott, Masashi Sugiyama

Keywords: Loss functions, Adversarial learning and robustness, Classification, Excess risk bounds and generalization error bounds, Supervised learning

14:21

06/12/2021

Regret Bounds for Gaussian-Process Optimization in Large Domains

Manuel Wuethrich, Bernhard Schölkopf, Andreas Krause

Keywords: optimization, bandits, kernel methods

13:02

06/12/2020

SLIP: Learning to predict in unknown dynamical systems with long-term memory

Paria Rashidinejad, Jiantao Jiao, Stuart Russell

Keywords: Algorithms -> Online Learning; Theory -> Learning Theory, Algorithms -> Bandit Algorithms

3:22

26/08/2020

Distributionally Robust Bayesian Optimization

Johannes Kirschner, Ilija Bogunovic, Stefanie Jegelka, Andreas Krause

14:35

18/07/2021

Implicit rate-constrained optimization of non-decomposable objectives

Abhishek Kumar, Harikrishna Narasimhan, Andrew Cotter

Keywords: Algorithms, Supervised Learning

3:48

03/05/2021

Learning to Make Decisions via Submodular Regularization

Ayya Alieva, Aiden Aceves, Jialin Song, Stephen Mayo, Yisong Yue, Yuxin Chen

5:53

18/07/2021

Lenient Regret and Good-Action Identification in Gaussian Process Bandits

Xu Cai, Selwyn Gomes, Jonathan Scarlett

Keywords: Probabilistic Methods, Gaussian Processes and Bayesian non-parametrics

5:10

06/12/2020

Fair regression via plug-in estimator and recalibration with statistical guarantees

Evgenii Chzhen, Christophe Denis, Mohamed Hebiri, Luca Oneto, Massimiliano Pontil

3:16

13/04/2021

Learning prediction intervals for regression: Generalization and calibration

Haoxian Chen, Ziyi Huang, Henry Lam, Huajie Qian, Haofeng Zhang

3:26

04/08/2021

Efficient Bandit Convex Optimization: Beyond Linear Losses

Arun Sai Suggala, Pradeep Ravikumar, Praneeth Netrapalli

20:29

06/12/2020

A convex optimization formulation for multivariate regression

Yunzhang Zhu

3:23

18/07/2021

Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning

Yaqi Duan, Chi Jin, Zhiyuan Li

Keywords: Theory, RL, Decisions and Control Theory

5:18

06/12/2021

Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge

Reda Ouhamma, Odalric-Ambrym Maillard, Vianney Perchet

Keywords: robustness, bandits

11:30

18/07/2021

Fast Stochastic Bregman Gradient Methods: Sharp Analysis and Variance Reduction

Radu Alexandru Dragomir, Mathieu Even, Hadrien Hendrikx

Keywords: Optimization, Convex Optimization

5:22

06/12/2020

Adaptive Importance Sampling for Finite-Sum Optimization and Sampling with Decreasing Step-Sizes

Ayoub El Hanchi, David Stephens

3:33

06/12/2021

Surrogate Regret Bounds for Polyhedral Losses

Rafael Frongillo, Bo Waggoner

Keywords: machine learning

15:05