Comparator-Adaptive Convex Bandits

Abstract: We study bandit convex optimization methods that adapt to the norm of the comparator, a topic that has only been studied before for its full-information counterpart. Specifically, we develop convex bandit algorithms with regret bounds that are small whenever the norm of the comparator is small. We first use techniques from the full-information setting to develop comparator-adaptive algorithms for linear bandits. Then, we extend the ideas to convex bandits with Lipschitz or smooth loss functions, using a new single-point gradient estimator and carefully designed surrogate losses.

06/12/2020

Algorithms; Algorithms -> Regression; Algorithms -> Similarity and Distance Learning; Optimization -> Combinatorial Optimizatio, Optimization

3:21

06/12/2020

Comparator-Adaptive Convex Bandits

Dirk van der Hoeven, Ashok Cutkosky, Haipeng Luo

Comments

Similar Papers

Distributionally Robust Federated Averaging

Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

Keywords Abstract Paper

An efficient nonconvex reformulation of stagewise convex optimization problems

Rudy Bunel, Oliver Hinder, Srinadh Bhojanapalli, Krishnamurthy Dvijotham

Keywords Abstract Paper

Conditional gradient methods for stochastically constrained convex minimization

Maria-Luiza Vladarean, Ahmet Alacaoglu, Ya-Ping Hsieh, Volkan Cevher

Keywords Abstract Paper

Optimal Rates for Random Order Online Optimization

Uri Sherman, Tomer Koren, Yishay Mansour

Keywords Abstract Paper

optimization, online learning

Lipschitz and Comparator-Norm Adaptivity in Online Learning

Zakaria Mhammedi, Wouter M Koolen

Keywords Abstract Paper

Online learning,

Efficient Bandit Convex Optimization: Beyond Linear Losses

Arun Sai Suggala, Pradeep Ravikumar, Praneeth Netrapalli

Keywords Abstract Paper

The Statistical Complexity of Early-Stopped Mirror Descent

Tomas Vaskevicius, Varun Kanade, Patrick Rebeschini

Keywords Abstract Paper

Algorithms; Algorithms -> Regression; Algorithms -> Similarity and Distance Learning; Optimization -> Combinatorial Optimizatio, Optimization

Better Full-Matrix Regret via Parameter-Free Online Learning

Keywords Abstract Paper

Optimal approximation for unconstrained non-submodular minimization

Marwa El Halabi, Stefanie Jegelka

Keywords Abstract Paper

Exploiting Higher Order Smoothness in Derivative-free Optimization and Continuous Bandits

Arya Akhavan, Massimiliano Pontil, Alexandre Tsybakov

Keywords Abstract Paper

Reinforcement Learning and Planning -> Reinforcement Learning, Applications -> Privacy, Anonymity, and Security

Sampling with Trusthworthy Constraints: A Variational Gradient Framework

Xingchao Liu, Xin Tong, Qiang Liu

Keywords Abstract Paper

optimization, machine learning, fairness, interpretability

Fine-grained Generalization Analysis of Vector-Valued Learning

Liang Wu, Antoine Ledent, Yunwen Lei, Marius Kloft

Keywords Abstract Paper

Provably adaptive reinforcement learning in metric spaces

Tongyi Cao, Akshay Krishnamurthy

Keywords Abstract Paper

The estimation error of general first order methods

Michael V Celentano, Andrea Montanari, Yuchen Wu

Keywords Abstract Paper

High-dimensional statistics, Computational complexity, Matrix/tensor estimation, Regression

Amortized variance reduction for doubly stochastic objective

Ayman Boustati, Sattar Vakili, James Hensman, ST John

Keywords Abstract Paper

Large-Scale Methods for Distributionally Robust Optimization

Daniel Levy, Yair Carmon, John Duchi, Aaron Sidford

Keywords Abstract Paper

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Phil Chen, Mikhal Itkina, Ransalu Senanayake, Mykel J Kochenderfer

Keywords Abstract Paper

deep learning, generative model

Low-Rank Extragradient Method for Nonsmooth and Low-Rank Matrix Optimization Problems

Atara Kaplan, Dan Garber

Keywords Abstract Paper

optimization, machine learning

Bandit Linear Control

Asaf Cassel, Tomer Koren

Keywords Abstract Paper

Information-Theoretic Generalization Bounds for Stochastic Gradient Descent

Gergely Neu, Gintare Karolina Dziugiate, Mahdi Haghifam, Daniel M. Roy

Keywords Abstract Paper

Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective

Dylan Foster, Alexander Rakhlin, David Simchi-Levi, Yunzong Xu

Keywords Abstract Paper

Can Implicit Bias Explain Generalization? Stochastic Convex Optimization as a Case Study

Assaf Dauber, Meir Feder, Tomer Koren, Roi Livni

Keywords Abstract Paper

Differentiable Segmentation of Sequences

Erik Scharwächter, Jonathan Lennartz, Emmanuel Müller

Keywords Abstract Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Qijia Jiang, Olaoluwa Adigun, Harikrishna Narasimhan and
Mahdi Milani Fard, Maya Gupta

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper