Non-Exponentially Weighted Aggregation: Regret Bounds for Unbounded Loss Functions

Abstract: We tackle the problem of online optimization with a general, possibly unbounded, loss function. It is well known that when the loss is bounded, the exponentially weighted aggregation strategy (EWA) leads to a regret of order $\sqrt{T}$ after $T$ steps. In this paper, we study a generalized aggregation strategy, where the weights no longer depend exponentially on the losses. Our strategy is based on Follow The Regularized Leader (FTRL): we minimize the expected losses plus a regularizer, which here is a $\phi$-divergence. When the regularizer is the Kullback-Leibler divergence, we obtain EWA as a special case. Using alternative divergences enables unbounded losses, at the cost of a worse regret bound in some cases.
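
To make the FTRL step in the abstract concrete, here is a minimal Python sketch under simplifying assumptions that are not taken from the paper: a finite set of experts, a prior `prior` over them, a fixed learning rate `eta`, and the divergence written as $D_\phi(p\|\pi) = \sum_i \pi_i \phi(p_i/\pi_i)$. The function names `ewa_weights` and `chi2_weights` are hypothetical. With the Kullback-Leibler regularizer the minimizer has the usual exponential form; with the chi-square choice $\phi(x) = x^2 - 1$ the weights become a truncated linear function of the cumulative losses, and the normalizing constant is found here by bisection.

```python
import math


def ewa_weights(cum_losses, prior, eta):
    """KL regularizer: p_i proportional to prior_i * exp(-eta * L_i)."""
    shift = min(cum_losses)  # subtract the minimum for numerical stability
    w = [p * math.exp(-eta * (L - shift)) for p, L in zip(prior, cum_losses)]
    total = sum(w)
    return [x / total for x in w]


def chi2_weights(cum_losses, prior, eta, iters=100):
    """Chi-square regularizer phi(x) = x^2 - 1 (an illustrative choice).

    The KKT conditions of
        min_p  sum_i p_i * L_i + (1/eta) * sum_i prior_i * phi(p_i / prior_i)
    over the simplex give p_i = prior_i * (eta / 2) * max(lam - L_i, 0),
    where lam is chosen so that the weights sum to one.
    """
    def mass(lam):
        return sum(p * 0.5 * eta * max(lam - L, 0.0)
                   for p, L in zip(prior, cum_losses))

    lo = min(cum_losses)              # mass(lo) == 0
    hi = max(cum_losses) + 2.0 / eta  # mass(hi) >= 1 since the prior sums to 1
    for _ in range(iters):            # bisection on the monotone function mass
        mid = 0.5 * (lo + hi)
        if mass(mid) < 1.0:
            lo = mid
        else:
            hi = mid
    w = [p * 0.5 * eta * max(hi - L, 0.0) for p, L in zip(prior, cum_losses)]
    total = sum(w)
    return [x / total for x in w]


if __name__ == "__main__":
    prior = [0.5, 0.5]
    print(ewa_weights([0.0, 1.0], prior, eta=1.0))
    print(chi2_weights([0.0, 1.0], prior, eta=1.0))
    print(chi2_weights([0.0, 10.0], prior, eta=1.0))
```

In this toy setting, cumulative losses `[0.0, 1.0]` with a uniform prior and `eta = 1.0` give chi-square weights of approximately `[0.625, 0.375]`, while losses `[0.0, 10.0]` put all the mass on the first expert. The weights depend polynomially rather than exponentially on the losses, which is the behaviour the abstract refers to.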

06/12/2020

Pierre Alquier

Similar Papers

Dynamic Regret of Convex and Smooth Functions

Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou

A Primal-Dual Online Algorithm for Online Matching Problem in Dynamic Environments

Yu-Hang Zhou, Peng Hu, Chen Liang, Huan Xu, Guangda Huzhang, Yinfu Feng, Qing Da, Xinshang Wang, An-Xiang Zeng

Tracking regret bounds for online submodular optimization

Tatsuya Matsuoka, Shinji Ito, Naoto Ohsaka

Online Continuous DR-Submodular Maximization with Long-Term Budget Constraints

Omid Sadeghi, Maryam Fazel

Learning piecewise Lipschitz functions in changing environments

Dravyansh Sharma, Maria-Florina Balcan, Travis Dick

Efficient improper learning for online logistic regression

Pierre Gaillard, Rémi Jézéquel, Alessandro Rudi

Keywords: Online learning, Classification, Convex optimization

Learning-to-learn non-convex piecewise-Lipschitz functions

Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, Ameet S Talwalkar

Keywords: optimization, machine learning, robustness, meta learning, online learning

Best-case lower bounds in online learning

Cristóbal Guzmán, Nishant Mehta, Ali Mortazavi

Keywords: theory, optimization, online learning, fairness

No-Regret Prediction in Marginally Stable Systems

Udaya Ghai, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang

Keywords: Online learning, Planning and control

On Online Optimization: Dynamic Regret Analysis of Strongly Convex and Smooth Problems

Ting-Jui Chang, Shahin Shahrampour

Projection-free Online Learning over Strongly Convex Sets

Yuanyu Wan, Lijun Zhang

Policy Optimization as Online Learning with Mediator Feedback

Alberto Maria Metelli, Matteo Papini, Pierluca D'Oro, Marcello Restelli

Online Optimal Control with Affine Constraints

Yingying Li, Subhro Das, Na Li

Logistic Regression Regret: What’s the Catch?

Gil I Shamir

Keywords: Online learning, Convex optimization, Information theory, Regression

Tight First- and Second-Order Regret Bounds for Adversarial Linear Bandits

Shinji Ito, Shuichi Hirahara, Tasuku Soma, Yuichi Yoshida

Efficient Bandit Convex Optimization: Beyond Linear Losses

Arun Sai Suggala, Pradeep Ravikumar, Praneeth Netrapalli

Regret Bounds for Gaussian-Process Optimization in Large Domains

Manuel Wuethrich, Bernhard Schölkopf, Andreas Krause

Keywords: optimization, bandits, kernel methods

Variational Bayesian Optimistic Sampling

Brendan O'Donoghue, Tor Lattimore

Keywords: optimization, reinforcement learning and planning, generative model, bandits, online learning

Improved Algorithms for Online Submodular Maximization via First-order Regret Bounds

Nick Harvey, Christopher Liaw, Tasuku Soma

Projection-free Online Learning in Dynamic Environments

Yuanyu Wan, Bo Xue, Lijun Zhang

Surrogate Regret Bounds for Polyhedral Losses

Rafael Frongillo, Bo Waggoner

Keywords: machine learning

Finite Regret and Cycles with Fixed Step-Size via Alternating Gradient Descent-Ascent

James P Bailey, Gauthier Gidel, Georgios Piliouras

Keywords: Economics, game theory, and incentives, Online learning

Logarithmic Regret for Online Control with Adversarial Noise
