Self-concordant analysis of generalized linear bandits with forgetting

13/04/2021

Self-concordant analysis of generalized linear bandits with forgetting

Yoan Russac, Louis Faury, Olivier Cappé, Aurélien Garivier

Keywords:

Abstract Paper Similar Papers

Abstract: Contextual sequential decision problems with categorical or numerical observations are ubiquitous and Generalized Linear Bandits (GLB) offer a solid theoretical framework to address them. In contrast to the case of linear bandits, existing algorithms for GLB have two drawbacks undermining their applicability. First, they rely on excessively pessimistic concentration bounds due to the non-linear nature of the model. Second, they require either non-convex projection steps or burn-in phases to enforce boundedness of the estimators. Both of these issues are worsened when considering non-stationary models, in which the GLB parameter may vary with time. In this work, we focus on self-concordant GLB (which include logistic and Poisson regression) with forgetting achieved either by the use of a sliding window or exponential weights. We propose a novel confidence-based algorithm for the maximum-likehood estimator with forgetting and analyze its perfomance in abruptly changing environments. These results as well as the accompanying numerical simulations highlight the potential of the proposed approach to address non-stationarity in GLB.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AISTATS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

Self-Modulating Nonparametric Event-Tensor Factorization

Zheng Wang, Xinqi Chu, Shandian Zhe

Keywords Paper

General Machine Learning Techniques

0

0

0

0

15:40

13/04/2021

Learning GPLVM with arbitrary kernels using the unscented transformation

Daniel Souza, Diego Mesquita, João Paulo Gomes, César Lincoln Mattos

Keywords Paper

0

0

0

0

2:59

26/08/2020

Distributionally Robust Bayesian Optimization

Johannes Kirschner, Ilija Bogunovic, Stefanie Jegelka, Andreas Krause

Keywords Paper

0

0

0

0

14:35

06/12/2021

Learning to Select Exogenous Events for Marked Temporal Point Process

Ping Zhang, Rishabh Iyer, Ashish Tendulkar and
Gaurav Aggarwal, Abir De

Keywords Paper

0

0

0

0

12:27

02/02/2021

Learning Compositional Sparse Gaussian Processes with a Shrinkage Prior

Anh Tong, Toan M Tran, Hung Bui, Jaesik Choi

Keywords Paper

0

0

0

0

18:06

12/07/2020

Implicit differentiation of Lasso-type models for hyperparameter optimization

Quentin Bertrand, Quentin Klopfenstein, Mathieu Blondel and
Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

Keywords Paper

Optimization - General

0

0

0

0

16:18

06/12/2021

Twice regularized MDPs and the equivalence between robustness and regularization

Esther Derman, Matthieu Geist, Shie Mannor

Keywords Paper

optimization, reinforcement learning and planning, robustness

0

0

0

0

14:19

18/07/2021

Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization

Wes Chung, Valentin Thomas, Marlos C. Machado, Nicolas Le Roux

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

5:13

06/12/2021

Adversarial Robustness with Semi-Infinite Constrained Learning

Alexander Robey, Luiz Chamon, George J. Pappas and
Hamed Hassani, Alejandro Ribeiro

Keywords Paper

theory, deep learning, optimization, robustness, adversarial robustness and security

0

0

0

0

14:48

13/04/2021

Multi-fidelity high-order gaussian processes for physical simulation

Zheng Wang, Wei Xing, Robert Kirby, Shandian Zhe

Keywords Paper

0

0

0

0

3:34

06/12/2020

Interior Point Solving for LP-based prediction+optimisation

Jayanta Mandi, Tias Guns

Keywords Paper

0

0

0

1

3:28

18/07/2021

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Kilian Fatras, Thibault Séjourné, Rémi Flamary, Nicolas Courty

Keywords Paper

Algorithms, Optimal Transport

0

0

0

2

4:57

06/12/2020

NeuMiss networks: differentiable programming for supervised learning with missing values.

Marine Le Morvan, Julie Josse, Thomas Moreau and
Erwan Scornet, Gael Varoquaux

Keywords Paper

0

0

0

0

3:20

09/07/2020

Data-driven confidence bands for distributed nonparametric regression

Valeriy Avanesov

Keywords Paper

Kernel methods, Excess risk bounds and generalization error bounds, Regression, Sampling algorithms, Supervised learning

0

0

0

0

14:42

06/12/2021

Non-convex Distributionally Robust Optimization: Non-asymptotic Analysis

Jikai Jin, Bohang Zhang, Haiyang Wang, Liwei Wang

Keywords Paper

optimization

0

0

0

0

14:05

06/12/2021

STORM+: Fully Adaptive SGD with Recursive Momentum for Nonconvex Optimization

Kfir Levy, Ali Kavis, Volkan Cevher

Keywords Paper

optimization

0

0

0

0

12:23

06/12/2020

An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits

Andrea Tirinzoni, Matteo Pirotta, Marcello Restelli, Alessandro Lazaric

Keywords Paper

0

0

0

0

3:13

04/08/2021

SGD Generalizes Better Than GD (And Regularization Doesn't Help)

Idan Amir, Tomer Koren, Roi Livni

Keywords Paper

0

0

0

0

15:53

06/12/2020

Hamiltonian Monte Carlo using an adjoint-differentiated Laplace approximation: Bayesian inference for latent Gaussian models and beyond

Charles Margossian, Aki Vehtari, Daniel Simpson, Raj Agrawal

Keywords Paper

0

0

0

0

3:05

14/09/2020

Escaping Saddle Points of Empirical Risk Privately and Scalably via DP-Trust Region Method

Di Wang, Jinhui Xu

Keywords Paper

differential privacy, empirical risk minimization, private machine learning

0

0

0

0

15:13

06/12/2020

Adapting to Misspecification in Contextual Bandits

Dylan Foster, Claudio Gentile, Mehryar Mohri, Julian Zimmert

Keywords Paper

0

0

0

0

3:05

02/02/2021

Improved Penalty Method via Doubly Stochastic Gradients for Bilevel Hyperparameter Optimization

Wanli Shi, Bin Gu

Keywords Paper

0

0

0

0

14:47

26/04/2020

Short and Sparse Deconvolution --- A Geometric Approach

Yenson Lau, Qing Qu, Han-Wen Kuo and
Pengcheng Zhou, Yuqian Zhang, John Wright

Keywords Paper

0

0

0

0

7:18

06/12/2021

Misspecified Gaussian Process Bandit Optimization

Ilija Bogunovic, Andreas Krause

Keywords Paper

optimization, bandits, kernel methods

0

0

0

0

11:41

12/07/2020

Rate-distortion optimization guided autoencoder for isometric embedding in Euclidean latent space

Keizo Kato, Jing Zhou, Tomotake Sasaki, Akira Nakagawa

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

14:48

26/08/2020

Corruption-Tolerant Gaussian Process Bandit Optimization

Ilija Bogunovic, Andreas Krause, Jonathan Scarlett

Keywords Paper

0

0

0

0

12:01

06/12/2021

Bayesian decision-making under misspecified priors with applications to meta-learning

Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy and
Daniel Hsu, Thodoris Lykouris, Miro Dudik, Robert E Schapire

Keywords Paper

meta learning, bandits

0

0

0

0

14:58

06/12/2020

One Ring to Rule Them All: Certifiably Robust Geometric Perception with Outliers

Heng Yang, Luca Carlone

Keywords Paper

0

0

0

0

3:24

06/12/2021

Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity

Kaiqing Zhang, Xiangyuan Zhang, Bin Hu, Tamer Basar

Keywords Paper

theory, optimization, reinforcement learning and planning

0

0

0

0

15:57

06/12/2021

Rectangular Flows for Manifold Learning

Anthony Caterini, Gabriel Loaiza-Ganem, Geoff Pleiss, John Cunningham

Keywords Paper

deep learning, optimization, generative model

0

0

0

0

12:26

08/07/2020

On Skolem-hardness and saturation points in Markov decision processes

Jakob Piribauer and Christel Baier

Keywords Paper

Markov decision process, Skolem problem, stochastic shortest path, conditional expectation, conditional value-at-risk, model checking, frequency-LTL

0

0

0

0

26:13

06/12/2020

Matrix Completion with Quantified Uncertainty through Low Rank Gaussian Copula

Yuxuan Zhao, Madeleine Udell

Keywords Paper

0

0

0

0

3:21

18/07/2021

Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction

Afsaneh Mastouri, Yuchen Zhu, Limor Gultchin and
Anna Korba, Ricardo Silva, Matt J. Kusner, Arthur Gretton, Krikamol Muandet

Keywords Paper

Algorithms, Kernel Methods

0

0

0

0

5:10

06/12/2020

An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits

Julian Katz-Samuels, Lalit Jain, zohar karnin, Kevin Jamieson

Keywords Paper

0

0

0

0

3:20

06/12/2021

Self-Adaptable Point Processes with Nonparametric Time Decays

Zhimeng Pan, Zheng Wang, Jeff M Phillips, Shandian Zhe

Keywords Paper

deep learning, kernel methods

0

0

0

0

10:01

12/07/2020

Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound

Lin Yang, Mengdi Wang

Keywords Paper

Reinforcement Learning - Theory

0

0

0

0

15:14

12/07/2020

dS^2LBI: Exploring Structural Sparsity on Deep Network via Differential Inclusion Paths

Yanwei Fu, Chen Liu, Donghao Li and
Xinwei Sun, Jinshan ZENG, Yuan Yao

Keywords Paper

Deep Learning - Algorithms

0

0

0

1

12:45

06/12/2021

Calibration and Consistency of Adversarial Surrogate Losses

Pranjal Awasthi, Natalie Frank, Anqi Mao and
Mehryar Mohri, Yutao Zhong

Keywords Paper

theory, optimization, machine learning, robustness, adversarial robustness and security

0

0

0

0

13:30

18/07/2021

Matrix Completion with Model-free Weighting

Jiayi Wang, Raymond K. W. Wong, Xiaojun Mao, Kwun Chuen Gary Chan

Keywords Paper

Probabilistic Methods

0

0

0

0

5:14

02/02/2021

Stabilizing Q Learning Via Soft Mellowmax Operator

Yaozhong Gan, Zhe Zhang, Xiaoyang Tan

Keywords Paper

0

0

0

0

18:39