Online Learning in Contextual Bandits using Gated Linear Networks

06/12/2020

Online Learning in Contextual Bandits using Gated Linear Networks

Eren Sezener, Marcus Hutter, David Budden, Jianan Wang, Joel Veness

Keywords:

Abstract Paper Similar Papers

Abstract: We introduce a new and completely online contextual bandit algorithm called Gated Linear Contextual Bandits (GLCB). This algorithm is based on Gated Linear Networks (GLNs), a recently introduced deep learning architecture with properties well-suited to the online setting. Leveraging data-dependent gating properties of the GLN we are able to estimate prediction uncertainty with effectively zero algorithmic overhead. We empirically evaluate GLCB compared to 9 state-of-the-art algorithms that leverage deep neural networks, on a standard benchmark suite of discrete and continuous contextual bandit problems. GLCB obtains mean first-place despite being the only online method, and we further support these results with a theoretical study of its convergence properties.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games

Darren Lin, Zhengyuan Zhou, Panayotis Mertikopoulos, Michael Jordan

Keywords Paper

Learning Theory

1

1

0

0

12:31

26/04/2020

Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data

Sergei Popov, Stanislav Morozov, Artem Babenko

Keywords Paper

tabular data, architectures, DNN

0

0

0

0

5:05

06/12/2021

Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination

Dylan J Foster, Akshay Krishnamurthy

Keywords Paper

theory, reinforcement learning and planning, bandits, online learning

0

0

0

0

19:34

06/12/2020

Gaussian Gated Linear Networks

David Budden, Adam Marblestone, Eren Sezener and
Tor Lattimore, Greg Wayne, Joel Veness

Keywords Paper

0

0

0

0

3:28

02/02/2021

A Recipe for Global Convergence Guarantee in Deep Neural Networks

Kenji Kawaguchi, Qingyun Sun

Keywords Paper

0

0

0

0

17:15

26/04/2020

Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks

Wei Hu, Lechao Xiao, Jeffrey Pennington

Keywords Paper

deep learning theory, non-convex optimization, orthogonal initialization

0

0

0

0

5:10

12/07/2020

dS^2LBI: Exploring Structural Sparsity on Deep Network via Differential Inclusion Paths

Yanwei Fu, Chen Liu, Donghao Li and
Xinwei Sun, Jinshan ZENG, Yuan Yao

Keywords Paper

Deep Learning - Algorithms

0

0

0

1

12:45

06/12/2021

Neural Active Learning with Performance Guarantees

Zhilei Wang, Pranjal Awasthi, Christoph Dann and
Ayush Sekhari, Claudio Gentile

Keywords Paper

deep learning, active learning

0

0

0

0

10:43

06/12/2021

Hyperparameter Tuning is All You Need for LISTA

Xiaohan Chen, Jialin Liu, Zhangyang Wang, Wotao Yin

Keywords Paper

deep learning

0

0

0

0

15:05

03/05/2021

Linear Convergent Decentralized Optimization with Compression

Xiaorui Liu, Yao Li, Rongrong Wang and
Jiliang Tang, Ming Yan

Keywords Paper

Decentralized Optimization, Heterogeneous data, Linear Convergence, Communication Compression

0

0

0

0

5:20

06/12/2020

Zap Q-Learning With Nonlinear Function Approximation

Shuhang Chen, Adithya M Devraj, Fan Lu and
Ana Busic, Sean Meyn

Keywords Paper

0

0

0

0

2:47

18/07/2021

Online Limited Memory Neural-Linear Bandits with Likelihood Matching

Ofir Nabati, Tom Zahavy, Shie Mannor

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

5:08

06/12/2021

Generalization Bounds For Meta-Learning: An Information-Theoretic Analysis

Qi CHEN, Changjian Shui, Mario Marchand

Keywords Paper

deep learning, meta learning, few shot learning

0

0

0

0

11:45

13/04/2021

Stability and differential privacy of stochastic gradient descent for pairwise learning with non-smooth loss

Zhenhuan Yang, Yunwen Lei, Siwei Lyu, Yiming Ying

Keywords Paper

0

0

0

0

2:59

12/07/2020

Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles

Dylan Foster, Alexander Rakhlin

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

14:39

06/12/2020

A Contour Stochastic Gradient Langevin Dynamics Algorithm for Simulations of Multi-modal Distributions

Wei Deng, Guang Lin, Faming Liang

Keywords Paper

0

0

0

0

3:26

06/12/2021

EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization

Ondrej Bohdal, Yongxin Yang, Timothy Hospedales

Keywords Paper

deep learning, optimization, graph learning, meta learning, few shot learning

0

0

0

0

14:09

13/04/2021

Convergence and accuracy trade-offs in federated learning and meta-learning

Zachary Charles, Jakub Konečný

Keywords Paper

0

0

0

0

3:04

06/12/2021

Sparse Deep Learning: A New Framework Immune to Local Traps and Miscalibration

Yan Sun, Wenjun Xiong, Faming Liang

Keywords Paper

deep learning, machine learning

0

0

0

0

10:23

12/07/2020

Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case

shuai zhang, Meng Wang, Sijia Liu and
Pin-Yu Chen, Jinjun Xiong

Keywords Paper

Learning Theory

0

0

0

0

15:06

18/07/2021

The Heavy-Tail Phenomenon in SGD

Mert Gurbuzbalaban, Umut Simsekli, Lingjiong Zhu

Keywords Paper

Optimization, Stochastic Optimization

0

0

0

0

5:37

12/07/2020

Information-Theoretic Local Minima Characterization and Regularization

Zhiwei Jia, Hao Su

Keywords Paper

Deep Learning - Theory

0

0

0

0

14:11

26/08/2020

Bayesian Reinforcement Learning via Deep, Sparse Sampling

Divya Grover, Debabrota Basu, Christos Dimitrakakis

Keywords Paper

0

0

0

0

15:44

03/05/2021

Global Convergence of Three-layer Neural Networks in the Mean Field Regime

Huy Tuan Pham, Phan-Minh Nguyen

Keywords Paper

deep learning theory

0

0

0

0

15:41

12/07/2020

A Mean Field Analysis Of Deep ResNet And Beyond: Towards Provably Optimization Via Overparameterization From Depth

Yiping Lu, Chao Ma, Yulong Lu and
Jianfeng Lu, Lexing Ying

Keywords Paper

Deep Learning - Theory

0

0

0

0

4:37

06/12/2021

NTopo: Mesh-free Topology Optimization using Implicit Neural Representations

Jonas Zehnder, Yue Li, Stelian Coros, Bernhard Thomaszewski

Keywords Paper

deep learning, optimization, machine learning, self-supervised learning, representation learning

0

0

0

0

9:24

06/12/2021

Adaptive Proximal Gradient Methods for Structured Neural Networks

Jihun Yun, Aurelie Lozano, Eunho Yang

Keywords Paper

deep learning, optimization, machine learning

0

0

0

0

10:46

14/06/2020

MAST: A Memory-Augmented Self-Supervised Tracker

Zihang Lai, Erika Lu, Weidi Xie

Keywords Paper

self-supervised learning, video segmentation, memory-augmented model, video understanding, tracking, unsupervised learning, generalization, attention, representation learning, metric learning

0

0

0

0

1:01

06/12/2020

Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach

Luofeng Liao, You-Lin Chen, Zhuoran Yang and
Bo Dai, Mladen Kolar, Zhaoran Wang

Keywords Paper

Theory -> Information Theory, Algorithms -> Stochastic Methods

0

0

0

0

3:23

02/02/2021

Gated Linear Networks

Joel Veness, Tor Lattimore, David Budden and
Avishkar Bhoopchand, Christopher Mattern, Agnieszka Grabska-Barwinska, Eren Sezener, Jianan Wang, Peter Toth, Simon Schmitt, Marcus Hutter

Keywords Paper

0

0

0

0

19:07

06/12/2020

Efficient Variational Inference for Sparse Deep Learning with Theoretical Guarantee

Jincheng Bai, Qifan Song, Guang Cheng

Keywords Paper

0

0

0

0

3:11

26/04/2020

Learning Space Partitions for Nearest Neighbor Search

Yihe Dong, Piotr Indyk, Ilya Razenshteyn, Tal Wagner

Keywords Paper

space partition, lsh, locality sensitive hashing, nearest neighbor search

0

0

0

0

5:17

06/12/2021

Asynchronous Decentralized SGD with Quantized and Local Updates

Giorgi Nadiradze, Amirmojtaba Sabour, Peter Davies and
Shigang Li, Dan Alistarh

Keywords Paper

optimization, machine learning, graph learning

0

0

0

0

12:37

26/08/2020

Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction

Boyue Li, Shicong Cen, Yuxin Chen, Yuejie Chi

Keywords Paper

0

0

0

0

13:50

02/02/2021

Learning Graph Neural Networks with Approximate Gradient Descent

Qunwei Li, Shaofeng Zou, Wenliang Zhong

Keywords Paper

0

0

0

0

19:10

03/05/2021

Federated Learning Based on Dynamic Regularization

Durmus Alp Emre Acar, Yue Zhao, Ramon Matas and
Matthew Mattina, Paul Whatmough, Venkatesh Saligrama

Keywords Paper

Distributed Optimization, Deep Neural Networks, Federated Learning

1

0

0

0

17:21

06/12/2020

Pruning neural networks without any data by iteratively conserving synaptic flow

Hidenori Tanaka, Daniel Kunin, Daniel Yamins, Surya Ganguli

Keywords Paper

Deep Learning -> Optimization for Deep Networks; Optimization -> Non-Convex Optimization, Theory

1

0

0

0

3:19

18/07/2021

A Hybrid Variance-Reduced Method for Decentralized Stochastic Non-Convex Optimization

Ran Xin, Usman Khan, Soummya Kar

Keywords Paper

Optimization, Distributed and Parallel Optimization

0

0

0

0

5:10

26/04/2020

Continual Learning with Bayesian Neural Networks for Non-Stationary Data

Richard Kurle, Botond Cseke, Alexej Klushyn and
Patrick van der Smagt, Stephan Günnemann

Keywords Paper

Continual Learning, Online Variational Bayes, Non-Stationary Data, Bayesian Neural Networks, Variational Inference, Lifelong Learning, Concept Drift, Episodic Memory

0

0

0

0

5:26

13/04/2021

Adaptive approximate policy iteration

Botao Hao, Nevena Lazic, Yasin Abbasi-Yadkori and
Pooria Joulani, Csaba Szepesvari

Keywords Paper

0

0

0

0

3:01