Exploration by Optimisation in Partial Monitoring

09/07/2020

Tor Lattimore, Csaba Szepesvari

Keywords: Bandit problems, Online learning

Abstract: We provide a novel algorithm for adversarial k-action d-outcome partial monitoring that is adaptive, intuitive and efficient. The highlight is that for the non-degenerate locally observable games, the n-round minimax regret is bounded by $2mk^{3/2}\sqrt{3n\log(k)}$, where m is the number of signals. This matches the best known information-theoretic upper bound derived via Bayesian minimax duality. The same algorithm also achieves near-optimal regret for full information, bandit and globally observable games. High probability bounds and simple experiments are also provided.
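As a rough illustration of how the bound quoted in the abstract scales, the following sketch simply evaluates the expression 2mk^(3/2)sqrt(3n log(k)) for given m, k and n. The function name `regret_bound` is hypothetical, and the use of the natural logarithm is an assumption; the abstract does not state the base. This is not the authors' algorithm, only arithmetic on the stated formula.

```python
import math


def regret_bound(m: int, k: int, n: int) -> float:
    """Evaluate 2 * m * k^(3/2) * sqrt(3 * n * log(k)).

    m: number of signals, k: number of actions, n: number of rounds.
    Assumption: log is the natural logarithm (base not given in the abstract).
    """
    if m < 1 or k < 2 or n < 0:
        raise ValueError("expected m >= 1, k >= 2, n >= 0")
    return 2 * m * k ** 1.5 * math.sqrt(3 * n * math.log(k))


if __name__ == "__main__":
    # Illustrative values only; they are not taken from the paper.
    for n in (100, 10_000, 1_000_000):
        bound = regret_bound(m=2, k=3, n=n)
        # The bound grows like sqrt(n), so the per-round value shrinks with n.
        print(n, round(bound, 1), round(bound / n, 4))
```

Because the expression grows like sqrt(n), quadrupling the number of rounds doubles the bound, and the bound divided by n tends to zero.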

The talk and the corresponding paper were published at the COLT 2020 virtual conference.

Similar Papers

19/08/2021

Hindsight Trust Region Policy Optimization

Hanbo Zhang, Site Bai, Xuguang Lan, David Hsu, Nanning Zheng

Keywords: Machine Learning, Deep Reinforcement Learning, Reinforcement Learning

13:14

12/07/2020

Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting

Zixin Zhong, Wang Chi Cheung, Vincent Tan

Keywords: Online Learning, Active Learning, and Bandits

12:52

18/07/2021

Randomized Algorithms for Submodular Function Maximization with a $k$-System Constraint

Shuang Cui, Kai Han, Tianshuai Zhu, Jing Tang, Benwei Wu, He Huang

Keywords: Optimization

4:48

09/07/2020

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Qiaomin Xie, Yudong Chen, Zhaoran Wang, Zhuoran Yang

Keywords: Reinforcement learning, Planning and control

15:16

06/12/2021

Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination

Dylan J Foster, Akshay Krishnamurthy

Keywords: theory, reinforcement learning and planning, bandits, online learning

19:34

09/07/2020

Efficient and robust algorithms for adversarial linear contextual bandits

Gergely Neu, Julia Olkhovskaya

Keywords: Bandit problems, Online learning

9:53

18/07/2021

Sample-Optimal PAC Learning of Halfspaces with Malicious Noise

Jie Shen

Keywords: Theory, Computational Learning Theory

4:37

18/07/2021

On Lower Bounds for Standard and Robust Gaussian Process Bandit Optimization

Xu Cai, Jonathan Scarlett

Keywords: Applications, Natural Language Processing, Applications, Network Analysis, Reinforcement Learning and Planning, Bandits

4:19

03/05/2021

Deciphering and Optimizing Multi-Task Learning: a Random Matrix Approach

Malik Tiomoko, Hafiz Tiomoko Ali, Romain Couillet

Keywords: Transfer Learning, Random Matrix Theory, Multi Task Learning

11:15

12/07/2020

Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes

Chen-Yu Wei, Mehdi Jafarnia, Haipeng Luo, Hiteshi Sharma, Rahul Jain

Keywords: Reinforcement Learning - Theory

13:40

06/12/2020

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs

Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei, Mengxiao Zhang

3:16

06/12/2021

Better Algorithms for Individually Fair $k$-Clustering

Maryam Negahbani, Deeparnab Chakrabarty

Keywords: theory, self-supervised learning, clustering, fairness

14:02

18/07/2021

Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm

Sajad Khodadadian, Zaiwei Chen, Siva Maguluri

Keywords: Reinforcement Learning and Planning

5:05

06/12/2020

Markovian Score Climbing: Variational Inference with KL(p||q)

Christian Naesseth, Fredrik Lindsten, David Blei

2:30

06/12/2020

Truncated Linear Regression in High Dimensions

Constantinos Daskalakis, Dhruv Rohatgi, Emmanouil Zampetakis

3:17

06/12/2020

Extrapolation Towards Imaginary 0-Nearest Neighbour and Its Improved Convergence Rate

Akifumi Okuno, Hidetoshi Shimodaira

3:14

14/06/2020

Select to Better Learn: Fast and Accurate Deep Learning Using Data Selection From Nonlinear Manifolds

Mohsen Joneidi, Saeed Vahidian, Ashkan Esmaeili, Weijia Wang, Nazanin Rahnavard, Bill Lin, Mubarak Shah

Keywords: data subset selection, spectrum pursuit, open-set identification, few shot classification, generative adversarial networks and deep learning

1:00

06/12/2020

Consistent Plug-in Classifiers for Complex Objectives and Constraints

Shiv Tavker, Harish Guruprasad Ramaswamy, Harikrishna Narasimhan

3:16

06/12/2020

Robust, Accurate Stochastic Optimization for Variational Inference

Akash Kumar Dhaka, Alejandro Catalina, Michael Andersen, Måns Magnusson, Jonathan Huggins, Aki Vehtari

3:23

09/07/2020

Estimating Principal Components under Adversarial Perturbations

Pranjal Awasthi, Xue Chen, Aravindan Vijayaraghavan

Keywords: Unsupervised and semi-supervised learning, Adversarial learning and robustness

15:40

06/12/2021

A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms

Anand Kalvit, Assaf Zeevi

Keywords: bandits

15:13

06/12/2021

Bayesian decision-making under misspecified priors with applications to meta-learning

Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy, Daniel Hsu, Thodoris Lykouris, Miro Dudik, Robert E Schapire

Keywords: meta learning, bandits

14:58

18/07/2021

Finding the Stochastic Shortest Path with Low Regret: the Adversarial Cost and Unknown Transition Case

Liyu Chen, Haipeng Luo

Keywords: Theory, RL, Decisions and Control Theory

5:08

06/12/2020

Latent Bandits Revisited

Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Craig Boutilier

3:11

04/08/2021

Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon

Zihan Zhang, Xiangyang Ji, Simon Du

12:37

12/07/2020

Minimally distorted Adversarial Examples with a Fast Adaptive Boundary Attack

Francesco Croce, Matthias Hein

Keywords: Adversarial Examples

15:12

06/12/2021

Fast Algorithms for $L_\infty$-constrained S-rectangular Robust MDPs

Bahram Behzadian, Marek Petrik, Chin Pang Ho

Keywords: optimization, reinforcement learning and planning

6:14

26/08/2020

'Bring Your Own Greedy'+Max: Near-Optimal 1/2-Approximations for Submodular Knapsack

Grigory Yaroslavtsev, Samson Zhou, Dmitrii Avdiukhin

13:14

18/07/2021

PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees

Jonas Rothfuss, Vincent Fortuin, Martin Josifoski, Andreas Krause

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

5:46

06/12/2021

Linear-Time Probabilistic Solution of Boundary Value Problems

Nicholas Krämer, Philipp Hennig

Keywords: kernel methods

2:01

03/05/2021

Geometry-Aware Gradient Algorithms for Neural Architecture Search

Liam Li, Misha Khodak, Nina Balcan, Ameet Talwalkar

Keywords: weight-sharing, neural architecture search, optimization, automated machine learning

12:16

06/12/2020

Spike and slab variational Bayes for high dimensional logistic regression

Kolyan Ray, Botond Szabo, Gabriel Clara

3:16

18/07/2021

Robust Pure Exploration in Linear Bandits with Limited Budget

Ayya Alieva, Ashok Cutkosky, Abhimanyu Das

Keywords: Algorithms, Adversarial Learning, Algorithms, Unsupervised Learning, Reinforcement Learning and Planning, Bandits

6:02

22/11/2021

Noisy Differentiable Architecture Search

Xiangxiang Chu, Bo Zhang

Keywords: Neural architecture search, AutoML

2:30

18/07/2021

Sparse Bayesian Learning via Stepwise Regression

Sebastian Ament, Carla Gomes

Keywords: Algorithms, Sparsity and Compressed Sensing

5:17

18/07/2021

Provably Efficient Algorithms for Multi-Objective Competitive RL

Tiancheng Yu, Yi Tian, Jingzhao Zhang, Suvrit Sra

Keywords: Theory, RL, Decisions and Control Theory

17:04

13/04/2021

Experimental design for regret minimization in linear bandits

Andrew Wagenmaker, Julian Katz-Samuels, Kevin Jamieson

3:05

06/12/2021

Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

Gaon An, Seungyong Moon, Jang-Hyun Kim, Hyun Oh Song

Keywords: deep learning, reinforcement learning and planning

13:50

12/07/2020

Structure Adaptive Algorithms for Stochastic Bandits

Rémy Degenne, Han Shao, Wouter Koolen

Keywords: Online Learning, Active Learning, and Bandits

16:05

13/04/2021

Non-asymptotic performance guarantees for neural estimation of f-divergences

Sreejith Sreekumar, Zhengxin Zhang, Ziv Goldfeld

3:02