Fast Pure Exploration via Frank-Wolfe

06/12/2021

Fast Pure Exploration via Frank-Wolfe

Po-An Wang, Ruo-Chun Tzeng, Alexandre Proutiere

Keywords: theory, optimization, reinforcement learning and planning, bandits

Abstract Paper Similar Papers

Abstract: We study the problem of active pure exploration with fixed confidence in generic stochastic bandit environments. The goal of the learner is to answer a query about the environment with a given level of certainty while minimizing her sampling budget. For this problem, instance-specific lower bounds on the expected sample complexity reveal the optimal proportions of arm draws an Oracle algorithm would apply. These proportions solve an optimization problem whose tractability strongly depends on the structural properties of the environment, but may be instrumental in the design of efficient learning algorithms. We devise Frank-Wolfe-based Sampling (FWS), a simple algorithm whose sample complexity matches the lower bounds for a wide class of pure exploration problems. The algorithm is computationally efficient as, to learn and track the optimal proportion of arm draws, it relies on a single iteration of Frank-Wolfe algorithm applied to the lower-bound optimization problem. We apply FWS to various pure exploration tasks, including best arm identification in unstructured, thresholded, linear, and Lipschitz bandits. Despite its simplicity, FWS is competitive compared to state-of-art algorithms.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks

Rong Zhu, Mattia Rigotti

Keywords Paper

theory, deep learning, reinforcement learning and planning, bandits

0

0

0

0

8:45

18/07/2021

Robust Pure Exploration in Linear Bandits with Limited Budget

Ayya Alieva, Ashok Cutkosky, Abhimanyu Das

Keywords Paper

Algorithms, Adversarial Learning, Algorithms, Unsupervised Learning, Reinforcement Learning and Planning, Bandits

0

0

0

0

6:02

06/12/2020

An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits

Julian Katz-Samuels, Lalit Jain, zohar karnin, Kevin Jamieson

Keywords Paper

0

0

0

0

3:20

06/12/2020

Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration

Andrea Zanette, Alessandro Lazaric, Mykel J Kochenderfer, Emma Brunskill

Keywords Paper

0

0

0

0

3:11

18/07/2021

On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game

Shuang Qiu, Jieping Ye, Zhaoran Wang, Zhuoran Yang

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

11:19

06/12/2021

Pure Exploration in Kernel and Neural Bandits

Yinglun Zhu, Dongruo Zhou, Ruoxi Jiang and
Quanquan Gu, Rebecca Willett, Robert Nowak

Keywords Paper

theory, deep learning, reinforcement learning and planning, bandits, representation learning

0

0

0

0

14:47

18/07/2021

Learning Online Algorithms with Distributional Advice

Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos and
Ali Vakilian, Nikos Zarifis

Keywords Paper

Algorithms

0

0

0

0

5:45

18/07/2021

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL

Seyed Kamyar Seyed Ghasemipour, Dale Schuurmans, Shixiang Gu

Keywords Paper

Reinforcement Learning and Planning

0

0

0

1

5:54

18/07/2021

Task-Optimal Exploration in Linear Dynamical Systems

Andrew Wagenmaker, Max Simchowitz, Kevin Jamieson

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

19:33

13/04/2021

Experimental design for regret minimization in linear bandits

Andrew Wagenmaker, Julian Katz-Samuels, Kevin Jamieson

Keywords Paper

0

0

0

0

3:05

06/12/2021

Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization

Yusuke Iwasawa, Yutaka Matsuo

Keywords Paper

deep learning, optimization, transformers, domain adaptation

0

0

0

0

13:50

06/12/2021

Combinatorial Pure Exploration with Bottleneck Reward Function

Yihan Du, Yuko Kuroki, Wei Chen

Keywords Paper

theory, reinforcement learning and planning, bandits

0

0

0

0

11:53

12/07/2020

Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles

Dylan Foster, Alexander Rakhlin

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

14:39

12/07/2020

Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning

Dipendra Misra, Mikael Henaff, Akshay Krishnamurthy, John Langford

Keywords Paper

Reinforcement Learning - Theory

0

0

0

0

15:43

18/07/2021

Improved Algorithms for Agnostic Pool-based Active Classification

Julian Katz-Samuels, Jifan Zhang, Lalit Jain, Kevin Jamieson

Keywords Paper

Algorithms, Active Learning

0

0

0

0

5:18

26/04/2020

Sign-OPT: A Query-Efficient Hard-label Adversarial Attack

Minhao Cheng, Simranjit Singh, Patrick H. Chen and
Pin-Yu Chen, Sijia Liu, Cho-Jui Hsieh

Keywords Paper

0

0

0

0

4:56

18/07/2021

Bayesian Deep Learning via Subnetwork Inference

Erik Daxberger, Eric Nalisnick, James Allingham and
Javier Antorán, Jose Miguel Hernandez-Lobato

Keywords Paper

, Reinforcement Learning and Planning, Multi-Agent RL, Deep Learning, Bayesian Deep Learning

0

0

0

0

5:18

12/07/2020

Adaptive Sampling for Estimating Probability Distributions

Shubhanshu Shekhar, Tara Javidi, Mohammad Ghavamzadeh

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

15:11

06/12/2020

Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for Smooth Minimax Games

Arun Suggala, Praneeth Netrapalli

Keywords Paper

1

1

0

0

3:29

04/08/2021

Adversarially Robust Low Dimensional Representations

Pranjal Awasthi, Vaggos Chatziafratis, Xue Chen, Aravindan Vijayaraghavan

Keywords Paper

0

0

0

0

20:19

06/12/2021

Mean-based Best Arm Identification in Stochastic Bandits under Reward Contamination

Arpan Mukherjee, Ali Tajer, Pin-Yu Chen, Payel Das

Keywords Paper

theory, bandits

0

0

0

0

15:07

18/07/2021

Density Constrained Reinforcement Learning

Zengyi Qin, Yuxiao Chen, Chuchu Fan

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

4:50

06/12/2021

Conditioning Sparse Variational Gaussian Processes for Online Decision-making

Wesley Maddox, Samuel Stanton, Andrew Wilson

Keywords Paper

optimization, reinforcement learning and planning, kernel methods, active learning

0

0

0

0

11:30

06/12/2020

Synthetic Data Generators -- Sequential and Private

Olivier Bousquet, Roi Livni, Shay Moran

Keywords Paper

Algorithms -> Stochastic Methods; Deep Learning -> Optimization for Deep Networks, Optimization -> Stochastic Optimization

0

0

0

0

3:15

12/07/2020

Implicit Generative Modeling for Efficient Exploration

Neale Ratzlaff, Qinxun Bai, Fuxin Li, Wei Xu

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

14:01

06/12/2021

List-Decodable Mean Estimation in Nearly-PCA Time

Ilias Diakonikolas, Daniel Kane, Daniel Kongsgaard and
Jerry Li, Kevin Tian

Keywords Paper

theory, clustering

0

0

0

0

14:21

14/06/2020

GeoDA: A Geometric Framework for Black-Box Adversarial Attacks

Ali Rahmati, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard, Huaiyu Dai

Keywords Paper

adversarial example, black-box attack, query efficient, neural networks, geometry, normal vector, estimation

0

0

0

0

1:01

02/02/2021

Active Feature Selection for the Mutual Information Criterion

Shachar Schnapp, Sivan Sabato

Keywords Paper

0

0

0

0

15:03

03/08/2020

Semi-bandit Optimization in the Dispersed Setting

Travis Dick, Wesley Pegden, Maria-Florina Balcan

Keywords Paper

0

0

0

0

8:04

06/12/2020

Optimal Best-arm Identification in Linear Bandits

Yassir Jedra, Alexandre Proutiere

Keywords Paper

0

0

0

0

3:21

06/12/2020

Differentiable Meta-Learning of Bandit Policies

Craig Boutilier, Chih-wei Hsu, Branislav Kveton and
Martin Mladenov, Csaba Szepesvari, Manzil Zaheer

Keywords Paper

0

0

0

0

3:10

12/07/2020

Reward-Free Exploration for Reinforcement Learning

Chi Jin, Akshay Krishnamurthy, Max Simchowitz, Tiancheng Yu

Keywords Paper

Reinforcement Learning - Theory

0

0

0

0

14:37

03/05/2021

Sample-Efficient Automated Deep Reinforcement Learning

Jörg Franke, Gregor Koehler, André Biedenkapp, Frank Hutter

Keywords Paper

Neuroevolution, Hyperparameter Optimization, Deep Reinforcement Learning, AutoRL

0

0

0

0

4:36

26/04/2020

Making Sense of Reinforcement Learning and Probabilistic Inference

Brendan O'Donoghue, Ian Osband, Catalin Ionescu

Keywords Paper

Reinforcement learning, Bayesian inference, Exploration

0

0

0

0

5:21

03/05/2021

DrNAS: Dirichlet Neural Architecture Search

Xiangning Chen, Ruochen Wang, Minhao Cheng and
Xiaocheng Tang, Cho-Jui Hsieh

Keywords Paper

0

0

0

0

5:00

06/12/2020

Deep Inverse Q-learning with Constraints

Gabriel Kalweit, Maria Huegle, Moritz Werling, Joschka Boedecker

Keywords Paper

0

0

0

0

3:14

26/08/2020

'Bring Your Own Greedy'+Max: Near-Optimal 1/2-Approximations for Submodular Knapsack

Grigory Yaroslavtsev, Samson Zhou, Dmitrii Avdiukhin

Keywords Paper

0

0

0

0

13:14

06/12/2021

Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning

Gen Li, Laixi Shi, Yuxin Chen and
Yuantao Gu, Yuejie Chi

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

15:32

16/11/2020

Exploratory Grasping: Asymptotically Optimal Algorithms for Grasping Challenging Polyhedral Objects

Michael Danielczuk, Ashwin Balakrishna, Daniel S Brown and
Shivin Devgon, Ken Goldberg

Keywords Paper

0

0

0

0

5:04

06/12/2021

Sample Selection for Fair and Robust Training

Yuji Roh, Kangwook Lee, Steven Whang, Changho Suh

Keywords Paper

optimization, robustness, fairness

0

0

0

0

13:44