Near-Optimal Confidence Sequences for Bounded Random Variables

Abstract: Many inference problems, such as sequential decision problems like A/B testing, adaptive sampling schemes like bandit selection, are often online in nature. The fundamental problem for online inference is to provide a sequence of confidence intervals that are valid uniformly over the growing-into-infinity sample sizes. To address this question, we provide a near-optimal confidence sequence for bounded random variables by utilizing Bentkus' concentration results. We show that it improves on the existing approaches that use the Cram{\'e}r-Chernoff technique such as the Hoeffding, Bernstein, and Bennett inequalities. The resulting confidence sequence is confirmed to be favorable in synthetic coverage problems, adaptive stopping algorithms, and multi-armed bandit problems.

06/12/2020

Stochastic optimization, Computational complexity, Convex optimization, Excess risk bounds and generalization error bounds

15:10

06/12/2021

Near-Optimal Confidence Sequences for Bounded Random Variables

Arun Kuchibhotla, Qinqing Zheng

Comments

Similar Papers

A General Method for Robust Learning from Batches

Ayush Jain, Alon Orlitsky

Keywords Abstract Paper

Divergence-Based Motivation for Online EM and Combining Hidden Variable Models

Ehsan Amid, Manfred K. Warmuth

Keywords Abstract Paper

Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge

Reda Ouhamma, Odalric-Ambrym Maillard, Vianney Perchet

Keywords Abstract Paper

robustness, bandits

High probability guarantees for stochastic convex optimization

Damek Davis, Dmitriy Drusvyatskiy

Keywords Abstract Paper

Stochastic optimization, Computational complexity, Convex optimization, Excess risk bounds and generalization error bounds

Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination

Dylan J Foster, Akshay Krishnamurthy

Keywords Abstract Paper

theory, reinforcement learning and planning, bandits, online learning

'Bring Your Own Greedy'+Max: Near-Optimal 1/2-Approximations for Submodular Knapsack

Grigory Yaroslavtsev, Samson Zhou, Dmitrii Avdiukhin

Keywords Abstract Paper

Sparse algorithms for markovian gaussian processes

William Wilkinson, Arno Solin, Vincent Adam

Keywords Abstract Paper

Consistent Plug-in Classifiers for Complex Objectives and Constraints

Shiv Tavker, Harish Guruprasad Ramaswamy, Harikrishna Narasimhan

Keywords Abstract Paper

Kernel Conditional Density Operators

Ingmar Schuster, Mattes Mollenhauer, Stefan Klus, Krikamol Muandet

Keywords Abstract Paper

Nonparametric Score Estimators

Yuhao Zhou, Jiaxin Shi, Jun Zhu

Keywords Abstract Paper

General Machine Learning Techniques

Greed Meets Sparsity: Understanding and Improving Greedy Coordinate Descent for Sparse Optimization

Huang Fang, Zhenan Fan, Yifan Sun, Michael Friedlander

Keywords Abstract Paper

Distributional Robustness with IPMs and links to Regularization and GANs

Hisham Husain

Keywords Abstract Paper

Faster Wasserstein Distance Estimation with the Sinkhorn Divergence

Lénaïc Chizat, Pierre Roussillon, Flavien Léger and François-Xavier Vialard, Gabriel Peyré

Keywords Abstract Paper

A nonasymptotic law of iterated logarithm for general M-estimators

Nicolas Schreuder, Victor-Emmanuel Brunel, Arnak Dalalyan,

Keywords Abstract Paper

Automatic structured variational inference

Luca Ambrogioni, Kate Lin, Emily Fertig and Sharad Vikram, Max Hinne, Dave Moore, Marcel Gerven

Keywords Abstract Paper

Composable Sketches for Functions of Frequencies: Beyond the Worst Case

Edith Cohen, Ofir Geri, Rasmus Pagh

Keywords Abstract Paper

Optimization - Large Scale, Parallel and Distributed

Localization, Convexity, and Star Aggregation

Suhas Vijaykumar

Keywords Abstract Paper

theory, online learning

Dynamic cutset networks

Chiradeep Roy, Tahrima Rahman, Hailiang Dong and Nicholas Ruozzi, Vibhav Gogate

Keywords Abstract Paper

Noise-tolerant, Reliable Active Classification with Comparison Queries

Max Hopkins, Shachar Lovett, Daniel Kane, Gaurav Mahajan

Keywords Abstract Paper

Active learning, Classification, Learning with algebraic or combinatorial structure, PAC learning

Slice Sampling Reparameterization Gradients

David M Zoltowski, Diana Cai, Ryan Adams

Keywords Abstract Paper

optimization, machine learning, generative model

Control Variates for Slate Off-Policy Evaluation

Nikos Vlassis, Ashok Chandrashekar, Fernando Amat, Nathan Kallus

Keywords Abstract Paper

optimization, bandits

Adversarially-learned Inference via an Ensemble of Discrete Undirected Graphical Models

Adarsh K Jeewajee, Leslie Kaelbling

Keywords Abstract Paper

, Algorithms -> Semi-Supervised Learning

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Lénaïc Chizat, Pierre Roussillon, Flavien Léger and
François-Xavier Vialard, Gabriel Peyré

Keywords Paper

Keywords Paper

Luca Ambrogioni, Kate Lin, Emily Fertig and
Sharad Vikram, Max Hinne, Dave Moore, Marcel Gerven

Keywords Paper

Keywords Paper

Keywords Paper

Chiradeep Roy, Tahrima Rahman, Hailiang Dong and
Nicholas Ruozzi, Vibhav Gogate

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Silviu-Marian Udrescu, Andrew Tan, Jiahai Feng and
Orisvaldo Neto, Tailin Wu, Max Tegmark

Keywords Paper

Keywords Paper

Bo Dai, Ofir Nachum, Yinlam Chow and
Lihong Li, Csaba Szepesvari, Dale Schuurmans

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zheng Wen, Doina Precup, Morteza Ibrahimi and
Andre Barreto, Benjamin Van Roy, Satinder Singh

Keywords Paper

Vu Nguyen, Vaden Masrani, Rob Brekelmans and
Michael A Osborne, Frank Wood

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Son Nguyen, Duong Nguyen, Khai Nguyen and
Khoat Than, Hung Bui, Nhat Ho

Keywords Paper