Stochastic Graphical Bandits with Adversarial Corruptions

Abstract: We study bandits with graph-structured feedback, where a learner repeatedly selects an arm and then observes rewards of the chosen arm as well as its neighbors in the feedback graph. Existing work on graphical bandits assumes either stochastic rewards or adversarial rewards, both of which are extremes and appear rarely in real-world scenarios. In this paper, we study graphical bandits with a reward model that interpolates between the two extremes, where the rewards are overall stochastically generated but a small fraction of them can be adversarially corrupted. For this problem, we propose an online algorithm that can utilize the stochastic pattern and also tolerate the adversarial corruptions. The main idea is to restrict exploration to carefully-designed independent sets of the feedback graph and perform exploitation by adopting a soft version of arm elimination. Theoretical analysis shows that our algorithm attains an $O(\alpha \ln{K} \ln{T} + \alpha C)$ regret, where $\alpha$ is the independence number of the feedback graph, $K$ is the number of arms, $T$ is the time horizon, and $C$ quantifies the total corruptions introduced by the adversary. The effectiveness of our algorithm is demonstrated by numerical experiments.

02/02/2021

Stochastic Graphical Bandits with Adversarial Corruptions

Shiyin Lu, Guanghui Wang, Lijun Zhang

Comments

Similar Papers

Stochastic Bandits with Graph Feedback in Non-Stationary Environments

Shiyin Lu, Yao Hu, Lijun Zhang

Keywords Abstract Paper

Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback

Siwei Wang, Haoyun Wang, Longbo Huang

Keywords Abstract Paper

Multi-armed bandits with cost subsidy

Deeksha Sinha, Karthik Abinav Sankararaman, Abbas Kazerouni, Vashist Avadhanula

Keywords Abstract Paper

Recurrent Submodular Welfare and Matroid Blocking Semi-Bandits

Orestis Papadigenopoulos, Constantine Caramanis

Keywords Abstract Paper

bandits

Incentivized Bandit Learning with Self-Reinforcing User Preferences

Tianchen Zhou, Jia Liu, Chaosheng Dong, jingyuan deng

Keywords Abstract Paper

Reinforcement Learning and Planning, Bandits

Causal Discovery with Reinforcement Learning

Shengyu Zhu, Ignavier Ng, Zhitang Chen

Keywords Abstract Paper

causal discovery, structure learning, reinforcement learning, directed acyclic graph

Online Markov Decision Processes with Aggregate Bandit Feedback

Alon Cohen, Haim Kaplan, Tomer Koren, Yishay Mansour

Keywords Abstract Paper

Cooperative Stochastic Bandits with Asynchronous Agents and Constrained Feedback

Lin Yang, Yu-Zhen Janice Chen, Stephen Pasteris and Mohammad Hajiesmaili, John C. S. Lui, Don Towsley

Keywords Abstract Paper

bandits

Thresholding Graph Bandits with GrAPL

Daniel LeJeune, Gautam Dasarathy, Richard Baraniuk

Keywords Abstract Paper

Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes

Yuval Emek, Ron Lavi, Rad Niazadeh, Yangguang Shi

Keywords Abstract Paper

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

Siwei Wang, Longbo Huang, John C. S. Lui

Keywords Abstract Paper

Adversarial Linear Contextual Bandits with Graph-Structured Side Observations

Lingda Wang, Bingcong Li, Huozhi Zhou and Georgios B. Giannakis, Lav R. Varshney, Zhizhen Zhao

Keywords Abstract Paper

Beyond Bandit Feedback in Online Multiclass Classification

Dirk van der Hoeven, Federico Fusco, Nicolò Cesa-Bianchi

Keywords Abstract Paper

reinforcement learning and planning, machine learning, graph learning, bandits, online learning

Efficient Bandit Convex Optimization: Beyond Linear Losses

Arun Sai Suggala, Pradeep Ravikumar, Praneeth Netrapalli

Keywords Abstract Paper

Robust Collective Classification against Structural Attacks

Kai Zhou, Yevgeniy Vorobeychik

Keywords Abstract Paper

Budgeted and non-budgeted causal bandits

Vineet Nair, Vishakha Patil, Gaurav Sinha

Keywords Abstract Paper

Online Algorithm for Unsupervised Sequential Selection with Contextual Information

Arun Verma, Manjesh Kumar Hanawal, Csaba Szepesvari, Venkatesh Saligrama

Keywords Abstract Paper

Adversarial Attacks on Deep Graph Matching

Zijie Zhang, Zeru Zhang, Yang Zhou and Yelong Shen, Ruoming Jin, Dejing Dou

Keywords Abstract Paper

Latent Bandits Revisited

Joey Hong, Branislav Kveton, Manzil Zaheer and Yinlam Chow, Amr Ahmed, Craig Boutilier

Keywords Abstract Paper

Differentiable Meta-Learning of Bandit Policies

Craig Boutilier, Chih-wei Hsu, Branislav Kveton and Martin Mladenov, Csaba Szepesvari, Manzil Zaheer

Keywords Abstract Paper

Towards More Practical Adversarial Attacks on Graph Neural Networks

Jiaqi Ma, Shuangrui Ding, Qiaozhu Mei

Keywords Abstract Paper

Parametric Graph for Unimodal Ranking Bandit

CamilleS GAUTHIER, Romaric Gaudel, Elisa Fromont, Boammani Aser Lompo

Keywords Abstract Paper

Reinforcement Learning and Planning, Bandits

DORB: Dynamically Optimizing Multiple Rewards with Bandits

Ramakanth Pasunuru, Han Guo, Mohit Bansal

Keywords Abstract Paper

language tasks, optimization rewards, nlg tasks, question generation

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Lin Yang, Yu-Zhen Janice Chen, Stephen Pasteris and
Mohammad Hajiesmaili, John C. S. Lui, Don Towsley

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Lingda Wang, Bingcong Li, Huozhi Zhou and
Georgios B. Giannakis, Lav R. Varshney, Zhizhen Zhao

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zijie Zhang, Zeru Zhang, Yang Zhou and
Yelong Shen, Ruoming Jin, Dejing Dou

Keywords Paper

Joey Hong, Branislav Kveton, Manzil Zaheer and
Yinlam Chow, Amr Ahmed, Craig Boutilier

Keywords Paper

Craig Boutilier, Chih-wei Hsu, Branislav Kveton and
Martin Mladenov, Csaba Szepesvari, Manzil Zaheer

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Shuai Li, Fang Kong, Kejie Tang and
Qizhi Li, Wei Chen

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Alexia Atsidakou, Orestis Papadigenopoulos, Soumya Basu and
Constantine Caramanis, Sanjay Shakkottai

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper