Online Learning for Active Cache Synchronization

12/07/2020

Andrey Kolobov, Sebastien Bubeck, Julian Zimmert

Keywords: Online Learning, Active Learning, and Bandits

Abstract: Existing multi-armed bandit (MAB) models make two implicit assumptions: an arm generates a payoff only when it is played, and the agent observes every payoff that is generated. This paper introduces synchronization bandits, a MAB variant where all arms generate costs at all times, but the agent observes an arm's instantaneous cost only when the arm is played. Synchronization MABs are inspired by online caching scenarios such as Web crawling, where an arm corresponds to a cached item and playing the arm means downloading its fresh copy from a server. While not refreshed, each cached item grows progressively stale with time, continuously generating stochastic costs due to degraded cache performance, but the cache doesn't know how much until it refreshes the item and computes the difference between the item’s fresh version and the old one. We present MirrorSync, an online learning algorithm for synchronization bandits, establish an adversarial regret of $O(T^{2/3})$ for it, and show how to make it efficient in practice.
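
The setting described in the abstract can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the paper's MirrorSync algorithm: the class `SyncBanditEnv`, the per-step change probabilities, the "number of missed changes" cost, and the `round_robin` baseline are all hypothetical choices made here for illustration.

```python
import random


class SyncBanditEnv:
    """Toy synchronization-bandit environment (illustrative assumptions only)."""

    def __init__(self, change_probs, seed=0):
        # change_probs[i]: assumed per-step probability that item i changes at its source.
        self.change_probs = list(change_probs)
        self.rng = random.Random(seed)
        # Changes the cached copy has missed so far; hidden from the agent.
        self.missed = [0] * len(self.change_probs)
        self.total_cost = 0

    def step(self, arm=None):
        """Advance one step; optionally play (refresh) one arm.

        Returns the played arm's instantaneous cost, or None if no arm is played.
        """
        for i, p in enumerate(self.change_probs):
            if self.rng.random() < p:
                self.missed[i] += 1
        # Every arm generates cost at every step, whether or not it is played.
        self.total_cost += sum(self.missed)
        if arm is None:
            return None
        observed = self.missed[arm]  # revealed only by refreshing the item
        self.missed[arm] = 0         # the cached copy is fresh again
        return observed


def round_robin(env, horizon):
    """Baseline policy (not MirrorSync): refresh arms in a fixed cyclic order."""
    n = len(env.change_probs)
    return [env.step(arm=t % n) for t in range(horizon)]
```

For example, `round_robin(SyncBanditEnv([0.5, 0.1, 0.9]), 100)` returns only the costs seen at refresh time, while `env.total_cost` also accumulates the costs the agent never observed. That gap is what separates this setting from a standard MAB.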

The talk and the respective paper are published at the ICML 2020 virtual conference.

Similar Papers

06/12/2021

Recurrent Submodular Welfare and Matroid Blocking Semi-Bandits

Orestis Papadigenopoulos, Constantine Caramanis

Keywords: bandits

12:28

02/02/2021

Stochastic Graphical Bandits with Adversarial Corruptions

Shiyin Lu, Guanghui Wang, Lijun Zhang

17:05

06/12/2020

Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes

Yuval Emek, Ron Lavi, Rad Niazadeh, Yangguang Shi

3:10

12/08/2020

ETHBMC: A Bounded Model Checker for Smart Contracts

Joel Frank, Cornelius Aschermann, Thorsten Holz

12:12

18/07/2021

Memory Efficient Online Meta Learning

Durmus Alp Emre Acar, Ruizhao Zhu, Venkatesh Saligrama

Keywords: Algorithms

5:20

06/12/2021

Observation-Free Attacks on Stochastic Bandits

Yinglun Xu, Bhuvesh Kumar, Jacob Abernethy

Keywords: bandits

3:07

13/07/2020

Desperately Seeking ... Optimal Multi-Tier Cache Configurations

Tyler Estro, Pranav Bhandari, Avani Wildani, Erez Zadok

14:42

12/07/2020

An Imitation Learning Approach for Cache Replacement

Evan Liu, Milad Hashemi, Kevin Swersky, Parthasarathy Ranganathan, Junwhan Ahn

Keywords: Applications - Other

14:20

15/06/2020

BlankIt library debloating: Getting what you want instead of cutting what you don’t

Chris Porter, Girish Mururu, Prithayan Barua, Santosh Pande

Keywords: software debloating, program security

16:47

02/02/2021

Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback

Siwei Wang, Haoyun Wang, Longbo Huang

19:29

12/08/2020

RELOAD+REFRESH: Abusing Cache Replacement Policies to Perform Stealthy Cache Attacks

Samira Briongos, Pedro Malagón, José M. Moya, Thomas Eisenbarth

10:04

15/06/2020

OSCA: An Online-Model Based Cache Allocation Scheme in Cloud Block Storage Systems

Yu Zhang, Ping Huang, Ke Zhou, Hua Wang, Jianying Hu, Yongguang Ji, Bin Cheng

21:30

06/12/2020

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

Siwei Wang, Longbo Huang, John C. S. Lui

3:19

13/04/2021

Multi-armed bandits with cost subsidy

Deeksha Sinha, Karthik Abinav Sankararaman, Abbas Kazerouni, Vashist Avadhanula

2:52

12/07/2020

Thompson Sampling Algorithms for Mean-Variance Bandits

Qiuyu Zhu, Vincent Tan

Keywords: Online Learning, Active Learning, and Bandits

14:31

06/12/2020

Online Algorithm for Unsupervised Sequential Selection with Contextual Information

Arun Verma, Manjesh Kumar Hanawal, Csaba Szepesvari, Venkatesh Saligrama

3:21

19/08/2021

Asynchronous Active Learning with Distributed Label Querying

Sheng-Jun Huang, Chen-Chen Zong, Kun-Peng Ning, Hai-Bo Ye

Keywords: Machine Learning, Active Learning, Weakly Supervised Learning, Semi-Supervised Learning

14:17

18/07/2021

Resource Allocation in Multi-armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism

Brijen Thananjeyan, Kirthevasan Kandasamy, Ion Stoica, Michael Jordan, Ken Goldberg, Joseph E Gonzalez

Keywords: Reinforcement Learning and Planning, Bandits

20:41

18/07/2021

Data-Free Knowledge Distillation for Heterogeneous Federated Learning

Zhuangdi Zhu, Junyuan Hong, Jiayu Zhou

Keywords: Algorithms

5:15

04/11/2020

Fast RDMA-based Ordered Key-Value Store using Remote Learned Cache

Xingda Wei, Rong Chen, Haibo Chen

18:58

06/12/2020

Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes

Mengdi Xu, Wenhao Ding, Jiacheng Zhu, ZUXIN LIU, Baiming Chen, Ding Zhao

3:21

19/08/2021

Online Credit Payment Fraud Detection via Structure-Aware Hierarchical Recurrent Neural Network

Wangli Lin, Li Sun, Qiwei Zhong, Can Liu, Jinghua Feng, Xiang Ao, Hao Yang

Keywords: Multidisciplinary Topics and Applications, Economic and Finance, Mining Text, Web, Social Media, Personalization and User Modeling

12:27

03/08/2020

Recoverable mutual exclusion with constant amortized RMR complexity from standard primitives

David Yu Cheng Chan, Philipp Woelfel

Keywords: shared memory, fetch and increment, recoverable mutual exclusion, asynchronous system

18:37

06/12/2021

Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits

Reda Ouhamma, Rémy Degenne, Vianney Perchet, Pierre Gaillard

Keywords: bandits, online learning

14:36

13/04/2021

Multitask bandit learning through heterogeneous feedback aggregation

Zhi Wang, Chicheng Zhang, Manish Kumar Singh, Laurel Riek, Kamalika Chaudhuri

3:07

13/04/2021

Contextual blocking bandits

Soumya Basu, Orestis Papadigenopoulos, Constantine Caramanis, Sanjay Shakkottai

2:47

06/12/2021

Accumulative Poisoning Attacks on Real-time Data

Tianyu Pang, Xiao Yang, Yinpeng Dong, Hang Su, Jun Zhu

Keywords: machine learning, online learning, federated learning

5:55

18/07/2021

Online Limited Memory Neural-Linear Bandits with Likelihood Matching

Ofir Nabati, Tom Zahavy, Shie Mannor

Keywords: Reinforcement Learning and Planning, Bandits

5:08

12/08/2020

Temporal System Call Specialization for Attack Surface Reduction

Seyedhamed Ghavamnia, Tapti Palit, Shachee Mishra, Michalis Polychronakis

11:56

06/12/2021

Asynchronous Decentralized Online Learning

Jiyan Jiang, Wenpeng Zhang, Jinjie GU, Wenwu Zhu

Keywords: optimization, online learning

9:12

04/08/2021

Efficient Bandit Convex Optimization: Beyond Linear Losses

Arun Sai Suggala, Pradeep Ravikumar, Praneeth Netrapalli

20:29

12/08/2020

An Off-Chip Attack on Hardware Enclaves via the Memory Bus

Dayeol Lee, Dongha Jung, Ian T. Fang, Chia-Che Tsai, Raluca Ada Popa

12:13

18/07/2021

Parametric Graph for Unimodal Ranking Bandit

CamilleS GAUTHIER, Romaric Gaudel, Elisa Fromont, Boammani Aser Lompo

Keywords: Reinforcement Learning and Planning, Bandits

4:49

13/04/2021

Active online learning with hidden shifting domains

Yining Chen, Haipeng Luo, Tengyu Ma, Chicheng Zhang

3:06

02/02/2021

Asynchronous Teacher Guided Bit-wise Hard Mining for Online Hashing

Sheng Jin, Qin Zhou, Hongxun Yao, Yao Liu, Xian-Sheng Hua

17:01

13/04/2021

Free-rider attacks on model aggregation in federated learning

Yann Fraboni, Richard Vidal, Marco Lorenzi

3:02

13/04/2021

An efficient algorithm for generalized linear bandit: Online stochastic gradient descent and Thompson sampling

Qin Ding, Cho-Jui Hsieh, James Sharpnack

3:03

06/12/2021

Leveraging Spatial and Temporal Correlations in Sparsified Mean Estimation

Divyansh Jhunjhunwala, Ankur Mallick, Advait Gadhikar, Swanand Kadhe, Gauri Joshi

Keywords: optimization, federated learning

13:14

14/06/2020

Online Depth Learning Against Forgetting in Monocular Videos

Zhenyu Zhang, Stéphane Lathuilière, Elisa Ricci, Nicu Sebe, Yan Yan, Jian Yang

Keywords: depth estimation, online adaptation, domain adaptation, meta-learning, online learning

0:59

02/02/2021

Revisiting Consistent Hashing with Bounded Loads

John Chen, Benjamin Coleman, Anshumali Shrivastava

18:38