Learning I/O Access patterns to Improve Prefetching in SSDs

Abstract: Flash based solid state drives (SSDs) have established themselves as a higher-performance alternative to hard disk drives in cloud and mobile environments. Nevertheless, SSDs remain a performance bottleneck of computer systems due to their high I/O access latency. A common approach for improving the access latency is prefetching. Prefetching predicts future block accesses and preloads them into main memory ahead of time. In this paper, we discuss the challenges of prefetching in SSDs, explain why prior approaches fail to achieve high accuracy, and present a neural network based prefetching approach that significantly outperforms the state-of the-art. To achieve high performance, we address the challenges of prefetching in very large sparse address spaces, as well as prefetching in a timely manner by predicting ahead of time. We collect I/O trace files from several real-world applications running on cloud servers and show that our proposed approach consistently outperforms the existing stride prefetchers by up to 800\(\times \) and prior prefetching approaches based on Markov chains by up to 8\(\times \). Furthermore, we propose an address mapping learning technique to demonstrate the applicability of our approach to previously unseen SSD workloads and perform a hyperparameter sensitivity study.

04/11/2020

Yu Zhou, Chen Sun, Hongqiang Harry Liu and
Rui Miao, Shi Bai, Bo Li, Zhilong Zheng, Lingjun Zhu, Zhen Shen, Yongqing Xi, Pengcheng Zhang, Dennis Cai, Ming Zhang, Mingwei Xu

distributed, asynchronous, large scale, gradient staleness, staleness penalization, sgd, deep learning, neural networks, optimization

4:36

11/08/2020

VTrace: Automatic diagnostic system for persistent packet loss in cloud-scale overlay network

Chongrong Fang, Haoyu Liu, Mao Miao and
Jie Ye, Lei Wang, Wansheng Zhang, Daxiang Kang, Biao Lyv, Peng Cheng, Jiming Chen

model compression, pruning, heat diffusion, Convolutional Neural Networks (CNN), undirected graphs, heat diffusion, skip connections, N2NSkip, scree diagram, connection sensitivity

8:47

03/05/2021

backpropagation, rtrl, real time recurrent learning, forward mode, biologically plausible, bptt, recurrent neural networks

10:12

15/06/2020

auto ml, hyperparameter optimization, meta learning, task aware, hyperband, hyperparameters, warm start, image classication, resnet, shufflenet

4:58

05/01/2021

Spike-Thrift: Towards Energy-Efficient Deep Spiking Neural Networks by Limiting Spiking Activity via Attention-Guided Compression

Yuliang Li, Gautam Kumar, Hema Hariharan and
Hassan Wassel, Peter Hochschild, Dave Platt, Simon Sabato, Minlan Yu, Nandita Dukkipati, Prashant Chandra, Amin Vahdat