An End-to-End Automatic Cache Replacement Policy Using Deep Reinforcement Learning

Authors

  • Yang Zhou Wuhan National Laboratory for Optoelectronics Huazhong University of Science and Technology
  • Fang Wang Wuhan National Laboratory for Optoelectronics Huazhong University of Science and Technology
  • Zhan Shi Wuhan National Laboratory for Optoelectronics Huazhong University of Science and Technology
  • Dan Feng Wuhan National Laboratory for Optoelectronics Huazhong University of Science and Technology

DOI:

https://doi.org/10.1609/icaps.v32i1.19840

Keywords:

Cache Replacement Policy, Reinforcement Learning, Machine Learning

Abstract

In the past few decades, much research has been conducted on the design of cache replacement policies. Prior work frequently relies on manually-engineered heuristics to capture the most common cache access patterns, or predict the reuse distance and try to identify the blocks that are either cache-friendly or cache-averse. Researchers are now applying recent advances in machine learning to guide cache replacement policy, augmenting or replacing traditional heuristics and data structures. However, most existing approaches depend on the certain environment which restricted their application, e.g, most of the approaches only consider the on-chip cache consisting of program counters (PCs). Moreover, those approaches with attractive hit rates are usually unable to deal with modern irregular workloads, due to the limited feature used. In contrast, we propose a pervasive cache replacement framework to automatically learn the relationship between the probability distribution of different replacement policies and workload distribution by using deep reinforcement learning. We train an end-to-end cache replacement policy only on the past requested address through two simple and stable cache replacement policies. Furthermore, the overall framework can be easily plugged into any scenario that requires cache. Our simulation results on 8 production storage traces run against 3 different cache configurations confirm that the proposed cache replacement policy is effective and outperforms several state-of-the-art approaches.

Downloads

Published

2022-06-13

How to Cite

Zhou, Y., Wang, F., Shi, Z., & Feng, D. (2022). An End-to-End Automatic Cache Replacement Policy Using Deep Reinforcement Learning. Proceedings of the International Conference on Automated Planning and Scheduling, 32(1), 537-545. https://doi.org/10.1609/icaps.v32i1.19840