SCART: Predicting STT-RAM Cache Retention Times Using Machine Learning
Dhruv Gajaria, Kyle Kuan, Tosiron Adegbija
TL;DR
The paper tackles the challenge of selecting appropriate STT-RAM cache retention times to balance latency and energy across diverse workloads. It introduces SCART, a KNN-based predictor that uses readily available SRAM performance-counter statistics to forecast the optimal STT-RAM retention time, enabling both design-time and runtime retention-time provisioning with reduced exploration overhead. Empirical results across SPEC CPU2006, MiBench, and GAP show SCART achieving average latency reductions of about 20.3% and energy reductions of about 29.1% over a homogeneous retention, while cutting exploration overhead by roughly 52.6% compared with exhaustive search. The approach demonstrates robustness in single- and multi-core scenarios and offers practical potential for runtime cache management in evolving STT-RAM architectures, with future work focusing on hardware integration and reducing labeling requirements.
Abstract
Prior studies have shown that the retention time of the non-volatile spin-transfer torque RAM (STT-RAM) can be relaxed in order to reduce STT-RAM's write energy and latency. However, since different applications may require different retention times, STT-RAM retention times must be critically explored to satisfy various applications' needs. This process can be challenging due to exploration overhead, and exacerbated by the fact that STT-RAM caches are emerging and are not readily available for design time exploration. This paper explores using known and easily obtainable statistics (e.g., SRAM statistics) to predict the appropriate STT-RAM retention times, in order to minimize exploration overhead. We propose an STT-RAM Cache Retention Time (SCART) model, which utilizes machine learning to enable design time or runtime prediction of right-provisioned STT-RAM retention times for latency or energy optimization. Experimental results show that, on average, SCART can reduce the latency and energy by 20.34% and 29.12%, respectively, compared to a homogeneous retention time while reducing the exploration overheads by 52.58% compared to prior work.
