Towards Efficient Flash Caches with Emerging NVMe Flexible Data Placement SSDs

Michael Allison; Arun George; Javier Gonzalez; Dan Helmick; Vikash Kumar; Roshan Nair; Vivek Shah

TL;DR

The paper addresses high device-level write amplification (DLWA) in Flash-based caches and its carbon implications. It proposes using NVMe Flexible Data Placement (FDP) within the CacheLib architecture to separate small, hot data from large, cold data, so that writes from the small object cache (SOC) and the large object cache (LOC) are isolated on the device without changing the cache design. The authors develop a theoretical model of DLWA and CO2e emissions and pair it with an implementation that adds FDP-aware placement handles and I/O management to CacheLib, validated on production traces from Meta and Twitter. Results show a DLWA close to 1 and substantial reductions in embodied and operational carbon, along with improved SSD utilization and feasible multi-tenant deployments, indicating a practical path to carbon-efficient Flash caches at scale.

Abstract

NVMe Flash-based SSDs are widely deployed in data centers to cache working sets of large-scale web services. As data centers face increasing sustainability demands, such as reduced carbon emissions, efficient management of Flash overprovisioning and endurance has become crucial. Our analysis demonstrates that mixing data with different lifetimes on Flash blocks results in high device garbage collection costs, which either reduce device lifetime or necessitate host overprovisioning. Targeted data placement on Flash, which minimizes data intermixing and thus device write amplification, shows promise for addressing this issue. NVMe Flexible Data Placement (FDP) is a newly ratified technical proposal that addresses data placement needs while reducing the software engineering costs associated with past storage interfaces, such as ZNS and Open-Channel SSDs. In this study, we explore the feasibility, benefits, and limitations of leveraging NVMe FDP primitives for data placement on Flash media in CacheLib, a popular open-source Flash cache widely deployed in Meta's software ecosystem as a caching building block. We demonstrate that targeted data placement in CacheLib using NVMe FDP SSDs helps reduce device write amplification, embodied carbon emissions, and power consumption with almost no overhead to other metrics. Using multiple production traces and their configurations from Meta and Twitter, we show that an ideal device write amplification of ~1 can be achieved with FDP, leading to improved SSD utilization and sustainable Flash cache deployments.
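
As a rough illustration of the metric the abstract refers to, device write amplification is conventionally the ratio of bytes written to NAND to bytes written by the host, where the NAND total includes data relocated by garbage collection. The sketch below is not taken from the paper: the function name and all byte counts are hypothetical, chosen only to show why fewer relocations push the ratio towards 1.

```python
def device_write_amplification(host_bytes: int, gc_relocated_bytes: int) -> float:
    """Return NAND bytes written divided by host bytes written.

    NAND writes are modelled as host writes plus the bytes the device
    copies during garbage collection (a simplifying assumption).
    """
    if host_bytes <= 0:
        raise ValueError("host_bytes must be positive")
    if gc_relocated_bytes < 0:
        raise ValueError("gc_relocated_bytes must be non-negative")
    return (host_bytes + gc_relocated_bytes) / host_bytes


GIB = 2**30
host_written = 100 * GIB  # hypothetical host write volume

# Hypothetical case 1: data with different lifetimes shares Flash blocks,
# so garbage collection must relocate a lot of still-valid data.
dlwa_mixed = device_write_amplification(host_written, 150 * GIB)

# Hypothetical case 2: data is segregated by lifetime, so blocks are
# mostly invalid when reclaimed and little data is relocated.
dlwa_separated = device_write_amplification(host_written, 2 * GIB)

print(f"mixed placement:     DLWA = {dlwa_mixed:.2f}")
print(f"separated placement: DLWA = {dlwa_separated:.2f}")
```

With these made-up inputs the mixed case comes out well above 1 and the separated case close to 1, which is the direction of the effect the abstract describes; the actual magnitudes depend on the workload and device.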
