CoGenT: A Content-oriented Generative-hit Framework for Content Delivery Networks
Peng Wang, Yu Liu, Ziqi Liu, Ming-Yang Wang, Ke Liu, Ke Zhou, Zhihai Huang
TL;DR
This paper tackles cache misses in CDNs by exploiting content locality through a novel edge-based generative-hit framework, CoGenT. It enables on-edge data generation from cached content using idle edge compute, addressing pseudo-missing requests while keeping generated data non-persistent to manage storage. The architecture comprises a pseudo-missing judgment module, a decision-tree-based two-pronged controller, and an extensible generative-hit processor, validated through a Tencent CDN deployment and simulator experiments. Real-world results show substantial latency and bandwidth reductions, with additional gains observed when applying CoGenT to existing caching strategies in simulation. The work lays groundwork for integrating data-generation techniques, including large language models, to further enhance CDN QoE and efficiency.
Abstract
The service provided by content delivery networks (CDNs) may overlook content locality, leaving room for potential performance improvement. In this study, we explore the feasibility of leveraging generated data as a replacement for fetching data in missing scenarios based on content locality. Due to sufficient local computing resources and reliable generation efficiency, we propose a content-oriented generative-hit framework (CoGenT) for CDNs. CoGenT utilizes idle computing resources on edge nodes to generate requested data based on similar or related cached data to achieve hits. Our implementation in a real-world system demonstrates that CoGenT reduces the average access latency by half. Additionally, experiments conducted on a simulator also confirm that CoGenT can enhance existing caching algorithms, resulting in reduced latency and bandwidth usage.
