Digital Twin-Assisted Data-Driven Optimization for Reliable Edge Caching in Wireless Networks
Zifan Zhang, Yuchen Liu, Zhiyuan Peng, Mingzhe Chen, Dongkuan Xu, Shuguang Cui
TL;DR
This work tackles reliable edge caching in dense nextG networks by integrating digital twins (DTs) with constrained Markov decision process–based reinforcement learning. The proposed D-REC framework uses a three-stage network DT creation/synchronization process (dynamic connectivity segmentation, vertical twinning, horizontal twinning) to generate rich data for training and safety evaluation, while embedding four reliability intervention modules that modify state, action, or reward to prevent BS overloads and ensure load balance. The authors prove that these reliability additions do not degrade convergence and demonstrate through simulations and real-data traces that D-REC improves cache hit rates and load balancing compared with traditional caching methods and basic DRL baselines. The DT-enabled, reliability-aware approach enhances robustness and predictability of caching decisions in nextG wireless networks, supporting scalable, low-latency content delivery in highly dynamic environments. The framework has practical implications for deploying reliable, data-driven caching policies in real-world edge networks and offers a pathway to safer, more stable RL-based networking solutions, even under distributional uncertainty such as Zipf-like content popularity.
Abstract
Optimizing edge caching is crucial for the advancement of next-generation (nextG) wireless networks, ensuring high-speed and low-latency services for mobile users. Existing data-driven optimization approaches often lack awareness of the distribution of random data variables and focus solely on optimizing cache hit rates, neglecting potential reliability concerns, such as base station overload and unbalanced cache issues. This oversight can result in system crashes and degraded user experience. To bridge this gap, we introduce a novel digital twin-assisted optimization framework, called D-REC, which integrates reinforcement learning (RL) with diverse intervention modules to ensure reliable caching in nextG wireless networks. We first develop a joint vertical and horizontal twinning approach to efficiently create network digital twins, which are then employed by D-REC as RL optimizers and safeguards, providing ample datasets for training and predictive evaluation of our cache replacement policy. By incorporating reliability modules into a constrained Markov decision process, D-REC can adaptively adjust actions, rewards, and states to comply with advantageous constraints, minimizing the risk of network failures. Theoretical analysis demonstrates comparable convergence rates between D-REC and vanilla data-driven methods without compromising caching performance. Extensive experiments validate that D-REC outperforms conventional approaches in cache hit rate and load balancing while effectively enforcing predetermined reliability intervention modules.
