Digital Twin-Empowered Deep Reinforcement Learning for Intelligent VNF Migration in Edge-Core Networks
Faisal Ahmed, Suresh Subramaniam, Motoharu Matsuura, Hiroshi Hasegawa, Shih-Chun Lin
TL;DR
The paper addresses efficient VNF migration in edge–core networks under dynamic workloads by formulating migration as a Markov Decision Process and solving it with Advantage Actor‑Critic (A2C). It introduces a Digital Twin (DT) module comprising a multi‑task VAE and a multi‑task LSTM to generate synthetic experiences and predict environment dynamics, enabling faster policy convergence and lower real‑world exploration costs. Empirical results show the DT‑empowered DRL framework delivering substantial reductions in both average E2E delay and energy consumption, outperforming non‑DT baselines and random migration. The approach offers scalable, adaptive, and energy‑aware orchestration for heterogeneous edge–core infrastructures, with practical implications for low‑latency and green networking.
Abstract
The growing demand for services and the rapid deployment of virtualized network functions (VNFs) pose significant challenges for achieving low-latency and energy-efficient orchestration in modern edge-core network infrastructures. To address these challenges, this study proposes a Digital Twin (DT)-empowered Deep Reinforcement Learning framework for intelligent VNF migration that jointly minimizes average end-to-end (E2E) delay and energy consumption. By formulating the VNF migration problem as a Markov Decision Process and utilizing the Advantage Actor-Critic model, the proposed framework enables adaptive and real-time migration decisions. A key innovation of the proposed framework is the integration of a DT module composed of a multi-task Variational Autoencoder and a multi-task Long Short-Term Memory network. This combination collectively simulates environment dynamics and generates high-quality synthetic experiences, significantly enhancing training efficiency and accelerating policy convergence. Simulation results demonstrate substantial performance gains, such as significant reductions in both average E2E delay and energy consumption, thereby establishing new benchmarks for intelligent VNF migration in edge-core networks.
