Distributed Simulation for Digital Twins of Large-Scale Real-World DiffServ-Based Networks
Zhuoyao Huang, Nan Zhang, Jingran Shen, Georgios Diamantopoulos, Zhengchang Hua, Nikos Tziritas, Georgios Theodoropoulos
TL;DR
This paper addresses the challenge of high-fidelity, scalable online simulation of DiffServ-based networks for Digital Twins. It introduces Quaint, an optimistic PDES toolkit built on the ROSS engine, paired with a novel high-efficiency event-handling model for DiffServ routers to drastically reduce simulation events. Through real-world metropolitan topology experiments, Quaint achieves up to 53.66x speedups in sequential mode and up to 232.88x in distributed mode compared with OMNeT++/INET, while maintaining accuracy in QoS metrics. The work also demonstrates the critical role of workload-aware partitioning for scalable distributed simulation and outlines directions for future QoS modelling and workload-aware partitioning strategies.
Abstract
Digital Twin technology facilitates the monitoring and online analysis of large-scale communication networks. Faster predictions of network performance thus become imperative, especially for analysing Quality of Service (QoS) parameters in large-scale city networks. Discrete Event Simulation (DES) is a standard network analysis technology, and can be further optimised with parallel and distributed execution for speedup, referred to as Parallel Discrete Event Simulation (PDES). However, modelling detailed QoS mechanisms such as DiffServ requires complex event handling for each network router, which can involve excessive simulation events. In addition, current PDES for network analysis mostly adopts conservative scheduling, which suffers from excessive global synchronisation to avoid causality problems. The performance analysis of optimistic PDES for real-world large-scale network topology and complex QoS mechanisms is still inadequate. To address these gaps, this paper proposes a simulation toolkit, Quaint, which leverages an optimistic PDES engine ROSS, for detailed modelling of DiffServ-based networks. A novel event-handling model for each network router is also proposed to significantly reduce the number of events in complex QoS modelling. Quaint has been evaluated using a real-world metropolitan-scale network topology with 5,000 routers/switches. Results show that compared to the conventional simulator OMNeT++/INET, even the sequential mode of Quaint can achieve a speedup of 53 times, and the distributed mode has a speedup of 232 times. Scalability characterisation is conducted to portray the efficiency of distributed execution, and the results indicate the future direction for workload-aware model partitioning.
