Data-driven Intra-Autonomous Systems Graph Generator
Caio Vinicius Dadauto, Nelson Luis Saldanha da Fonseca, Ricardo da Silva Torres
TL;DR
This work tackles the challenge of generating realistic intra-autonomous system topologies for Internet research, where traditional scale-free and structure-based generators fail to capture multi-metric properties. It introduces DGGI, a GraphRNN-based deep generative model, and the IGraphs dataset of 90,326 real intra-AS subgraphs derived from ITDK, enabling data-driven training and high-fidelity topology synthesis. Using a three-stage pipeline (FRM-based training-set construction, GraphRNN-based DGGM training, and threshold-based synthetic generation), the authors demonstrate substantial improvements in distributional similarity across node degree, clustering, betweenness, and assortativity, as measured by $MMD$. The results suggest that DGGI offers meaningful gains over baselines and that IGraphs provides a valuable resource for training and evaluating graph-based Internet models, with potential for broader DL-based topology research and protocol evaluation.
Abstract
Accurate modeling of realistic network topologies is essential for evaluating novel Internet solutions. Current topology generators, notably scale-free-based models, fail to capture multiple properties of intra-AS topologies. While scale-free networks encode node-degree distribution, they overlook crucial graph properties like betweenness, clustering, and assortativity. The limitations of existing generators pose challenges for training and evaluating deep learning models in communication networks, emphasizing the need for advanced topology generators encompassing diverse Internet topology characteristics. This paper introduces a novel deep-learning-based generator of synthetic graphs representing intra-autonomous in the Internet, named Deep-Generative Graphs for the Internet (DGGI). It also presents a novel massive dataset of real intra-AS graphs extracted from the project ITDK, called IGraphs. It is shown that DGGI creates synthetic graphs that accurately reproduce the properties of centrality, clustering, assortativity, and node degree. The DGGI generator overperforms existing Internet topology generators. On average, DGGI improves the MMD metric $84.4\%$, $95.1\%$, $97.9\%$, and $94.7\%$ for assortativity, betweenness, clustering, and node degree, respectively.
