Robust Parameter Fitting to Realistic Network Models via Iterative Stochastic Approximation
Thomas Bläsius, Sarel Cohen, Philipp Fischbeck, Tobias Friedrich, Martin S. Krejca
TL;DR
The paper tackles the challenge of selecting parameters for random graph models to reproduce observed network features, especially after reducing to the largest connected component. It introduces ParFit, an anytime parameter-fitting method based on the Robbins-Monro stochastic approximation with iterate averaging, which updates parameters using just a single network sample per iteration. Across Erdős–Rényi, Chung–Lu, and GIRG models, plus 35 real networks, ParFit achieves high feature fidelity (large $E$-based correlations and small MAEs) with relatively few iterations, enabling efficient and robust parameter recovery. The approach improves practical applicability of synthetic network generation for benchmarking and analysis and opens avenues for exploring higher-dimensional geometries and alternative feature mappings.
Abstract
Random graph models are widely used to understand network properties and graph algorithms. Key to such analyses are the different parameters of each model, which affect various network features, such as its size, clustering, or degree distribution. The exact effect of the parameters on these features is not well understood, mainly because we lack tools to thoroughly investigate this relation. Moreover, the parameters cannot be considered in isolation, as changing one affects multiple features. Existing approaches for finding the best model parameters of desired features, such as a grid search or estimating the parameter-feature relations, are not well suited, as they are inaccurate or computationally expensive. We introduce an efficient iterative fitting method, named ParFit, that finds parameters using only a few network samples, based on the Robbins-Monro algorithm. We test ParFit on three well-known graph models, namely Erdős-Rényi, Chung-Lu, and geometric inhomogeneous random graphs, as well as on real-world networks, including web networks. We find that ParFit performs well in terms of quality and running time across most parameter configurations.
