Fast Unbiased Sampling of Networks with Given Expected Degrees and Strengths
Xuanchi Li, Xin Wang, Sadamori Kojaku
TL;DR
This paper tackles bias in the Chung-Lu model and the computational bottleneck of MaxEnt configuration models by introducing fast Miller-Hagberg–based sampling algorithms for the Undirected Binary Configuration Model (UBCM) and the Undirected Enhanced Configuration Model (UECM). It demonstrates dramatic speedups (10–1000×) over brute-force sampling across 103 networks while preserving degree (and strength) constraints, yielding unbiased ensembles that reflect the intended degree heterogeneity. The approach extends to bipartite, directed, and hypergraph representations, enabling principled statistical testing of network structure at scale. An open-source Python implementation provides practical means to adopt these rigorous MaxEnt models in large-network analyses and comparisons.
Abstract
The configuration model is a cornerstone of statistical assessment of network structure. While the Chung-Lu model is among the most widely used configuration models, it systematically oversamples edges between large-degree nodes, leading to inaccurate statistical conclusions. Although the maximum entropy principle offers unbiased configuration models, its high computational cost has hindered widespread adoption, making the Chung-Lu model an inaccurate yet persistently practical choice. Here, we propose fast and efficient sampling algorithms for the max-entropy-based models by adapting the Miller-Hagberg algorithm. Evaluation on 103 empirical networks demonstrates 10-1000 times speedup, making theoretically rigorous configuration models practical and contributing to a more accurate understanding of network structure.
