Scale invariance and statistical significance in complex weighted networks
Filipi N. Silva, Sadamori Kojaku, Alessandro Flammini, Filippo Radicchi, Santo Fortunato
TL;DR
Addressed the problem that statistical significance in weighted networks, when assessed with the Weighted Configuration Model (WCM), depends on the weight scale. The authors show that while several measures are scale-invariant, the WCM yields null distributions whose width scales as $A^{-1/2}$, making $p$-values scale-dependent; they propose a two-step, scale-invariant null model that separates structure and weights by first randomizing topology and then drawing weights from a scale-invariant exponential distribution with mean related to $s_i$, $s_j$, $k_i$, $k_j$, and $W$. They present a practical CReMb variant with modularity compatibility that preserves mean strengths while allowing efficient sampling, and they validate the approach on four real networks (e.g., Zachary Karate Club, NKI Brain, World Trade, London Transport) along with Netzschleuder data. The results show that weighted clustering is often significant under the scale-invariant null, maximum eigenvector centrality is generally not, and modularity significance depends on network; overall the method enables unbiased assessments of weighted networks and clarifies the limitations of the WCM. The work clarifies the nontrivial role of scale in null models and provides a pathway to robust statistical inference in complex weighted networks.
Abstract
Most networks encountered in nature, society, and technology have weighted edges, representing the strength of the interaction/association between their vertices. Randomizing the structure of a network is a classic procedure used to estimate the statistical significance of properties of the network, such as transitivity, centrality and community structure. Randomization of weighted networks has traditionally been done via the weighted configuration model (WCM), a simple extension of the configuration model, where weights are interpreted as bundles of edges. It has previously been shown that the ensemble of randomizations provided by the WCM is affected by the specific scale used to compute the weights, but the consequences for statistical significance were unclear. Here we find that statistical significance based on the WCM is scale-dependent, whereas in most cases results should be independent of the choice of the scale. More generally, we find that designing a null model that does not violate scale invariance is challenging. A two-step approach, originally introduced for network reconstruction, in which one first randomizes the structure, then the weights, with a suitable distribution, restores scale invariance, and allows us to conduct unbiased assessments of significance on weighted networks.
