On sparsity, power-law and clustering properties of graphex processes
François Caron, Francesca Panero, Judith Rousseau
TL;DR
This work analyzes graphex processes, linking sparsity, power-law degree behavior, and transitivity to the tail of the marginal graphon via regular variation. It derives precise asymptotics for the number of nodes $N_lpha$, edges $N^{(e)}_lpha$, and degree counts $N_{lpha,j}$, delineating four regimes determined by the tail index $c3$ (with $bc^{-1}(x) hicksim x^{-c2}$ as $x o0$), including dense, sparse almost-dense, sparse with power-law ($c2 o0$, $0<c2<1$), and almost extremely sparse cases; in the sparse-power-law regime, the degree distribution exhibits a power-law with exponent in $(1,2)$. The paper also proves convergence results for global and local clustering coefficients and establishes central limit theorems for subgraph counts and $N_lpha$, under technical Assumptions that control small and large-degree behaviour and interactions. Finally, it offers a flexible parametrisation to separate latent structure from global sparsity and power-law properties, showing how to sparsify dense graphon models while preserving clustering characteristics, and demonstrates applicability to a broad class of graphex models including Caron2017. These results yield practical guidance for inference and model design in large, sparse networks.
Abstract
This paper investigates properties of the class of graphs based on exchangeable point processes. We provide asymptotic expressions for the number of edges, number of nodes and degree distributions, identifying four regimes: (i) a dense regime, (ii) a sparse almost dense regime, (iii) a sparse regime with power-law behaviour, and (iv) an almost extremely sparse regime. We show that under mild assumptions, both the global and local clustering coefficients converge to constants which may or may not be the same. We also derive a central limit theorem for the number of nodes. Finally, we propose a class of models within this framework where one can separately control the latent structure and the global sparsity/power-law properties of the graph.
