Normal approximation for number of edges in random intersection graphs
Katarzyna Rybarczyk, Grzegorz Serafin
TL;DR
This work analyzes normal approximation for the edge count in the random intersection graph ${\\mathcal G}(n,m,p)$. It develops a general bound for local statistics via a Stein–Malliavin–type approach applied to chaos decompositions, and introduces clique-cover based combinatorics to bound central moments of subgraph indicators. The authors apply the method to the standardized edge count $\\widetilde{N}_E$, deriving necessary and sufficient conditions for asymptotic normality and providing sharp convergence rates in the sparse regime $mp^3\le 1$ as well as partial results for larger $p$. A flexible framework linking contraction norms to subgraph probabilities through clique covers enables analysis beyond single edges and yields threshold-type behavior in parameter regimes, with potential extensions to larger cliques. Overall, the paper unifies probabilistic and combinatorial techniques to address dependence in random intersection graphs and offers practical bounds across a wide range of parameters.
Abstract
The random intersection graph model $\mathcal G(n,m,p)$ is considered. Due to substantial edge dependencies, studying even fundamental statistics such as the subgraph count is significantly more challenging than in the classical binomial model $\mathcal G(n,p)$. First, we establish normal approximation bound in both the Wasserstein and the Kolmogorov distances for a class of local statistics on $\mathcal G(n,m,p)$. Next, we apply these results to derive such bounds for the standardised number of edges, and determine the necessary and sufficient conditions for its asymptotic normality. We develop a new method that provides a combinatorial interpretation and facilitates the estimation of analytical expressions related to general distance bounds. In particular, this allows us to control the behaviour of central moments of subgraph existence indicators. The presented method can also be extended to count copies of subgraphs larger than a single edge.
