Consistency of empirical distributions of sequences of graph statistics in networks with dependent edges
Jonathan R. Stewart
TL;DR
The paper addresses the challenge of obtaining stable empirical distributions of sequences of graph statistics from networks with dependent edges by developing non-asymptotic, high-probability concentration bounds in the $\ell_\infty$-norm. It introduces weak dependence measures $\mathcal{C}_N$, $\Delta_N$, and $\bm{\mathcal{D}}_N$ and provides two concentration approaches (martingale and covariance-based) to bound $||\widehat{F}_N(\mathbf{X}) - F_N||_\infty$, yielding explicit rates like $\sqrt{\dfrac{\bm{\mathcal{D}}_N \log(\max\{M,1+p\})}{M}}$ and, for a chosen $\alpha$, $\sqrt{\dfrac{1 + \min\{|\mathcal{C}_N|,|\Delta_N|\}}{\alpha M}}$. The results specialize to degree distributions and edgewise shared partner distributions, with corollaries proving uniform convergence and almost-sure convergence under mild local-dependence assumptions, and are validated via simulation studies on curved exponential-family models and $\beta$-models, as well as an application to school-class friendship data. This provides a rigorous statistical foundation for interpreting empirical charts of network statistics in practice, even when edges are not independent. Overall, the work delivers non-asymptotic guarantees for the stability of graph-statistic distributions, facilitating reliable network analysis in dependent-edge settings.
Abstract
One of the first steps in applications of statistical network analysis is frequently to produce summary charts of important features of the network. Many of these features take the form of sequences of graph statistics counting the number of realized events in the network, examples of which include the degree distribution, as well as the edgewise shared partner distribution, and more. We provide conditions under which the empirical distributions of sequences of graph statistics are consistent in the $\ell_{\infty}$-norm in settings where edges in the network are dependent. We accomplish this by elaborating a weak dependence condition which ensures that we can obtain exponential inequalities which bound probabilities of deviations of graph statistics from the expected value. We apply this concentration inequality to empirical distributions of sequences of graph statistics and derive non-asymptotic bounds on the $\ell_{\infty}$-error which hold with high probability. Our non-asymptotic results are then extended to demonstrate uniform convergence almost surely in selected examples. We illustrate theoretical results through examples, simulation studies, and an application.
