Pseudo-likelihood-based $M$-estimation of random graphs with dependent edges and parameter vectors of increasing dimension
Jonathan R. Stewart, Michael Schweinberger
TL;DR
This work develops scalable, statistically guaranteed inference for exponential-family random graphs with dependent edges and high-dimensional parameter vectors learned from a single network. It introduces a probabilistic framework that incorporates overlapping subpopulations to model brokerage and heterogeneity, yielding generalized $\beta$-models with dependent edges that cover dense and sparse regimes. The authors establish convergence rates for pseudo-likelihood-based $M$-estimators and provide sharp bounds on coupling, Hessian invertibility, and sufficient-statistic smoothness, clarifying how phase transitions and near-degeneracy affect estimation. The results yield concrete rates and conditions for both independent-edge and dependent-edge settings, with corollaries detailing how large the parameter dimension can be as a function of $N$ while maintaining consistency. Overall, the paper delivers a tractable, scalable pathway for reliable inference in complex network data under single-observation and increasing-dimensional paradigms.
Abstract
An important question in statistical network analysis is how to estimate models of discrete and dependent network data with intractable likelihood functions, without sacrificing computational scalability and statistical guarantees. We demonstrate that scalable estimation of random graph models with dependent edges is possible, by establishing convergence rates of pseudo-likelihood-based $M$-estimators for discrete undirected graphical models with exponential parameterizations and parameter vectors of increasing dimension in single-observation scenarios. We highlight the impact of two complex phenomena on the convergence rate: phase transitions and model near-degeneracy. The main results have possible applications to discrete and dependent network, spatial, and temporal data. To showcase convergence rates, we introduce a novel class of generalized $β$-models with dependent edges and parameter vectors of increasing dimension, which leverage additional structure in the form of overlapping subpopulations to control dependence. We establish convergence rates of pseudo-likelihood-based $M$-estimators for generalized $β$-models in dense- and sparse-graph settings.
