$U$-statistics on bipartite exchangeable networks
Tâm Le Minh
TL;DR
This work develops a theory of quadruplet $U$-statistics on bipartite row-column exchangeable networks, proving a general weak convergence result and a central limit theorem in the dissociated case. Leveraging backward martingale methods and the Aldous–Hoover representation, the authors show that the limiting distribution is a mixture of Gaussians in general and Gaussian when dissociated, with explicit variance structures. They apply the theory to Bipartite Expected Degree Distribution (BEDD) models, deriving identifiability by quadruplets and providing practical inference tools for row heterogeneity, network comparison, and motif frequencies, supported by simulations. The framework yields computationally tractable estimators based on simple matrix operations, with clear directions for extensions to larger subgraphs and graphon-based models, and highlights avenues for finite-sample refinements.
Abstract
Bipartite networks with exchangeable nodes can be represented by row-column exchangeable matrices. A quadruplet is a submatrix of size $2 \times 2$. A quadruplet $U$-statistic is the average of a function on a quadruplet over all the quadruplets of a matrix. We prove several asymptotic results for quadruplet $U$-statistics on row-column exchangeable matrices, including a weak convergence result in the general case and a central limit theorem when the matrix is also dissociated. These results are applied to statistical inference in network analysis. We suggest a method to perform parameter estimation, network comparison and motifs count for a particular family of row-column exchangeable network models: the bipartite expected degree distribution (BEDD) models. These applications are illustrated by simulations.
