Stochastic Degree Sequence Model with Edge Constraints (SDSM-EC) for Backbone Extraction
Zachary P. Neal, Jennifer Watling Neal
TL;DR
The paper tackles bias in projected bipartite networks by focusing on backbone extraction under appropriate null models. It introduces SDSM-EC, an extension of the Stochastic Degree Sequence Model that enforces edge constraints (prohibited edges) in the null space, with $Q'_{ik}$ computed to exclude impossible cell configurations. Through toy and empirical preschool data, SDSM-EC demonstrates sparser backbones by omitting edges that would appear significant under unconstrained nulls, highlighting the importance of correct constraint incorporation. The method is implemented in the backbone R package (sdsm()), and the authors discuss extensions to required edges and faster $Q'-$estimation, underscoring the practical impact for bias-free backbone analysis in constrained bipartite settings.
Abstract
It is common to use the projection of a bipartite network to measure a unipartite network of interest. For example, scientific collaboration networks are often measured using a co-authorship network, which is the projection of a bipartite author-paper network. Caution is required when interpreting the edge weights that appear in such projections. However, backbone models offer a solution by providing a formal statistical method for evaluating when an edge in a projection is statistically significantly strong. In this paper, we propose an extension to the existing Stochastic Degree Sequence Model (SDSM) that allows the null model to include edge constraints (EC) such as prohibited edges. We demonstrate the new SDSM-EC in toy data and empirical data on young children's' play interactions, illustrating how it correctly omits noisy edges from the backbone.
