Bipartite mixed membership distribution-free model. A novel model for community detection in overlapping bipartite weighted networks
Huan Qing, Jingli Wang
TL;DR
BiMMDF delivers a unified distribution-free framework for overlapping bipartite weighted networks by letting edge weights come from an arbitrary distribution with mean $\Omega=\rho\Pi_r P \Pi_c^\top$. The proposed DiSP spectral algorithm achieves consistent mixed-membership estimation under distribution-specific separation conditions, with a rigorous analysis of how $\gamma$ and $\rho$ shape error rates. A missing-edge extension enables scalable handling of sparse networks by coupling BiMMDF with bipartite unweighted models. Empirical results on synthetic data and eight real networks demonstrate robust performance across diverse distributions, including signed weights, and show practical utility for uncovering complex, overlapping community structure. Together, these contributions offer a flexible, scalable approach that extends classic blockmodels to general, overlapping bipartite weighted networks with theoretical guarantees.
Abstract
Modeling and estimating mixed memberships for overlapping unipartite un-weighted networks has been well studied in recent years. However, to our knowledge, there is no model for a more general case, the overlapping bipartite weighted networks. To close this gap, we introduce a novel model, the Bipartite Mixed Membership Distribution-Free (BiMMDF) model. Our model allows an adjacency matrix to follow any distribution as long as its expectation has a block structure related to node membership. In particular, BiMMDF can model overlapping bipartite signed networks and it is an extension of many previous models, including the popular mixed membership stochastic blcokmodels. An efficient algorithm with a theoretical guarantee of consistent estimation is applied to fit BiMMDF. We then obtain the separation conditions of BiMMDF for different distributions. Furthermore, we also consider missing edges for sparse networks. The advantage of BiMMDF is demonstrated in extensive synthetic networks and eight real-world networks.
