Overlapping community detection in weighted networks
Huan Qing
TL;DR
This work addresses overlapping community detection in weighted networks by introducing the Weighted Degree-Corrected Mixed Membership (WDCMM) model, which extends the DCMM framework to arbitrary edge-weight distributions while preserving a low-rank, block-structured mean matrix $\Omega=\Theta\Pi P\Pi'\Theta$. It develops a distribution-free spectral estimator (ScD) that leverages an Ideal Cone structure to recover the mixed-membership matrix $\Pi$ with provable consistency bounds that hold under mild, distribution-agnostic conditions. To determine the number of communities, the authors define the overlapping weighted modularity $Q_{ovw}$ and propose a modularity-driven approach (KScD) that works for both assortative and disassortative, as well as signed, weighted networks. Extensive simulations and real-data experiments show that ScD competes effectively with existing methods across weight regimes, and that $Q_{ovw}$ provides reliable guidance for selecting $K$. The results advance practical tools for analyzing complex weighted networks with overlapping structure and motivate future extensions to Mixed-SCORE, directed networks, and faster scalable algorithms.
Abstract
Over the past decade, community detection in overlapping un-weighted networks, where nodes can belong to multiple communities, has been one of the most popular topics in modern network science. However, community detection in overlapping weighted networks, where edge weights can be any real value, remains challenging. In this article, we propose a generative model called the weighted degree-corrected mixed membership (WDCMM) model to model such weighted networks. This model adopts the same factorization for the expectation of the adjacency matrix as the previous degree-corrected mixed membership (DCMM) model. Our WDCMM extends the DCMM from un-weighted networks to weighted networks by allowing the elements of the adjacency matrix to be generated from distributions beyond Bernoulli. We first address the community membership estimation of the model by applying a spectral algorithm and establishing a theoretical guarantee of consistency. Then, we propose overlapping weighted modularity to measure the quality of overlapping community detection for both assortative and dis-assortative weighted networks. To determine the number of communities, we incorporate the algorithm into the proposed modularity. We demonstrate the advantages of the model and the modularity through applications to simulated data and real-world networks.
