Community detection in multi-layer bipartite networks
Huan Qing
TL;DR
This paper introduces a multi-layer degree-corrected stochastic co-block model (multi-layer DC-ScBM) for community detection in bipartite networks and proposes a debiased spectral co-clustering algorithm, NcDSoS, to recover the shared row and column communities across layers. It provides rigorous high-probability error bounds showing consistency that improve as the number of layers grows, highlighting the benefit of multi-layer structure. The authors establish a series of lemmas and theorems, including concentration bounds via rectangular matrix Bernstein and perturbation analyses, and validate the method through extensive simulations and real-data applications. The work advances cross-layer community detection in bipartite networks and lays groundwork for scalable, theoretically grounded techniques applicable to complex multi-layer systems.
Abstract
The problem of community detection in multi-layer undirected networks has received considerable attention in recent years. However, practical scenarios often involve multi-layer bipartite networks, where each layer consists of two distinct types of nodes. Existing community detection algorithms tailored for multi-layer undirected networks are not directly applicable to multi-layer bipartite networks. To address this challenge, this paper introduces a novel multi-layer degree-corrected stochastic co-block model specifically designed to capture the underlying community structure within multi-layer bipartite networks. Within this framework, we propose an efficient debiased spectral co-clustering algorithm for detecting nodes' communities. We establish the consistent estimation property of our proposed algorithm and demonstrate that an increased number of layers in bipartite networks improves the accuracy of community detection. Through extensive numerical experiments, we showcase the superior performance of our algorithm compared to existing methods. Additionally, we validate our algorithm by applying it to real-world multi-layer network datasets, yielding meaningful and insightful results.
