Detecting Groups in Directed and Non-Directed Bipartite Networks
Alexandre Benatti, Luciano da F. Costa
TL;DR
This paper addresses detecting groups in directed and non-directed bipartite networks by leveraging a coincidence similarity index to transform bipartite representations into similarity networks whose connectivity reveals clusters. It extends the coincidence similarity index, defining $C(A,B)=J(A,B)I(A,B)$ with $J( vec{x}, vec{y})=\frac{\sum_i \min\{x_i,y_i\}}{\sum_i \max\{x_i,y_i\}}$ and $I( vec{x},\nvec{y})=\frac{\sum_i \min\{x_i,y_i\}}{\min\{\sum_i x_i,\sum_i y_i\}}$, to transform bipartite data into coincidence similarity networks that expose community structure. The authors generate synthetic modular bipartite networks with controllable numbers of groups and overlap via a scrambling probability $p$, and demonstrate that direct representations with more features yield the best group separation, outperforming projections with respect to modularity-based metrics. The results show robust separation even under substantial overlap and provide a scalable framework for evaluating group detection in bipartite systems, with avenues for extending to multipartite networks, variable group sizes, and node-subset analyses.
Abstract
Bipartite networks provide an effective resource for representing, characterizing, and modeling several abstract and real-world systems and structures involving binary relations, which include food webs, social interactions, and customer-product relationships. Of particular interest is the problem of, given a specific bipartite network, to identify possible respective groups or clusters characterized by similar interconnecting patterns. The present work approaches this issue by extending and complementing a previously described coincidence similarity methodology (Bioarxiv, doi.org/10.1101/2022.07.16.500294) in several manners, including the consideration of direct and non-directed bipartite networks, the characterization of groups in those networks, as well as considering synthetic bipartite networks presenting groups as a resource for studying the performance of the described methodology. Several interesting results are described and discussed, including the corroboration of the potential of the coincidence similarity methodology for achieving enhanced separation between the groups in bipartite networks.
