Strong and Weak Random Walks on Signed Networks
Shazia'Ayn Babul, Yu Tian, Renaud Lambiotte
TL;DR
This work addresses how to design random walks on signed networks to uncover multi‑community structure under weak balance, going beyond traditional two‑community (strong) models. It introduces a strong walk tied to the signed Laplacian and a novel weak walk that permits at most one negative edge per path, incorporating restart/teleportation to yield finite, interpretable kernels. Through synthetic SSBMs and real networks (e.g., Highland Tribes, Sampson’s Monastery), the weak walk consistently outperforms the strong walk when the network contains more than two communities or exhibits density asymmetries, and it remains competitive with the three‑parameter SRWR while being parameter‑free. These findings suggest that replacing strong‑balance‑biased walks with weak walks can improve a wide range of signed‑network algorithms, including unsupervised and semi‑supervised clustering, with broad practical impact for community detection and network embedding on signed graphs.
Abstract
Random walks play an important role in probing the structure of complex networks. On traditional networks, they can be used to extract community structure, understand node centrality, perform link prediction, or capture the similarity between nodes. On signed networks, where the edge weights can be either positive or negative, it is non-trivial to design a random walk which can be used to extract information about the signed structure of the network, in particular the ability to partition the graph into communities with positive edges inside and negative edges in between. Prior works on signed network random walks focus on the case where there are only two such communities (strong balance), which is rarely the case in empirical networks. In this paper, we propose a signed network random walk which can capture the structure of a network with more than two such communities (weak balance). The walk results in a similarity matrix which can be used to cluster the nodes into antagonistic communities. We compare the characteristics of the so-called strong and weak random walks, in terms of walk length and stationarity. We show through a series of experiments on synthetic and empirical networks that the similarity matrix based on weak walks can be used for both unsupervised and semi-supervised clustering, outperforming the same similarity matrix based on strong walks when the graph has more than two communities, or exhibits asymmetry in the density of links. These results suggest that other random-walk based algorithms for signed networks could be improved simply by running them with weak walks instead of strong walks.
