An iterative spectral algorithm for digraph clustering
James Martin, Tim Rogers, Luca Zanetti
TL;DR
The paper addresses clustering in directed graphs where preserving edge-direction patterns is crucial. It introduces an iterative spectral algorithm that builds a problem-adapted Hermitian representation $M^{\mathcal{S}}$ conditioned on the current clustering $\mathcal{S}$ and uses a normalized eigenbasis of $L(M^{\mathcal{S}})$ to refine the partition, repeating for $T$ iterations and selecting the best clustering by a directional-flow objective $\delta$. The key contribution is a flexible framework that encodes arbitrary meta-graphs via complex roots of unity and a penalty mechanism, enabling accurate recovery under the Directed Stochastic Block Model and real-world graphs like Hearthstone, Florida Bay, and C. elegans brain networks. The approach outperforms the state-of-the-art CLSZ in synthetic experiments and yields interpretable, directionally-consistent cluster structures in diverse datasets, at the cost of additional computation due to repeated spectral decompositions. This gives a practical, tunable tool for revealing higher-order directional patterns in digraphs, with potential applications in ecology, neuroscience, and game analytics.
Abstract
Graph clustering is a fundamental technique in data analysis with applications in many different fields. While there is a large body of work on clustering undirected graphs, the problem of clustering directed graphs is much less understood. The analysis is more complex in the directed graph case for two reasons: the clustering must preserve directional information in the relationships between clusters, and directed graphs have non-Hermitian adjacency matrices whose properties are less conducive to traditional spectral methods. Here we consider the problem of partitioning the vertex set of a directed graph into $k\ge 2$ clusters so that edges between different clusters tend to follow the same direction. We present an iterative algorithm based on spectral methods applied to new Hermitian representations of directed graphs. Our algorithm performs favourably against the state-of-the-art, both on synthetic and real-world data sets. Additionally, it is able to identify a "meta-graph" of $k$ vertices that represents the higher-order relations between clusters in a directed graph. We showcase this capability on data sets pertaining food webs, biological neural networks, and the online card game Hearthstone.
