Alignment and Comparison of Directed Networks via Transition Couplings of Random Walks
Bongsoo Yi, Kevin O'Connor, Kevin McGoff, Andrew B. Nobel
TL;DR
NetOTC introduces a parameter-free, transport-based framework for comparing and aligning directed or undirected networks by coupling the full random walks of two graphs through stationary transition couplings that minimize the expected cost $\mathbb{E} c(\tilde{X}_0, \tilde{Y}_0)$. The method outputs a dissimilarity $\rho(G_1,G_2)$ and probabilistic vertex and edge alignments $\pi_v$, $\pi_e$, while preserving edges; it scales to networks of different sizes and has no Monte Carlo randomness. Theoretical results show edge preservation, metric properties on undirected common-vertex networks when $c$ is a metric, and a natural notion of network factors that relate extension and factor graphs via deterministic couplings. Empirically, NetOTC is competitive with or superior to existing OT-based methods in tasks like isomorphism recovery and SBM block alignment, and can detect exact or approximate factor relations, all without tunable parameters.
Abstract
We describe and study a transport based procedure called NetOTC (network optimal transition coupling) for the comparison and alignment of two networks. The networks of interest may be directed or undirected, weighted or unweighted, and may have distinct vertex sets of different sizes. Given two networks and a cost function relating their vertices, NetOTC finds a transition coupling of their associated random walks having minimum expected cost. The minimizing cost quantifies the difference between the networks, while the optimal transport plan itself provides alignments of both the vertices and the edges of the two networks. Coupling of the full random walks, rather than their marginal distributions, ensures that NetOTC captures local and global information about the networks, and preserves edges. NetOTC has no free parameters, and does not rely on randomization. We investigate a number of theoretical properties of NetOTC and present experiments establishing its empirical performance.
