Fast Computation for the Forest Matrix of an Evolving Graph
Haoxin Sun, Xiaotian Zhou, Zhongzhi Zhang
TL;DR
This work tackles the problem of efficiently querying entries $\omega_{ij}$ of the forest matrix $\boldsymbol{\Omega}=(\mathbf{I}+\mathbf{L})^{-1}$, where $\mathbf{L}$ is the graph Laplacian, on evolving graphs. For static graphs it introduces SFQ and SFQPlus, both built on extensions of Wilson's algorithm: SFQ uses a probabilistic interpretation of $\omega_{ij}$ to obtain an unbiased estimator, and SFQPlus adds variance reduction to lower the estimator's variance. For dynamic graphs, it develops Insert-Update and Delete-Update procedures that maintain a uniformly sampled list of spanning converging forests, with a prune mechanism to cap growth of that list, achieving $O(1)$-time query and update performance in practice. Extensive experiments on large real-world networks show that SFQPlus consistently outperforms SFQ in accuracy and that the approach scales to graphs with tens of millions of nodes, where exact solvers fail. The techniques enable fast, scalable forest-matrix analyses for network science, opinion dynamics, and related domains.
Abstract
The forest matrix plays a crucial role in network science, opinion dynamics, and machine learning, offering deep insights into the structure of networks and the dynamics on them. In this paper, we study the problem of querying entries of the forest matrix in evolving graphs, which represent the dynamic nature of real-world networks more accurately than static graphs do. To address the unique challenges posed by evolving graphs, we first introduce two approximation algorithms for static graphs, SFQ and SFQPlus. SFQ employs a probabilistic interpretation of the forest matrix, while SFQPlus incorporates a novel variance reduction technique and is theoretically proven to offer enhanced accuracy. Based on these two algorithms, we further devise two dynamic algorithms centered on efficiently maintaining a list of spanning converging forests. This approach ensures $O(1)$ runtime complexity for updates, including edge additions and deletions, as well as for querying matrix elements, and provides an unbiased estimate of forest matrix entries. Finally, through extensive experiments on various real-world networks, we demonstrate the efficiency and effectiveness of our algorithms. In particular, our algorithms scale to massive graphs with more than forty million nodes.
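
To make the probabilistic interpretation concrete, here is a minimal Python sketch of a plain Monte Carlo estimator. It is not the authors' SFQ or SFQPlus implementation. It assumes an unweighted, undirected graph stored as adjacency lists, and it relies on the standard fact that $\omega_{ij}$ equals the probability that node $i$ lies in the tree rooted at $j$ in a uniformly random rooted spanning forest. Forests are sampled with the usual extension of Wilson's algorithm, in which a loop-erased random walk at a node $u$ of degree $d_u$ stops (making $u$ a root) with probability $1/(1+d_u)$. The function names `sample_forest_roots` and `estimate_forest_entry` are hypothetical, chosen only for this illustration.

```python
import random


def sample_forest_roots(adj, rng):
    """Sample one rooted spanning forest; return root[i] for every node i.

    adj: adjacency lists of an unweighted, undirected graph (assumed format).
    rng: a random.Random instance.
    """
    n = len(adj)
    in_forest = [False] * n
    nxt = [-1] * n    # successor on the walk; -1 marks a root
    root = [-1] * n
    for start in range(n):
        u = start
        # Random walk until it hits the forest or stops at a new root.
        # Overwriting nxt[u] on a revisit erases loops implicitly.
        while not in_forest[u]:
            if rng.random() < 1.0 / (1 + len(adj[u])):
                in_forest[u] = True   # u becomes a root
                root[u] = u
                nxt[u] = -1
            else:
                nxt[u] = rng.choice(adj[u])
                u = nxt[u]
        r = root[u]
        # Retrace the loop-erased path and attach it to the forest.
        u = start
        while not in_forest[u]:
            in_forest[u] = True
            root[u] = r
            u = nxt[u]
    return root


def estimate_forest_entry(adj, i, j, num_samples, seed=0):
    """Estimate omega_ij as the fraction of sampled forests with root(i) == j."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(num_samples):
        if sample_forest_roots(adj, rng)[i] == j:
            hits += 1
    return hits / num_samples


# Path graph 0 - 1 - 2; the exact value of omega_00 is 5/8 = 0.625.
path = [[1], [0, 2], [1]]
print(estimate_forest_entry(path, 0, 0, num_samples=20000))
```

For the three-node path, $\mathbf{I}+\mathbf{L}$ has determinant 8 and the graph has 8 rooted spanning forests, 5 of which have node 0 as a root, so the estimate should approach $5/8$ as the number of samples grows. Each sample costs time proportional to the length of the random walks, which is why maintaining a reusable list of sampled forests, as the abstract describes, matters for evolving graphs.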
