Repelling Random Walks
Isaac Reid, Eli Berger, Krzysztof Choromanski, Adrian Weller
TL;DR
This work addresses the challenge of efficiently sampling on graphs without bias by introducing repelling random walks, a discrete quasi-Monte Carlo scheme that induces correlations among multiple walkers while preserving each walk's marginal transition probabilities. The method yields unbiased estimators with strengthened concentration and is simple to implement as a drop-in modification (sampling without replacement within neighbor blocks). The authors develop theory and demonstrate strong gains across three domains: graph-kernel estimation via Graph Random Features, PageRank approximation, and graphlet concentration estimation, including a theoretical variance-reduction result for kernels and a general PageRank variance bound. Empirically, repelling walkers consistently outperform iid walkers across synthetic and real graphs, suggesting broad practical impact for graph-based statistical estimation and learning tasks. The work opens avenues for further theoretical analysis and extensions of quasi-Monte Carlo sampling to a wider class of graph estimators and networks.
Abstract
We present a novel quasi-Monte Carlo mechanism to improve graph-based sampling, coined repelling random walks. By inducing correlations between the trajectories of an interacting ensemble such that their marginal transition probabilities are unmodified, we are able to explore the graph more efficiently, improving the concentration of statistical estimators whilst leaving them unbiased. The mechanism has a trivial drop-in implementation. We showcase the effectiveness of repelling random walks in a range of settings including estimation of graph kernels, the PageRank vector and graphlet concentrations. We provide detailed experimental evaluation and robust theoretical guarantees. To our knowledge, repelling random walks constitute the first rigorously studied quasi-Monte Carlo scheme correlating the directions of walkers on a graph, inviting new research in this exciting nascent domain.
