Revisiting Local PageRank Estimation on Undirected Graphs: Simple and Optimal
Hanzhi Wang
TL;DR
This work tackles the problem of locally estimating the PageRank score of a single target node in undirected graphs. It introduces BackMC, a simple Monte Carlo–style algorithm based on $\alpha$-discounted random walks from the target, and proves a tight worst-case time bound of $O\left(\frac{1}{d_{\mathrm{min}}}\cdot \min\left(d_t, m^{1/2}\right)\right)$, along with a matching lower bound. The analysis establishes that the estimator is unbiased, bounds its variance, and uses median-based amplification to meet a specified failure probability while keeping the overall runtime optimal. Empirical results on real-world and synthetic graphs demonstrate BackMC's substantial gains in both efficiency and accuracy over prior methods, including SetPush. The work advances the theoretical understanding of local PageRank in undirected graphs and offers a practical tool for scalable graph analysis and downstream tasks like graph neural networks.
Abstract
We propose a simple and optimal algorithm, BackMC, for local PageRank estimation in undirected graphs: given an arbitrary target node $t$ in an undirected graph $G$ comprising $n$ nodes and $m$ edges, BackMC estimates the PageRank score of node $t$ with a small relative error and a high success probability. The worst-case computational complexity of BackMC is upper bounded by $O\left(\frac{1}{d_{\mathrm{min}}}\cdot \min\left(d_t, m^{1/2}\right)\right)$, where $d_{\mathrm{min}}$ denotes the minimum degree of $G$ and $d_t$ denotes the degree of $t$. Compared to the previously best upper bound of $O\left(\log{n}\cdot \min\left(d_t, m^{1/2}\right)\right)$ (VLDB '23), which is derived from a significantly more complex algorithm and analysis, our BackMC improves the computational complexity for this problem by a factor of $\Theta\left(\frac{\log{n}}{d_{\mathrm{min}}}\right)$ with a much simpler algorithm. Furthermore, we establish a matching lower bound of $\Omega\left(\frac{1}{d_{\mathrm{min}}}\cdot \min\left(d_t, m^{1/2}\right)\right)$ for any algorithm that attempts to solve the problem of local PageRank estimation, demonstrating the theoretical optimality of our BackMC. We conduct extensive experiments on various large-scale real-world and synthetic graphs, where BackMC consistently shows superior performance.
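
To make the idea of estimating PageRank from walks started at the target concrete, the following is a minimal Python sketch, not the paper's exact BackMC procedure. It relies on the standard reversibility identity for undirected graphs, $d_s \cdot \pi_s(t) = d_t \cdot \pi_t(s)$, where $\pi_s(t)$ is the personalized PageRank of $t$ with respect to $s$. Under this identity, the PageRank of $t$ equals the expectation of $\frac{d_t}{n \cdot d_S}$, where $S$ is the node at which an $\alpha$-discounted walk from $t$ stops. The function name `estimate_pagerank`, the adjacency-list input, and the parameters `walks_per_run` and `runs` are assumptions made for illustration; in particular, the number of walks is fixed by hand here rather than derived from the paper's analysis, and the graph is assumed to have no isolated nodes.

```python
import random
from statistics import median


def estimate_pagerank(adj, t, alpha=0.2, walks_per_run=2000, runs=5, seed=0):
    """Estimate the PageRank of node t in an undirected graph.

    adj is a list of neighbor lists (hypothetical input format); every node
    must have at least one neighbor. Illustrative sketch only.
    """
    rng = random.Random(seed)
    n = len(adj)
    d_t = len(adj[t])
    run_means = []
    for _ in range(runs):
        total = 0.0
        for _ in range(walks_per_run):
            u = t
            # alpha-discounted walk: at each step stop with probability alpha,
            # otherwise move to a uniformly random neighbor.
            while rng.random() >= alpha:
                u = rng.choice(adj[u])
            # Unbiased sample of PageRank(t) by the reversibility identity.
            total += d_t / (n * len(adj[u]))
        run_means.append(total / walks_per_run)
    # Median over independent runs reduces the failure probability.
    return median(run_means)


# Usage: a star graph with center 0 and three leaves.
star = [[1, 2, 3], [0], [0], [0]]
print(estimate_pagerank(star, t=0))
```

On a regular graph every sample equals $1/n$, so the estimate is exact; on irregular graphs the accuracy depends on the number of walks, which this sketch leaves as a tunable parameter.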
