Scaling Up Graph Propagation Computation on Large Graphs: A Local Chebyshev Approximation Approach
Yichun Yang, Rong-Hua Li, Meihao Liao, Longlong Lin, Guoren Wang
TL;DR
The paper tackles scalable graph propagation (GP) for SSPPR and HKPR on large graphs, where Taylor-based PwMethod and Push-based approaches converge slowly. It introduces a novel Chebyshev expansion of GP functions, deriving explicit coefficients for SSPPR and HKPR and using them to build two algorithms: ChebyPower (a Chebyshev-accelerated power method) and ChebyPush (a local push-style method via a subset Chebyshev recurrence). Theoretical results show faster convergence (roughly O(√N) over Taylor) and provable error guarantees, with ChebyPush offering graph-size-independent runtime under suitable thresholds; the paper also generalizes to broader GP operators and bidirectional schemes. Extensive experiments on five real-world graphs demonstrate substantial speedups (often 3–8×) over state-of-the-art baselines for both SSPPR and HKPR, validating the practicality and scalability of the approach for large-scale graph analysis and graph neural network workflows.
Abstract
Graph propagation (GP) computation plays a crucial role in graph data analysis, supporting various applications such as graph node similarity queries, graph node ranking, graph clustering, and graph neural networks. Existing methods, mainly relying on power iteration or push computation frameworks, often face challenges with slow convergence rates when applied to large-scale graphs. To address this issue, we propose a novel and powerful approach that accelerates power iteration and push methods using Chebyshev polynomials. Specifically, we first present a novel Chebyshev expansion formula for general GP functions, offering a new perspective on GP computation and achieving accelerated convergence. Building on these theoretical insights, we develop a novel Chebyshev power iteration method (\ltwocheb) and a novel Chebyshev push method (\chebpush). Our \ltwocheb method demonstrates an approximate acceleration of $O(\sqrt{N})$ compared to existing power iteration techniques for both personalized PageRank and heat kernel PageRank computations, which are well-studied GP problems. For \chebpush, we propose an innovative subset Chebyshev recurrence technique, enabling the design of a push-style local algorithm with provable error guarantee and reduced time complexity compared to existing push methods. We conduct extensive experiments using 5 large real-world datasets to evaluate our proposed algorithms, demonstrating their superior efficiency compared to state-of-the-art approaches.
