Enhanced preprocessed multi-step splitting iterations for computing PageRank
Guangcong Meng, Yuehua Feng, Yongxin Dong
TL;DR
The paper tackles slow convergence in PageRank as the damping factor $\alpha$ nears 1 by introducing a novel multi-step splitting framework (MIIO) and two accelerated variants that fuse this framework with Krylov subspace methods: Arnoldi-MIIO using thick restarted Arnoldi and GArnoldi-MIIO using adaptive Generalized Arnoldi. The authors provide theoretical convergence bounds showing contraction factors that guarantee rapid progression, and they validate the approach with extensive experiments on large, sparse Google matrices, demonstrating substantial speedups in CPU time and reduced matrix-vector product counts compared to state-of-the-art baselines. The work advances practical PageRank computation by delivering faster, scalable algorithms with solid convergence guarantees, enabling efficient ranking on very large graphs. The proposed methods hold potential for broader impact in large-scale Markov chain computations and search engine ranking pipelines where damping is close to 1.
Abstract
In recent years, the PageRank algorithm has garnered significant attention due to its crucial role in search engine technologies and its applications across various scientific fields. It is well-known that the power method is a classical method for computing PageRank. However, there is a pressing demand for alternative approaches that can address its limitations and enhance its efficiency. Specifically, the power method converges very slowly when the damping factor is close to 1. To address this challenge, this paper introduces a new multi-step splitting iteration approach for accelerating PageRank computations. Furthermore, we present two new approaches for computating PageRank, which are modifications of the new multi-step splitting iteration approach, specifically utilizing the thick restarted Arnoldi and generalized Arnoldi methods. We provide detailed discussions on the construction and theoretical convergence results of these two approaches. Extensive experiments using large test matrices demonstrate the significant performance improvements achieved by our proposed algorithms.
