Table of Contents
Fetching ...

Approximate Graph Propagation Revisited: Dynamic Parameterized Queries, Tighter Bounds and Dynamic Updates

Zhuowei Zhao, Zhuo Zhang, Hanzhi Wang, Junhao Gan, Zhifeng Bao, Jianzhong Qi

TL;DR

A new algorithm is proposed, AGP-Static++, which is simpler yet reduces roughly a factor of $O(\log^2 n)$ in the query complexity while preserving the approximation guarantees of AGP-Static, however, AGP-Static++ still requires $O(n)$ time to process each update.

Abstract

We revisit Approximate Graph Propagation (AGP), a unified framework which captures various graph propagation tasks, such as PageRank, feature propagation in Graph Neural Networks (GNNs), and graph-based Retrieval-Augmented Generation (RAG). Our work focuses on the settings of dynamic graphs and dynamic parameterized queries, where the underlying graphs evolve over time (updated by edge insertions or deletions) and the input query parameters are specified on the fly to fit application needs. Our first contribution is an interesting observation that the SOTA solution, AGP-Static, can be adapted to support dynamic parameterized queries; however several challenges remain unresolved. Firstly, the query time complexity of AGP-Static is based on an assumption of using an optimal algorithm for subset sampling in its query algorithm. Unfortunately, back to that time, such an algorithm did not exist; without such an optimal algorithm, an extra $O(\log^2 n)$ factor is required in the query complexity, where $n$ is the number of vertices in the graphs. Secondly, AGP-Static performs poorly on dynamic graphs, taking $O(n\log n)$ time to process each update. To address these challenges, we propose a new algorithm, AGP-Static++, which is simpler yet reduces roughly a factor of $O(\log^2 n)$ in the query complexity while preserving the approximation guarantees of AGP-Static. However, AGP-Static++ still requires $O(n)$ time to process each update. To better support dynamic graphs, we further propose AGP-Dynamic, which achieves $O(1)$ amortized time per update, significantly improving the aforementioned $O(n)$ per-update bound, while still preserving the query complexity and approximation guarantees. Last, our comprehensive experiments validate the theoretical improvements: compared to the baselines, our algorithm achieves speedups of up to $177\times$ on update time and $10\times$ on query efficiency.

Approximate Graph Propagation Revisited: Dynamic Parameterized Queries, Tighter Bounds and Dynamic Updates

TL;DR

A new algorithm is proposed, AGP-Static++, which is simpler yet reduces roughly a factor of in the query complexity while preserving the approximation guarantees of AGP-Static, however, AGP-Static++ still requires time to process each update.

Abstract

We revisit Approximate Graph Propagation (AGP), a unified framework which captures various graph propagation tasks, such as PageRank, feature propagation in Graph Neural Networks (GNNs), and graph-based Retrieval-Augmented Generation (RAG). Our work focuses on the settings of dynamic graphs and dynamic parameterized queries, where the underlying graphs evolve over time (updated by edge insertions or deletions) and the input query parameters are specified on the fly to fit application needs. Our first contribution is an interesting observation that the SOTA solution, AGP-Static, can be adapted to support dynamic parameterized queries; however several challenges remain unresolved. Firstly, the query time complexity of AGP-Static is based on an assumption of using an optimal algorithm for subset sampling in its query algorithm. Unfortunately, back to that time, such an algorithm did not exist; without such an optimal algorithm, an extra factor is required in the query complexity, where is the number of vertices in the graphs. Secondly, AGP-Static performs poorly on dynamic graphs, taking time to process each update. To address these challenges, we propose a new algorithm, AGP-Static++, which is simpler yet reduces roughly a factor of in the query complexity while preserving the approximation guarantees of AGP-Static. However, AGP-Static++ still requires time to process each update. To better support dynamic graphs, we further propose AGP-Dynamic, which achieves amortized time per update, significantly improving the aforementioned per-update bound, while still preserving the query complexity and approximation guarantees. Last, our comprehensive experiments validate the theoretical improvements: compared to the baselines, our algorithm achieves speedups of up to on update time and on query efficiency.

Paper Structure

This paper contains 17 sections, 11 theorems, 15 equations, 11 figures, 3 tables, 4 algorithms.

Key Result

Theorem 1

The expected query time complexity of AGP algorithms (e.g., AGP-Static and AGP-Static++) is bounded by where $E[C_{\text{init}}]$ is the expected cost of initialization, i.e., reading the query input $q(a, b, \mathcal{O}_w, \mathbf{x})$.

Figures (11)

  • Figure 1: Average query processing time (second)
  • Figure 2: Relative error (%) vs. query time (second)
  • Figure 3: Average query time (second) on different applications
  • Figure 4: Average update time ($\times10^{-6}$ second)
  • Figure 5: Average update time ($\times10^{-6}$ sec) vs. update pattern
  • ...and 6 more figures

Theorems & Definitions (11)

  • Theorem 1
  • Corollary 1
  • Corollary 2
  • Corollary 3
  • Lemma 1
  • Lemma 2
  • Lemma 3
  • Lemma 4
  • Lemma 5
  • Theorem 2
  • ...and 1 more