Table of Contents
Fetching ...

Online Algorithm for Node Feature Forecasting in Temporal Graphs

Aniq Ur Rahman, Justin P. Coon

TL;DR

An online algorithm mspace is proposed for forecasting node features in temporal graphs, which captures spatial cross-correlation among different nodes as well as the temporal auto-correlation within a node, making it applicable for estimation and generation tasks.

Abstract

In this paper, we propose an online algorithm mspace for forecasting node features in temporal graphs, which captures spatial cross-correlation among different nodes as well as the temporal auto-correlation within a node. The algorithm can be used for both probabilistic and deterministic multi-step forecasting, making it applicable for estimation and generation tasks. Comparative evaluations against various baselines, including temporal graph neural network (TGNN) models and classical Kalman filters, demonstrate that mspace performs at par with the state-of-the-art and even surpasses them on some datasets. Importantly, mspace demonstrates consistent performance across datasets with varying training sizes, a notable advantage over TGNN models that require abundant training samples to effectively learn the spatiotemporal trends in the data. Therefore, employing mspace is advantageous in scenarios where the training sample availability is limited. Additionally, we establish theoretical bounds on multi-step forecasting error of mspace and show that it scales linearly with the number of forecast steps $q$ as $\mathcal{O}(q)$. For an asymptotically large number of nodes $n$, and timesteps $T$, the computational complexity of mspace grows linearly with both $n$, and $T$, i.e., $\mathcal{O}(nT)$, while its space complexity remains constant $\mathcal{O}(1)$. We compare the performance of various mspace variants against ten recent TGNN baselines and two classical baselines, ARIMA and the Kalman filter across ten real-world datasets. Additionally, we propose a technique to generate synthetic datasets to aid in evaluating node feature forecasting methods, with the potential to serve as a benchmark for future research. Lastly, we have investigate the interpretability of different mspace variants by analyzing model parameters alongside dataset characteristics to derive model and data-centric insights.

Online Algorithm for Node Feature Forecasting in Temporal Graphs

TL;DR

An online algorithm mspace is proposed for forecasting node features in temporal graphs, which captures spatial cross-correlation among different nodes as well as the temporal auto-correlation within a node, making it applicable for estimation and generation tasks.

Abstract

In this paper, we propose an online algorithm mspace for forecasting node features in temporal graphs, which captures spatial cross-correlation among different nodes as well as the temporal auto-correlation within a node. The algorithm can be used for both probabilistic and deterministic multi-step forecasting, making it applicable for estimation and generation tasks. Comparative evaluations against various baselines, including temporal graph neural network (TGNN) models and classical Kalman filters, demonstrate that mspace performs at par with the state-of-the-art and even surpasses them on some datasets. Importantly, mspace demonstrates consistent performance across datasets with varying training sizes, a notable advantage over TGNN models that require abundant training samples to effectively learn the spatiotemporal trends in the data. Therefore, employing mspace is advantageous in scenarios where the training sample availability is limited. Additionally, we establish theoretical bounds on multi-step forecasting error of mspace and show that it scales linearly with the number of forecast steps as . For an asymptotically large number of nodes , and timesteps , the computational complexity of mspace grows linearly with both , and , i.e., , while its space complexity remains constant . We compare the performance of various mspace variants against ten recent TGNN baselines and two classical baselines, ARIMA and the Kalman filter across ten real-world datasets. Additionally, we propose a technique to generate synthetic datasets to aid in evaluating node feature forecasting methods, with the potential to serve as a benchmark for future research. Lastly, we have investigate the interpretability of different mspace variants by analyzing model parameters alongside dataset characteristics to derive model and data-centric insights.
Paper Structure (63 sections, 11 theorems, 33 equations, 14 figures, 7 tables, 3 algorithms)

This paper contains 63 sections, 11 theorems, 33 equations, 14 figures, 7 tables, 3 algorithms.

Key Result

Theorem 6.1

The RMSE of mspace for a $q$-step node feature forecast is upper bounded as ${\rm RMSE}(q) \leq \sqrt{ \alpha q^2 + (3\alpha + \beta)q + \beta }$, where $\alpha, \beta \in \mathbb{R}^+$ are constants that depend on the data, as well as the variant of the mspace algorithm.

Figures (14)

  • Figure 1: Markov Approximation.
  • Figure 2: Operation of a queue.
  • Figure 3: Shock Distribution.
  • Figure 4: TGNN architecture.
  • Figure 5: LDS.
  • ...and 9 more figures

Theorems & Definitions (22)

  • Theorem 6.1
  • Corollary 6.1
  • Theorem 6.2
  • Definition A.1
  • proof : Proof of Theorem \ref{['thm:rmse']}
  • Lemma C.1
  • proof
  • Lemma C.2
  • proof
  • Lemma C.3
  • ...and 12 more