From Link Prediction to Forecasting: Addressing Challenges in Batch-based Temporal Graph Learning
Moritz Lampert, Christopher Blöcker, Ingo Scholtes
TL;DR
This work critically examines batch-based evaluation in dynamic temporal graphs and shows that fixed-size batches cause information loss, information leakage, and inconsistent task difficulty across time windows, especially in continuous-time data. It reframes dynamic link prediction as dynamic link forecasting using fixed-duration time windows, mitigating leakage and aligning evaluation with real-world temporal patterns. Across 14 continuous-time and 6 discrete-time real-world datasets, the authors demonstrate substantial performance shifts between forecasting and prediction for state-of-the-art TGNNs, notably memory-based models, and provide practical implementations ( SnapshotLoader, DyGLib extensions) to facilitate adoption. The study contributes a principled evaluation paradigm that yields fairer comparisons and more realistic assessments of model capabilities in temporal graphs, with implications for both research and applied deployments.
Abstract
Dynamic link prediction is an important problem often considered in recent works proposing various approaches for learning temporal edge patterns. To assess their efficacy, models are evaluated on benchmark datasets involving continuous-time and discrete-time temporal graphs. However, as we show in this work, the suitability of common batch-oriented evaluation depends on the datasets' characteristics, which can cause multiple issues: For continuous-time temporal graphs, fixed-size batches create time windows with different durations, resulting in an inconsistent dynamic link prediction task. For discrete-time temporal graphs, the sequence of batches can additionally introduce temporal dependencies that are not present in the data. In this work, we empirically show that this common evaluation approach leads to skewed model performance and hinders the fair comparison of methods. We mitigate this problem by reformulating dynamic link prediction as a link forecasting task that better accounts for temporal information present in the data.
