Online Continual Graph Learning
Giovanni Donghi, Luca Pasa, Daniele Zambon, Cesare Alippi, Nicolò Navarin
TL;DR
This work defines Online Continual Graph Learning (OCGL), a framework for node-level learning on evolving graphs under strict memory and computation budgets with anytime-inference needs. It formalizes the problem, analyzes neighborhood expansion, and proposes simple neighborhood-sampling strategies alongside a minimal but effective baseline, LINEAR. A comprehensive benchmark across seven datasets and nine adapted CL strategies reveals replay-based methods generally excel, with LINEAR offering a fast, robust alternative. The study lays a foundation for systematic progress in OCGL and highlights directions such as addressing neighborhood drift, richer streams, and broader tasks like link prediction.
Abstract
Continual Learning (CL) aims to incrementally acquire new knowledge while mitigating catastrophic forgetting. Within this setting, Online Continual Learning (OCL) focuses on updating models promptly and incrementally from single or small batches of observations from a data stream. Extending OCL to graph-structured data is crucial, as many real-world networks evolve over time and require timely, online predictions. However, existing continual or streaming graph learning methods typically assume access to entire graph snapshots or multiple passes over tasks, violating the efficiency constraints of the online setting. To address this gap, we introduce the Online Continual Graph Learning (OCGL) setting, which formalizes node-level continual learning on evolving graphs under strict memory and computational budgets. OCGL defines how a model incrementally processes a stream of node-level information while maintaining anytime inference and respecting resource constraints. We further establish a comprehensive benchmark comprising seven datasets and nine CL strategies, suitably adapted to the OCGL setting, enabling a standardized evaluation setup. Finally, we present a minimalistic yet competitive baseline for OCGL, inspired by our benchmarking results, that achieves strong empirical performance with high efficiency.
