Simple yet Effective Node Property Prediction on Edge Streams under Distribution Shifts
Jongha Lee, Taehyung Kwon, Heechan Moon, Kijung Shin
TL;DR
This work tackles node property prediction on continuously evolving graphs (CTDGs) where distribution shifts undermine traditional TGNNs. It introduces SPLASH, a simple yet effective framework that augments node features with three schemes (random, positional, structural), automatically selects the most robust augmentations via linear models evaluated across multiple shift-aware splits, and uses SLIM, a lightweight MLP-based TGNN, to predict properties in real time. Across dynamic anomaly detection, dynamic node classification, and node affinity prediction on seven real-world datasets (plus synthetic shifts), SPLASH consistently outperforms baselines, especially under distribution shifts, while offering substantial gains in speed and parameter efficiency. The approach thus provides a practical, generalizable solution for robust dynamic graph learning, with code and datasets released for reproducibility.
Abstract
The problem of predicting node properties (e.g., node classes) in graphs has received significant attention due to its broad range of applications. Graphs from real-world datasets often evolve over time, with newly emerging edges and dynamically changing node properties, posing a significant challenge for this problem. In response, temporal graph neural networks (TGNNs) have been developed to predict dynamic node properties from a stream of emerging edges. However, our analysis reveals that most TGNN-based methods are (a) far less effective without proper node features and, due to their complex model architectures, (b) vulnerable to distribution shifts. In this paper, we propose SPLASH, a simple yet powerful method for predicting node properties on edge streams under distribution shifts. Our key contributions are as follows: (1) we propose feature augmentation methods and an automatic feature selection method for edge streams, which improve the effectiveness of TGNNs, (2) we propose a lightweight MLP-based TGNN architecture that is highly efficient and robust under distribution shifts, and (3) we conduct extensive experiments to evaluate the accuracy, efficiency, generalization, and qualitative performance of the proposed method and its competitors on dynamic node classification, dynamic anomaly detection, and node affinity prediction tasks across seven real-world datasets.
