FLASH: Flexible Learning of Adaptive Sampling from History in Temporal Graph Neural Networks
Or Feldman, Krishna Sri Ipsit Mantri, Carola-Bibiane Schönlieb, Chaim Baskin, Moshe Eliasof
TL;DR
This work tackles the inefficiency and rigidity of historical-neighborhood sampling in temporal graph neural networks by introducing FLASH, a learnable, graph-adaptive sampling framework. FLASH learns to score historical neighbors using spatial-temporal embeddings and a link-aware context, selecting the top-$k$ neighbors via differentiable scoring and training with a self-supervised ranking objective that compares against uniform baselines. Theoretical results show FLASH strictly surpasses traditional heuristics in expressiveness, and extensive experiments across multiple TGNN backbones and dynamic-graph benchmarks demonstrate consistent improvements with manageable overhead. The proposed approach enables TGNNs to leverage long histories more effectively, enhancing future link prediction in dynamic graphs without requiring architectural changes to existing models.
Abstract
Aggregating temporal signals from historic interactions is a key step in future link prediction on dynamic graphs. However, incorporating long histories is resource-intensive. Hence, temporal graph neural networks (TGNNs) often rely on historical neighbors sampling heuristics such as uniform sampling or recent neighbors selection. These heuristics are static and fail to adapt to the underlying graph structure. We introduce FLASH, a learnable and graph-adaptive neighborhood selection mechanism that generalizes existing heuristics. FLASH integrates seamlessly into TGNNs and is trained end-to-end using a self-supervised ranking loss. We provide theoretical evidence that commonly used heuristics hinders TGNNs performance, motivating our design. Extensive experiments across multiple benchmarks demonstrate consistent and significant performance improvements for TGNNs equipped with FLASH.
