EvoluNet: Advancing Dynamic Non-IID Transfer Learning on Graphs
Haohui Wang, Yuzhen Mao, Yujun Yan, Yaoqing Yang, Jianhui Sun, Kevin Choi, Balaji Veeramani, Alison Hu, Edward Bowen, Tyler Cody, Dawei Zhou
TL;DR
This work addresses dynamic non-IID transfer learning on graphs, where source graphs observed over $T$ timestamps inform predictions on a evolving target graph at $T+1$ with limited labels. It derives a novel generalization bound showing the target error $\epsilon_{tgt}^{(T+1)}(h)$ is governed by the minimum historical empirical error, a dynamic Wasserstein distance $\tilde{W}_p$ that captures domain evolution, and the hypothesis complexity term $\tilde{\Re}(\mathcal{H}_{\mathcal{L}})$. Guided by this theory, EvoluNet combines a transformer-based multi-resolution temporal encoding (M1) with dual-divergence unification (M2) to learn domain-invariant representations across evolving graphs. On DBLP and HCP benchmarks, EvoluNet achieves up to $12.1\%$ relative improvements in AUC over strong baselines, demonstrating the practical value of modeling temporal evolution and cross-domain alignment for dynamic graph transfer learning.
Abstract
Non-IID transfer learning on graphs is crucial in many high-stakes domains. The majority of existing works assume stationary distribution for both source and target domains. However, real-world graphs are intrinsically dynamic, presenting challenges in terms of domain evolution and dynamic discrepancy between source and target domains. To bridge the gap, we shift the problem to the dynamic setting and pose the question: given the label-rich source graphs and the label-scarce target graphs both observed in previous T timestamps, how can we effectively characterize the evolving domain discrepancy and optimize the generalization performance of the target domain at the incoming T+1 timestamp? To answer it, we propose a generalization bound for dynamic non-IID transfer learning on graphs, which implies the generalization performance is dominated by domain evolution and domain discrepancy between source and target graphs. Inspired by the theoretical results, we introduce a novel generic framework named EvoluNet. It leverages a transformer-based temporal encoding module to model temporal information of the evolving domains and then uses a dynamic domain unification module to efficiently learn domain-invariant representations across the source and target domains. Finally, EvoluNet outperforms the state-of-the-art models by up to 12.1%, demonstrating its effectiveness in transferring knowledge from dynamic source graphs to dynamic target graphs.
