Topology-Aware Dynamic Reweighting for Distribution Shifts on Graph
Weihuang Zheng, Jiashuo Liu, Jiaxing Li, Jiayun Wu, Peng Cui, Youyong Kong
TL;DR
This work tackles distribution shifts in graph-based node classification by proposing Topology-Aware Dynamic Reweighting (TAR), which reweights samples through gradient flow in the discrete geometric Wasserstein space to incorporate graph topology into robustness. The method casts training as a minimax problem over model parameters and sample densities, using entropy and topology-based penalties to regularize the reweighting process. The authors prove that the inner gradient flow approximates a local worst-case distribution, yielding distributional robustness with an error bound that decays exponentially with the number of gradient steps $T_{\text{in}}$. Empirically, TAR improves over strong baselines on four OOD datasets and three class-imbalanced datasets, without requiring domain labels, and demonstrates resilience to both covariate and concept shifts as well as label imbalance. The results suggest that leveraging graph structure via geometric Wasserstein gradient flow provides a principled and effective avenue for graph OOD generalization and robust node classification.
Abstract
Graph Neural Networks (GNNs) are widely used for node classification tasks but often fail to generalize when training and test nodes come from different distributions, limiting their practicality. To overcome this, recent approaches adopt invariant learning techniques from the out-of-distribution (OOD) generalization field, which seek to establish stable prediction methods across environments. However, the applicability of these invariant assumptions to graph data remains unverified, and such methods often lack solid theoretical support. In this work, we introduce the Topology-Aware Dynamic Reweighting (TAR) framework, which dynamically adjusts sample weights through gradient flow in the geometric Wasserstein space during training. Instead of relying on strict invariance assumptions, we prove that our method is able to provide distributional robustness, thereby enhancing the out-of-distribution generalization performance on graph data. By leveraging the inherent graph structure, TAR effectively addresses distribution shifts. Our framework's superiority is demonstrated through standard testing on four graph OOD datasets and three class-imbalanced node classification datasets, exhibiting marked improvements over existing methods.
