Graph Data Augmentation with Contrastive Learning on Covariate Distribution Shift
Fanlong Zeng, Wensheng Gan
TL;DR
Graph classification under covariate distribution shift is improved by MPAIACL, which leverages contrastive learning in latent space to disentangle stable from environmental features. Building on AIA, it strengthens both the stable feature generator and the adversarial augmenter with InfoNCE and triplet/Wasserstein objectives, respectively. The approach is theoretically motivated and empirically validated on diverse OOD datasets, showing robust generalization and competitive gains over baselines. This work highlights the practical value of latent-space information for out-of-distribution robustness in graph learning.
Abstract
Covariate distribution shift occurs when certain structural features present in the test set are absent from the training set. It is a common type of out-of-distribution (OOD) problem, frequently encountered in real-world graph data with complex structures. Existing research has revealed that most out-of-the-box graph neural networks (GNNs) fail to account for covariate shifts. Furthermore, we observe that existing methods aimed at addressing covariate shifts often fail to fully leverage the rich information contained within the latent space. Motivated by the potential of the latent space, we introduce a new method called MPAIACL for More Powerful Adversarial Invariant Augmentation using Contrastive Learning. MPAIACL leverages contrastive learning to unlock the full potential of vector representations by harnessing their intrinsic information. Through extensive experiments, MPAIACL demonstrates its robust generalization and effectiveness, as it performs well compared with other baselines across various public OOD datasets. The code is publicly available at https://github.com/flzeng1/MPAIACL.
