Learn Beneficial Noise as Graph Augmentation
Siqi Huang, Yanchen Xu, Hongyuan Zhang, Xuelong Li
TL;DR
This work tackles the instability of graph augmentations in graph contrastive learning by introducing PiNGDA, a framework that learns beneficial perturbations through the information-theoretic notion of $\pi$-noise. A Gaussian auxiliary variable ties the GCL loss to information entropy, revealing that standard predefined augmentations approximate a point estimate of $\pi$-noise and motivating trainable generators for both topology and node attributes. The method derives a differentiable loss $\mathcal{L}_{\pi}$ and demonstrates improved performance and stability across node, graph, and heterogeneous graph tasks, with favorable efficiency and interpretability of the learned augmentations. Overall, PiNGDA provides a principled, adaptable augmentation strategy that improves robustness and generalization in graph representation learning with broad practical impact for downstream graph tasks.
Abstract
Although graph contrastive learning (GCL) has been widely investigated, it is still a challenge to generate effective and stable graph augmentations. Existing methods often apply heuristic augmentation like random edge dropping, which may disrupt important graph structures and result in unstable GCL performance. In this paper, we propose Positive-incentive Noise driven Graph Data Augmentation (PiNGDA), where positive-incentive noise (pi-noise) scientifically analyzes the beneficial effect of noise under the information theory. To bridge the standard GCL and pi-noise framework, we design a Gaussian auxiliary variable to convert the loss function to information entropy. We prove that the standard GCL with pre-defined augmentations is equivalent to estimate the beneficial noise via the point estimation. Following our analysis, PiNGDA is derived from learning the beneficial noise on both topology and attributes through a trainable noise generator for graph augmentations, instead of the simple estimation. Since the generator learns how to produce beneficial perturbations on graph topology and node attributes, PiNGDA is more reliable compared with the existing methods. Extensive experimental results validate the effectiveness and stability of PiNGDA.
