Rethinking Spectral Augmentation for Contrast-based Graph Self-Supervised Learning
Xiangru Jian, Xinjian Zhao, Wei Pang, Chaolong Ying, Yimu Wang, Yaoyao Xu, Tianshu Yu
TL;DR
This work critically evaluates the role of spectral augmentation in contrast-based graph self-supervised learning. By combining theoretical bounds on the InfoNCE loss for shallow GNNs with extensive empirical comparisons across node- and graph-level tasks, it shows that simple edge perturbations such as DropEdge and AddEdge not only match but frequently outperform spectral augmentation methods, while also offering far greater computational efficiency. The analysis reveals that spectral cues are largely inaccessible to shallow encoders and that optimizing augmentation strength can yield more practical gains than complex spectral manipulations. Collectively, the findings suggest a shift toward simple topology perturbations in CG-SSL and highlight the need to probe deeper architectures if spectral information is to be effectively leveraged.
Abstract
The recent surge in contrast-based graph self-supervised learning has prominently featured an intensified exploration of spectral cues. Spectral augmentation, which involves modifying a graph's spectral properties such as eigenvalues or eigenvectors, is widely believed to enhance model performance. However, an intriguing paradox emerges, as methods grounded in seemingly conflicting assumptions regarding the spectral domain demonstrate notable enhancements in learning performance. Through extensive empirical studies, we find that simple edge perturbations - random edge dropping for node-level and random edge adding for graph-level self-supervised learning - consistently yield comparable or superior performance while being significantly more computationally efficient. This suggests that the computational overhead of sophisticated spectral augmentations may not justify their practical benefits. Our theoretical analysis of the InfoNCE loss bounds for shallow GNNs further supports this observation. The proposed insights represent a significant leap forward in the field, potentially refining the understanding and implementation of graph self-supervised learning.
