Bootstrap Latents of Nodes and Neighbors for Graph Self-Supervised Learning
Yunhui Liu, Huaisong Zhang, Tieke He, Tao Zheng, Jianhua Zhao
TL;DR
This work extends negative-free graph self-supervised learning by exploiting graph homophily through neighbor-based positive pairs. It introduces BLNN, which adds node–neighbor positives to the BGRL framework and employs a cross-attention module to softly weight neighbor contributions, reducing class collision from false positives. Empirical results across five datasets and three downstream tasks show improved intra-class compactness and state-of-the-art performance, with robust hyperparameter behavior and scalable complexity. The approach offers a practical, scalable way to leverage local label-consistent structure in graphs for improved representation learning.
Abstract
Contrastive learning is a significant paradigm in graph self-supervised learning. However, it requires negative samples to prevent model collapse and learn discriminative representations. These negative samples inevitably lead to heavy computation, memory overhead and class collision, compromising the representation learning. Recent studies present that methods obviating negative samples can attain competitive performance and scalability enhancements, exemplified by bootstrapped graph latents (BGRL). However, BGRL neglects the inherent graph homophily, which provides valuable insights into underlying positive pairs. Our motivation arises from the observation that subtly introducing a few ground-truth positive pairs significantly improves BGRL. Although we can't obtain ground-truth positive pairs without labels under the self-supervised setting, edges in the graph can reflect noisy positive pairs, i.e., neighboring nodes often share the same label. Therefore, we propose to expand the positive pair set with node-neighbor pairs. Subsequently, we introduce a cross-attention module to predict the supportiveness score of a neighbor with respect to the anchor node. This score quantifies the positive support from each neighboring node, and is encoded into the training objective. Consequently, our method mitigates class collision from negative and noisy positive samples, concurrently enhancing intra-class compactness. Extensive experiments are conducted on five benchmark datasets and three downstream task node classification, node clustering, and node similarity search. The results demonstrate that our method generates node representations with enhanced intra-class compactness and achieves state-of-the-art performance.
