Simple Graph Contrastive Learning via Fractional-order Neural Diffusion Networks
Yanan Zhao, Feng Ji, Kai Zhao, Xuhao Li, Qiyu Kang, Wenfei Liang, Yahya Alkhatib, Xingchao Jian, Wee Peng Tay
TL;DR
This work tackles unsupervised node representation learning on graphs without data augmentations or negative samples by introducing FD GCL, a simple yet effective augmentation free contrastive framework based on fractional order neural diffusion. Two encoders governed by different order parameters generate distinct views that capture local and global graph information, and a regularized cosine mean loss ensures view diversity while mitigating collapse. The approach is underpinned by graph signal processing and fractional differential equation theory, with extensive experiments showing state of the art performance across both homophilic and heterophilic graphs and strong robustness to loss function choice. The results highlight the practical impact of memory aware diffusion dynamics for flexible and scalable graph representation learning.
Abstract
Graph Contrastive Learning (GCL) has recently made progress as an unsupervised graph representation learning paradigm. GCL approaches can be categorized into augmentation-based and augmentation-free methods. The former relies on complex data augmentations, while the latter depends on encoders that can generate distinct views of the same input. Both approaches may require negative samples for training. In this paper, we introduce a novel augmentation-free GCL framework based on graph neural diffusion models. Specifically, we utilize learnable encoders governed by Fractional Differential Equations (FDE). Each FDE is characterized by an order parameter of the differential operator. We demonstrate that varying these parameters allows us to produce learnable encoders that generate diverse views, capturing either local or global information, for contrastive learning. Our model does not require negative samples for training and is applicable to both homophilic and heterophilic datasets. We demonstrate its effectiveness across various datasets, achieving state-of-the-art performance.
