Less is More: Towards Simple Graph Contrastive Learning
Yanan Zhao, Feng Ji, Jingyang Dai, Jiaze Ma, Wee Peng Tay
TL;DR
This work tackles the difficulty of graph contrastive learning on heterophilic graphs by advocating a minimal, augmentation-free approach that uses two complementary views: a GCN-based structural view and an MLP-based feature view. By ensuring weak correlation between the noises of these views, the model achieves strong noise cancellation and outperforms state-of-the-art methods on heterophilic benchmarks while remaining efficient and robust. The authors provide theoretical insights into noise decoupling and validate the method through extensive experiments, including adversarial robustness tests and complexity analyses. Overall, the paper argues that simplicity, via a two-view, augmentation-free design, can outperform more complex GCL pipelines in challenging graph regimes.
Abstract
Graph Contrastive Learning (GCL) has shown strong promise for unsupervised graph representation learning, yet its effectiveness on heterophilic graphs, where connected nodes often belong to different classes, remains limited. Most existing methods rely on complex augmentation schemes, intricate encoders, or negative sampling, which raises the question of whether such complexity is truly necessary in this challenging setting. In this work, we revisit the foundations of supervised and unsupervised learning on graphs and uncover a simple yet effective principle for GCL: mitigating node feature noise by aggregating it with structural features derived from the graph topology. This observation suggests that the original node features and the graph structure naturally provide two complementary views for contrastive learning. Building on this insight, we propose an embarrassingly simple GCL model that uses a GCN encoder to capture structural features and an MLP encoder to isolate node feature noise. Our design requires neither data augmentation nor negative sampling, yet achieves state-of-the-art results on heterophilic benchmarks with minimal computational and memory overhead, while also offering advantages in homophilic graphs in terms of complexity, scalability, and robustness. We provide theoretical justification for our approach and validate its effectiveness through extensive experiments, including robustness evaluations against both black-box and white-box adversarial attacks.
