CGCL: Collaborative Graph Contrastive Learning without Handcrafted Graph Data Augmentations
Tianyu Zhang, Yuxiang Ren, Wenzheng Feng, Weitao Du, Xuecang Zhang
TL;DR
This paper tackles the instability of handcrafted augmentations in unsupervised graph contrastive learning by introducing Collaborative Graph Contrastive Learning (CGCL), a framework that uses multiple heterogeneous GNN encoders to generate contrastive views from the encoder perspective. By enforcing an asymmetric architecture and encouraging complementary encoders, CGCL avoids reliance on perturbations and mitigates model collapse, with two metrics—Asymmetry Coefficient ($AC$) and Complementarity Coefficient ($CC$)—to quantify the assembly. Extensive experiments on nine graph benchmarks show CGCL achieving state-of-the-art or competitive graph-level representations without augmentations, with results improving when the encoder assembly exhibits high asymmetry and complementarity. The work also provides a theoretical and empirical basis for evaluating encoder assemblies and contributes a reproducible implementation (code available online).
Abstract
Unsupervised graph representation learning is a non-trivial topic. The success of contrastive methods in the unsupervised representation learning on structured data inspires similar attempts on the graph. Existing graph contrastive learning (GCL) aims to learn the invariance across multiple augmentation views, which renders it heavily reliant on the handcrafted graph augmentations. However, inappropriate graph data augmentations can potentially jeopardize such invariance. In this paper, we show the potential hazards of inappropriate augmentations and then propose a novel Collaborative Graph Contrastive Learning framework (CGCL). This framework harnesses multiple graph encoders to observe the graph. Features observed from different encoders serve as the contrastive views in contrastive learning, which avoids inducing unstable perturbation and guarantees the invariance. To ensure the collaboration among diverse graph encoders, we propose the concepts of asymmetric architecture and complementary encoders as the design principle. To further prove the rationality, we utilize two quantitative metrics to measure the assembly of CGCL respectively. Extensive experiments demonstrate the advantages of CGCL in unsupervised graph-level representation learning and the potential of collaborative framework. The source code for reproducibility is available at https://github.com/zhangtia16/CGCL
