Uncovering Capabilities of Model Pruning in Graph Contrastive Learning
Junran Wu, Xueyuan Chen, Shangzhe Li
TL;DR
This work tackles the reliance of graph contrastive learning on data augmentations that may distort semantics by reframing pre-training around model pruning instead of view generation. The proposed framework, LAMP, uses a dense original graph encoder and a pruned perturbation encoder to produce contrasting embeddings, complemented by a local node-level loss to address hard negatives. The authors provide theoretical results showing pruning can preserve or even improve mutual information with downstream labels compared to augmentation-based views, and they demonstrate strong empirical gains in unsupervised and transfer learning across diverse benchmarks. Overall, LAMP offers a general, domain-agnostic approach to graph representation learning that reduces reliance on potentially destructive augmentations while delivering state-of-the-art performance.
Abstract
Graph contrastive learning has achieved great success in pre-training graph neural networks without ground-truth labels. Leading graph contrastive learning methods follow the classical scheme of contrastive learning, forcing the model to identify the essential information from augmented views. However, augmented views are generally produced via random corruption or learning, which inevitably alters semantics. Although augmentations guided by domain knowledge alleviate this issue, the generated views are domain specific and undermine generalization. In this work, motivated by the robust representation ability of sparse models obtained by pruning, we reformulate the problem of graph contrastive learning as contrasting different model versions rather than augmented views. We first theoretically reveal the superiority of model pruning over data augmentations. In practice, we take the original graph as input and dynamically generate a perturbed graph encoder, obtained by pruning the transformation weights of the original encoder, to contrast with that original encoder. Furthermore, because our method keeps node embeddings intact, we can develop a local contrastive loss to handle the hard negative samples that disturb model training. We extensively validate our method on various graph classification benchmarks under unsupervised and transfer learning. Compared to state-of-the-art (SOTA) works, the proposed method consistently achieves better performance.
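The idea of contrasting a dense encoder with a pruned copy of itself on the same, unaugmented graph can be illustrated with a minimal sketch. The abstract does not state the pruning criterion, the encoder architecture, or the exact loss, so everything concrete below is an assumption made for illustration: magnitude-based pruning, a single propagation layer with mean pooling, and an InfoNCE-style graph-level loss. All function names are hypothetical, and the local node-level loss is omitted.

```python
import math

def magnitude_prune(weight, ratio):
    """Return a copy of `weight` with the smallest-magnitude `ratio` fraction zeroed.
    Magnitude pruning is an assumed criterion, not one taken from the abstract."""
    flat = sorted(abs(w) for row in weight for w in row)
    k = int(ratio * len(flat))
    if k == 0:
        return [row[:] for row in weight]
    threshold = flat[k - 1]
    return [[w if abs(w) > threshold else 0.0 for w in row] for row in weight]

def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def encode(adj, feats, weight):
    """Toy encoder: one propagation step (A X W), ReLU, then mean-pool the nodes."""
    hidden = matmul(matmul(adj, feats), weight)
    hidden = [[max(v, 0.0) for v in row] for row in hidden]
    return [sum(col) / len(hidden) for col in zip(*hidden)]

def cosine(u, v):
    """Cosine similarity; a zero vector is treated as having unit norm."""
    norm_u = math.sqrt(sum(x * x for x in u)) or 1.0
    norm_v = math.sqrt(sum(x * x for x in v)) or 1.0
    return sum(x * y for x, y in zip(u, v)) / (norm_u * norm_v)

def contrastive_loss(dense_embs, pruned_embs, tau=0.5):
    """InfoNCE-style loss over a batch: the positive pair for graph i is its
    dense-encoder embedding and its pruned-encoder embedding; the other
    graphs' pruned embeddings act as negatives."""
    total = 0.0
    for i, z in enumerate(dense_embs):
        logits = [cosine(z, p) / tau for p in pruned_embs]
        peak = max(logits)
        log_denominator = peak + math.log(sum(math.exp(l - peak) for l in logits))
        total += log_denominator - logits[i]
    return total / len(dense_embs)
```

In this sketch both views of a graph come from the same input, so no edges or features are perturbed; the only difference between the two embeddings is the sparsity of the weight matrix passed to `encode`. Re-deriving the pruned weights from the current dense weights at each training step would correspond to the "dynamically generate" wording in the abstract, though the schedule used by the authors is not given here.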
