Affinity Uncertainty-based Hard Negative Mining in Graph Contrastive Learning
Chaoxi Niu, Guansong Pang, Ling Chen
TL;DR
This work addresses the difficulty of identifying true hard negatives in graph contrastive learning, which arises from the non-i.i.d. structure of graph data and from oversmoothed representations. It introduces AUGCL, which learns an anchor-dependent hardness score for each negative by modeling affinity uncertainty across two groups of negatives and integrates this score as a weighting term in contrastive losses; the weighted loss is shown to be equivalent to a triplet loss with an adaptive margin $m_{ij} = \frac{\tau}{2} \log(\alpha u_{ij})$. The framework combines an anchor-dependent binary partition of negatives with a Deep Gambler-based uncertainty estimator to produce a per-anchor uncertainty matrix $\mathbf{U}$, which yields the weights $w_{ij} = \alpha \phi_i(\widehat{z}_j; \Theta)$. Empirically, AUGCL consistently improves graph- and node-classification performance across ten graph datasets, enhances robustness to graph adversarial attacks, and shows favorable ablation results and stable behavior under hyperparameter changes. The method is data-driven, avoids the prior assumptions that some earlier hardness-aware methods rely on, and plugs readily into existing GCL losses. Code is available at https://github.com/mala-lab/AUGCL.
Abstract
Hard negative mining has proven effective in enhancing self-supervised contrastive learning (CL) on diverse data types, including graph CL (GCL). Existing hardness-aware CL methods typically treat the negative instances most similar to the anchor instance as hard negatives, which helps improve CL performance, especially on image data. On graph data, however, this approach often fails to identify the true hard negatives and instead yields many false negatives. The main reason is that the learned graph representations are not sufficiently discriminative, owing to oversmoothed representations and/or the non-independent and identically distributed (non-i.i.d.) nature of graph data. To tackle this problem, this article proposes a novel approach that builds a discriminative model on collective affinity information (i.e., two sets of pairwise affinities between the negative instances and the anchor instance) to mine hard negatives in GCL. In particular, the proposed approach evaluates how confident or uncertain the discriminative model is about the affinity of each negative instance to an anchor instance, and uses this to determine the instance's hardness weight relative to that anchor. This uncertainty information is then incorporated into existing GCL loss functions via a weighting term to enhance their performance. The enhanced GCL is theoretically grounded: the resulting GCL loss is equivalent to a triplet loss with an adaptive margin that is exponentially proportional to the learned uncertainty of each negative instance. Extensive experiments on ten graph datasets show that our approach 1) consistently enhances different state-of-the-art (SOTA) GCL methods in both graph and node classification tasks and 2) significantly improves their robustness against adversarial attacks. Code is available at https://github.com/mala-lab/AUGCL.
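To make the role of the weighting term concrete, the following is a minimal sketch of an InfoNCE-style loss in which each negative's contribution is scaled by a per-anchor weight. It is an illustration under stated assumptions, not the authors' implementation: the function name `weighted_info_nce`, the two-view setup in which row `i` of each view forms the positive pair, and the use of a precomputed weight matrix in place of the learned uncertainty estimator are all hypothetical choices made for this example.

```python
import numpy as np


def weighted_info_nce(z_anchor, z_pos, weights, tau=0.5):
    """InfoNCE-style loss with per-anchor weights on the negatives.

    z_anchor, z_pos: (n, d) arrays; row i of each forms the positive pair.
    weights: (n, n) array; weights[i, j] scales negative j for anchor i
             (assumed here to be given, e.g. alpha * u_ij; diagonal ignored).
    tau: temperature.
    """
    # L2-normalise so that dot products are cosine similarities.
    za = z_anchor / np.linalg.norm(z_anchor, axis=1, keepdims=True)
    zp = z_pos / np.linalg.norm(z_pos, axis=1, keepdims=True)

    exp_sim = np.exp(za @ zp.T / tau)          # (n, n) exponentiated similarities
    pos = np.diag(exp_sim)                     # positive-pair terms
    off_diag = 1.0 - np.eye(len(za))           # mask that removes the positives
    neg = (weights * off_diag * exp_sim).sum(axis=1)  # weighted negative terms

    return float(np.mean(-np.log(pos / (pos + neg))))


# Illustrative usage with random embeddings (hypothetical data).
rng = np.random.default_rng(0)
za = rng.normal(size=(4, 8))
zp = rng.normal(size=(4, 8))

uniform = np.ones((4, 4))                      # all weights 1: plain InfoNCE
harder = uniform.copy()
harder[0, 1] = 3.0                             # treat negative 1 as harder for anchor 0

loss_plain = weighted_info_nce(za, zp, uniform)
loss_weighted = weighted_info_nce(za, zp, harder)
```

With all weights equal to 1 the function reduces to the unweighted loss, and raising a weight above 1 increases that negative's share of the denominator, so the loss penalizes its similarity to the anchor more strongly.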
