GCL-OT: Graph Contrastive Learning with Optimal Transport for Heterophilic Text-Attributed Graphs
Yating Ren, Yikun Ban, Huobin Tan
TL;DR
GCL-OT tackles multi-granular heterophily in text-attributed graphs by integrating optimal transport into graph contrastive learning to align structure and text representations bidirectionally. It introduces RealSoftMax-based similarity for partial heterophily, a prompt-based filter to mitigate complete heterophily, and OT-guided latent homophily mining to uncover hidden semantically related neighbors. Theoretical analysis shows tighter mutual information bounds and improved Bayes error guarantees, while extensive experiments on nine TAG benchmarks demonstrate robust, state-of-the-art performance across supervised and unsupervised settings. The framework remains effective with different PLMs and exhibits strong resilience to perturbations, highlighting practical applicability for real-world TAG tasks.
Abstract
Recently, structure-text contrastive learning has shown promising performance on text-attributed graphs by leveraging the complementary strengths of graph neural networks and language models. However, existing methods typically rely on homophily assumptions in similarity estimation and hard optimization objectives, which limit their applicability to heterophilic graphs. Although existing methods can mitigate heterophily through structural adjustments or neighbor aggregation, they usually treat textual embeddings as static targets, leading to suboptimal alignment. In this work, we identify the multi-granular heterophily in text-attributed graphs, including complete heterophily, partial heterophily, and latent homophily, which makes structure-text alignment particularly challenging due to mixed, noisy, and missing semantic correlations. To achieve flexible and bidirectional alignment, we propose GCL-OT, a novel graph contrastive learning framework with optimal transport, equipped with tailored mechanisms for each type of heterophily. Specifically, for partial heterophily, we design a RealSoftMax-based similarity estimator to emphasize key neighbor-word interactions while easing background noise. For complete heterophily, we introduce a prompt-based filter that adaptively excludes irrelevant noise during optimal transport alignment. Furthermore, we incorporate OT-guided soft supervision to uncover potential neighbors with similar semantics, enhancing the learning of latent homophily. Theoretical analysis shows that GCL-OT can improve the mutual information bound and Bayes error guarantees. Extensive experiments on nine benchmarks show that GCL-OT consistently outperforms state-of-the-art methods, verifying its effectiveness and robustness.
