Structure-enhanced Contrastive Learning for Graph Clustering
Xunlian Wu, Jingqi Hu, Anqi Zhang, Yining Quan, Qiguang Miao, Peng Gang Sun
TL;DR
This work addresses graph clustering without heavy reliance on data augmentation and without ignoring mesoscopic community structure. It introduces Structure-enhanced Contrastive Learning (SECL), combining cross-view contrastive learning over structure and attribute views with a structural alignment loss and a modularity-maximization objective to preserve community structure. Through two MLP encoders and a learnable mapping to clusters, SECL achieves state-of-the-art results across six diverse datasets, outperforming both traditional deep clustering methods and recent contrastive approaches. The approach reduces the need for pre-training and cumbersome augmentations while delivering robust, cluster-aware representations with practical implications for real-world network analysis.
Abstract
Graph clustering is a crucial task in network analysis with widespread applications, focusing on partitioning nodes into distinct groups with stronger intra-group connections than inter-group ones. Recently, contrastive learning has achieved significant progress in graph clustering. However, most methods suffer from the following issues: 1) an over-reliance on meticulously designed data augmentation strategies, which can undermine the potential of contrastive learning. 2) overlooking cluster-oriented structural information, particularly the higher-order cluster(community) structure information, which could unveil the mesoscopic cluster structure information of the network. In this study, Structure-enhanced Contrastive Learning (SECL) is introduced to addresses these issues by leveraging inherent network structures. SECL utilizes a cross-view contrastive learning mechanism to enhance node embeddings without elaborate data augmentations, a structural contrastive learning module for ensuring structural consistency, and a modularity maximization strategy for harnessing clustering-oriented information. This comprehensive approach results in robust node representations that greatly enhance clustering performance. Extensive experiments on six datasets confirm SECL's superiority over current state-of-the-art methods, indicating a substantial improvement in the domain of graph clustering.
