CHGNN: A Semi-Supervised Contrastive Hypergraph Learning Network
Yumeng Song, Yu Gu, Tianyi Li, Jianzhong Qi, Zhenghao Liu, Christian S. Jensen, Ge Yu
TL;DR
CHGNN tackles semi-supervised node classification on hypergraphs by uniting contrastive self-supervision with label information. It introduces an adaptive hypergraph view generator to create diverse, informative hypergraph views and a hyperedge homogeneity-aware HyperGNN to preserve higher-order dependencies via $homo(e)$. The training objective combines a similarity loss for the views, a supervised classification loss, a hyperedge homogeneity loss, and both basic and cross-validation contrastive losses, with an enhanced strategy that adaptively distances negative samples. Experiments on nine real-world hypergraph datasets show CHGNN consistently outperforms 13–19 competitive methods, demonstrating robustness under low-label scenarios and complex higher-order relationships. This approach highlights the value of leveraging hyperedge semantics and cross-type contrasts for scalable, accurate hypergraph learning.
Abstract
Hypergraphs can model higher-order relationships among data objects that are found in applications such as social networks and bioinformatics. However, recent studies on hypergraph learning that extend graph convolutional networks to hypergraphs cannot learn effectively from features of unlabeled data. To such learning, we propose a contrastive hypergraph neural network, CHGNN, that exploits self-supervised contrastive learning techniques to learn from labeled and unlabeled data. First, CHGNN includes an adaptive hypergraph view generator that adopts an auto-augmentation strategy and learns a perturbed probability distribution of minimal sufficient views. Second, CHGNN encompasses an improved hypergraph encoder that considers hyperedge homogeneity to fuse information effectively. Third, CHGNN is equipped with a joint loss function that combines a similarity loss for the view generator, a node classification loss, and a hyperedge homogeneity loss to inject supervision signals. It also includes basic and cross-validation contrastive losses, associated with an enhanced contrastive loss training process. Experimental results on nine real datasets offer insight into the effectiveness of CHGNN, showing that it outperforms 13 competitors in terms of classification accuracy consistently.
