Heterophilous Distribution Propagation for Graph Neural Networks
Zhuonan Zheng, Sheng Zhou, Hongjia Xu, Ming Gu, Yilun Xu, Ao Li, Yuhong Li, Jingjun Gu, Jiajun Bu
TL;DR
This work tackles the challenge of heterophily in graph neural networks by introducing Heterophilous Distribution Propagation (HDP), which adaptively partitions neighborhood signals into homophilous and heterophilous components using pseudo-labels and learns heterophilous distributions under an orthogonality constraint via a Trusted Prototype Contrastive (TPC) loss. It combines a semantic-aware neighborhood partition with mean-operator-based heterophilous modeling and a semantic-aware message passing framework to propagate both patterns, achieving strong performance on heterophilous graphs across nine datasets. The approach is supported by extensive ablations and analyses showing the contributions of assignment initialization, semantic structural encoding, SMP, and TPC, as well as insights into the discriminability of the learned representations. Overall, HDP advances robust, discriminative graph representation learning in heterophilous settings with practical implications for real-world networks.
Abstract
Graph Neural Networks (GNNs) have achieved remarkable success in various graph mining tasks by aggregating information from neighborhoods for representation learning. The success relies on the homophily assumption that nearby nodes exhibit similar behaviors, while it may be violated in many real-world graphs. Recently, heterophilous graph neural networks (HeterGNNs) have attracted increasing attention by modifying the neural message passing schema for heterophilous neighborhoods. However, they suffer from insufficient neighborhood partition and heterophily modeling, both of which are critical but challenging to break through. To tackle these challenges, in this paper, we propose heterophilous distribution propagation (HDP) for graph neural networks. Instead of aggregating information from all neighborhoods, HDP adaptively separates the neighbors into homophilous and heterphilous parts based on the pseudo assignments during training. The heterophilous neighborhood distribution is learned with orthogonality-oriented constraint via a trusted prototype contrastive learning paradigm. Both the homophilous and heterophilous patterns are propagated with a novel semantic-aware message passing mechanism. We conduct extensive experiments on 9 benchmark datasets with different levels of homophily. Experimental results show that our method outperforms representative baselines on heterophilous datasets.
