Clarify Confused Nodes via Separated Learning
Jiajun Zhou, Shengbo Gong, Xuanze Chen, Chenxuan Xie, Shanqing Yu, Qi Xuan, Xiaoniu Yang
TL;DR
The paper tackles the degradation of traditional GNNs on heterophilous graphs by challenging the weight-sharing assumption. It introduces Neighborhood Confusion (NC) as a scalable node-wise metric capturing neighborhood label diversity, and builds Neighborhood Confusion-guided Graph Convolutional Network (NCGCN), a two-channel, end-to-end framework that separates low-NC and high-NC nodes with dedicated transformations and intra-group message passing. NC is computed from pseudo-labels during training to produce NC masks that steer two parallel learning streams, significantly improving accuracy across diverse benchmarks and enabling a plug-in extension to other backbones. The work provides theoretical connections between NC, conditional entropy, and classification error bounds, demonstrates empirical SOTA results on ten datasets, and offers practical insights into node distinguishability and feature fusion preferences, with scalability to large graphs and inductive settings.
Abstract
Graph neural networks (GNNs) have achieved remarkable advances in graph-oriented tasks. However, real-world graphs invariably contain a certain proportion of heterophilous nodes, challenging the homophily assumption of traditional GNNs and hindering their performance. Most existing studies continue to design generic models with shared weights between heterophilous and homophilous nodes. Despite the incorporation of high-order messages or multi-channel architectures, these efforts often fall short. A minority of studies attempt to train different node groups separately but suffer from inappropriate separation metrics and low efficiency. In this paper, we first propose a new metric, termed Neighborhood Confusion (NC), to facilitate a more reliable separation of nodes. We observe that node groups with different levels of NC values exhibit certain differences in intra-group accuracy and visualized embeddings. These pave the way for Neighborhood Confusion-guided Graph Convolutional Network (NCGCN), in which nodes are grouped by their NC values and accept intra-group weight sharing and message passing. Extensive experiments on both homophilous and heterophilous benchmarks demonstrate that our framework can effectively separate nodes and yield significant performance improvement compared to the latest methods. The source code will be available in https://github.com/GISec-Team/NCGNN.
