Is Homophily a Necessity for Graph Neural Networks?
Yao Ma, Xiaorui Liu, Neil Shah, Jiliang Tang
TL;DR
The paper questions the necessity of homophily for graph neural networks by examining the standard GCN under heterophily through a CSBM-based theoretical lens and controlled experiments. It shows that when same-label nodes share similar neighborhood patterns and different classes are distinguishable by neighbor distributions, GCNs can achieve strong semi-supervised node classification even on heterophilous graphs. The study couples formal results with extensive empirical analyses on synthetic graphs and real benchmarks, uncovering a nuanced, degree- and distribution-dependent picture (including a V-shaped performance trend as heterophily is varied). Overall, it reframes the narrative around homophily, illustrating that it is not universally necessary, but certain structural conditions must hold for GCNs to excel.
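To make the central condition concrete, here is a minimal sketch (not the paper's code) of a fully heterophilous toy graph on which one step of GCN-style mean aggregation still keeps the classes separable. The graph (a complete bipartite graph), the idealized one-hot features, and the helper names `edge_homophily` and `mean_aggregate` are assumptions made for illustration only.

```python
import numpy as np

def edge_homophily(adj, labels):
    """Fraction of undirected edges whose endpoints share a label."""
    src, dst = np.nonzero(np.triu(adj, k=1))  # count each edge once
    return float(np.mean(labels[src] == labels[dst]))

def mean_aggregate(adj, features):
    """One GCN-style step: average features over each node and its neighbors."""
    a_hat = adj + np.eye(adj.shape[0])        # add self-loops
    deg = a_hat.sum(axis=1, keepdims=True)    # row degrees, including self-loop
    return (a_hat / deg) @ features

# Assumed toy graph: complete bipartite K_{3,3}.
# Nodes 0-2 are class 0, nodes 3-5 are class 1; every edge crosses classes.
n = 3
labels = np.array([0] * n + [1] * n)
adj = np.zeros((2 * n, 2 * n))
adj[:n, n:] = 1
adj[n:, :n] = 1

# Idealized, noise-free features: each node carries its class's one-hot vector.
features = np.eye(2)[labels]
hidden = mean_aggregate(adj, features)

print(edge_homophily(adj, labels))  # 0.0: no same-label edges at all
print(hidden[0], hidden[3])         # [0.25 0.75] [0.75 0.25]
```

Every node in a class has the same neighborhood label distribution, and the two classes have different ones, so the aggregated embeddings are identical within a class and distinct across classes even though edge homophily is zero. The embeddings are effectively swapped relative to the input, which a downstream linear layer can undo.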
Abstract
Graph neural networks (GNNs) have shown great prowess in learning representations suitable for numerous graph-based machine learning tasks. When applied to semi-supervised node classification, GNNs are widely believed to work well due to the homophily assumption ("like attracts like"), and fail to generalize to heterophilous graphs where dissimilar nodes connect. Recent works design new architectures to overcome such heterophily-related limitations, citing poor baseline performance and new architecture improvements on a few heterophilous graph benchmark datasets as evidence for this notion. In our experiments, we empirically find that standard graph convolutional networks (GCNs) can actually achieve better performance than such carefully designed methods on some commonly used heterophilous graphs. This motivates us to reconsider whether homophily is truly necessary for good GNN performance. We find that this claim is not quite true, and in fact, GCNs can achieve strong performance on heterophilous graphs under certain conditions. Our work carefully characterizes these conditions, and provides supporting theoretical understanding and empirical observations. Finally, we examine existing heterophilous graph benchmarks and reconcile how the GCN (under)performs on them based on this understanding.
