A Survey on Learning from Graphs with Heterophily: Recent Advances and Future Directions
Chenghua Gong, Yao Cheng, Jianxiang Yu, Can Xu, Caihua Shan, Siqiang Luo, Xiang Li
TL;DR
This survey addresses learning from graphs with heterophily by delivering a comprehensive, taxonomy-driven review that spans metrics, benchmarks, GNN models, learning paradigms, and broad applications. It highlights spectral and high-order filtering, global and discriminative message passing, graph transformers, neural diffusion, self-supervised and prompt-based learning as core advances for heterophilic graphs. The work underscores the critical role of robust benchmarks, reassessment studies, and scalability concerns while identifying future directions, including heterogeneous/dynamic/hypergraphs, deeper theory, and Graph Foundation Models. Its synthesis of methods and practical guidance aims to accelerate progress and inform practitioners across AI-powered domains that rely on complex, heterogeneous graph data.
Abstract
Graphs are structured data that models complex relations between real-world entities. Heterophilic graphs, where linked nodes are prone to be with different labels or dissimilar features, have recently attracted significant attention and found many real-world applications. Meanwhile, increasing efforts have been made to advance learning from graphs with heterophily. Various graph heterophily measures, benchmark datasets, and learning paradigms are emerging rapidly. In this survey, we comprehensively review existing works on learning from graphs with heterophily. First, we overview over 500 publications, of which more than 340 are directly related to heterophilic graphs. After that, we survey existing metrics of graph heterophily and list recent benchmark datasets. Further, we systematically categorize existing methods based on a hierarchical taxonomy including GNN models, learning paradigms and practical applications. In addition, broader topics related to graph heterophily are also included. Finally, we discuss the primary challenges of existing studies and highlight promising avenues for future research.
