Edge-Splitting MLP: Node Classification on Homophilic and Heterophilic Graphs without Message Passing
Matthias Kohn, Marcel Hoffmann, Ansgar Scherp
TL;DR
ES-MLP addresses the challenge of node classification on graphs with varying homophily by fusing Graph-MLP’s neighborhood-aware training with ES-GNN’s edge-splitting into a two-channel, no-inference-edge architecture. It learns task-relevant and task-irrelevant embeddings, applying a neighborhood contrastive loss and an irrelevance regularization to achieve robust performance without relying on edges at inference time. Across seven real-world datasets and a CSBM synthetic dataset, ES-MLP matches or surpasses baselines on both homophilic and heterophilic graphs, while offering significant inference-time speedups and resilience to edge-noise. The work demonstrates practical impact for scalable, robust graph learning and suggests future directions toward incorporating graph directionality and further optimizing edge-splitting dynamics.
Abstract
Message Passing Neural Networks (MPNNs) have demonstrated remarkable success in node classification on homophilic graphs. It has been shown that they do not solely rely on homophily but on neighborhood distributions of nodes, i.e., consistency of the neighborhood label distribution within the same class. MLP-based models do not use message passing, \eg Graph-MLP incorporates the neighborhood in a separate loss function. These models are faster and more robust to edge noise. Graph-MLP maps adjacent nodes closer in the embedding space but is unaware of the neighborhood pattern of the labels, i.e., relies solely on homophily. Edge Splitting GNN (ES-GNN) is a model specialized for heterophilic graphs and splits the edges into task-relevant and task-irrelevant, respectively. To mitigate the limitations of Graph-MLP on heterophilic graphs, we propose ES-MLP that combines Graph-MLP with an edge-splitting mechanism from ES-GNN. It incorporates the edge splitting into the loss of Graph-MLP to learn two separate adjacency matrices based on relevant and irrelevant feature pairs. Our experiments on seven datasets with six baselines show that ES-MLP is on par with homophilic and heterophilic models on all datasets without using edges during inference. We show that ES-MLP is robust to multiple types of edge noise during inference and that its inference time is two to five times faster than that of commonly used MPNNs. The source code is available at https://github.com/MatthiasKohn/ES-MLP.
