Uncertainty Estimation for Heterophilic Graphs Through the Lens of Information Theory
Dominik Fuchsgruber, Tom Wollschläger, Johannes Bordne, Stephan Günnemann
TL;DR
This work addresses uncertainty estimation on heterophilic graphs, where traditional homophily-driven methods struggle. By casting MPNNs in an information-theoretic framework, it derives a Data Processing Equality analogue that allows information about the target to be redistributed across layers, revealing that deeper representations can carry unique, complementary information on heterophilic graphs. It introduces JLDE, a simple post-hoc KNN-based density estimator over the joint latent space of all layer embeddings, which achieves state-of-the-art epistemic uncertainty on heterophilic datasets while matching homophilic baselines without diffusion. The results demonstrate that exploiting information from all latent representations is crucial for reliable uncertainty estimation in non-i.i.d. graph settings and programmatically validates a principled design guideline for uncertainty in GNNs beyond homophily.
Abstract
While uncertainty estimation for graphs recently gained traction, most methods rely on homophily and deteriorate in heterophilic settings. We address this by analyzing message passing neural networks from an information-theoretic perspective and developing a suitable analog to data processing inequality to quantify information throughout the model's layers. In contrast to non-graph domains, information about the node-level prediction target can increase with model depth if a node's features are semantically different from its neighbors. Therefore, on heterophilic graphs, the latent embeddings of an MPNN each provide different information about the data distribution - different from homophilic settings. This reveals that considering all node representations simultaneously is a key design principle for epistemic uncertainty estimation on graphs beyond homophily. We empirically confirm this with a simple post-hoc density estimator on the joint node embedding space that provides state-of-the-art uncertainty on heterophilic graphs. At the same time, it matches prior work on homophilic graphs without explicitly exploiting homophily through post-processing.
