TopInG: Topologically Interpretable Graph Learning via Persistent Rationale Filtration
Cheng Xin, Fan Xu, Xin Ding, Jie Gao, Jiaxin Ding
TL;DR
TopInG introduces TopInG, a topological framework for intrinsically interpretable GNNs that identifies persistent rationale subgraphs through a learnable edge filtration guided by persistent homology. By enforcing a topological discrepancy between rationale and non-rationale components via a tractable lower bound and a two-Gaussian prior, it provides theoretical guarantees that the ground-truth rationale uniquely optimizes the loss under certain conditions. The approach yields stable explanations across variform rationales and improves interpretability while maintaining competitive predictive accuracy, demonstrated on eight benchmark datasets with diverse motif structures. This topological perspective enables more trustworthy AI in scientific domains by capturing stable, scale-spanning topological features that underlie predictions. The framework highlights a path toward robust, interpretable graph learning that can resist spurious correlations and variations in subgraph structure.
Abstract
Graph Neural Networks (GNNs) have shown remarkable success across various scientific fields, yet their adoption in critical decision-making is often hindered by a lack of interpretability. Recently, intrinsically interpretable GNNs have been studied to provide insights into model predictions by identifying rationale substructures in graphs. However, existing methods face challenges when the underlying rationale subgraphs are complex and varied. In this work, we propose TopInG: Topologically Interpretable Graph Learning, a novel topological framework that leverages persistent homology to identify persistent rationale subgraphs. TopInG employs a rationale filtration learning approach to model an autoregressive generation process of rationale subgraphs, and introduces a self-adjusted topological constraint, termed topological discrepancy, to enforce a persistent topological distinction between rationale subgraphs and irrelevant counterparts. We provide theoretical guarantees that our loss function is uniquely optimized by the ground truth under specific conditions. Extensive experiments demonstrate TopInG's effectiveness in tackling key challenges, such as handling variform rationale subgraphs, balancing predictive performance with interpretability, and mitigating spurious correlations. Results show that our approach improves upon state-of-the-art methods on both predictive accuracy and interpretation quality.
