Factorized Explainer for Graph Neural Networks
Rundong Huang, Farhad Shirani, Dongsheng Luo
TL;DR
This work identifies a fundamental flaw in GIB-based post-hoc explanations for graph neural networks: the original objective can yield trivial, input-insensitive explanations. It introduces a modified GIB objective that compares the model's predictions on the subgraph to the ground-truth label via cross-entropy, tying explanations to the actual model behavior. To overcome locality and aggregation losses in existing parametric explainers, the authors propose K-FactExplainer, a factorized framework that combines multiple local explainers with a global selector, and they provide a bootstrapping method to bound the number of components. Through extensive experiments on synthetic and real datasets, the method shows consistent improvements in explanation faithfulness (AUC) over baselines and demonstrates robustness to multi-motif scenarios and lossy aggregation, offering practical impact for trustworthy GNN deployments.
Abstract
Graph Neural Networks (GNNs) have received increasing attention due to their ability to learn from graph-structured data. To open the black-box of these deep learning models, post-hoc instance-level explanation methods have been proposed to understand GNN predictions. These methods seek to discover substructures that explain the prediction behavior of a trained GNN. In this paper, we show analytically that for a large class of explanation tasks, conventional approaches, which are based on the principle of graph information bottleneck (GIB), admit trivial solutions that do not align with the notion of explainability. Instead, we argue that a modified GIB principle may be used to avoid the aforementioned trivial solutions. We further introduce a novel factorized explanation model with theoretical performance guarantees. The modified GIB is used to analyze the structural properties of the proposed factorized explainer. We conduct extensive experiments on both synthetic and real-world datasets to validate the effectiveness of our proposed factorized explainer.
