TreeX: Generating Global Graphical GNN Explanations via Critical Subtree Extraction
Shengyao Lu, Jiuding Yang, Baochun Li, Di Niu
TL;DR
TreeX addresses the challenge of global, graphical explanations for WL-based GNNs by mining full $L$-hop subtrees incurred during message passing and representing each subtree with its last-layer root embedding. The method builds global graph concepts through clustering of local subtree embeddings and merges duplicates via isomorphism, then learns class-specific global-rule weights to explain both dataset-wide and individual predictions. It enables explaining incorrect predictions and provides faithful, interpretable motifs that align with ground-truth concepts, outperforming existing global explainers and rivaling local explainers in fidelity. This approach offers practical, scalable, and interpretable explanations for graph-structured data, with direct applicability to molecular and network analyses."
Abstract
The growing demand for transparency and interpretability in critical domains has driven increased interests in comprehending the explainability of Message-Passing (MP) Graph Neural Networks (GNNs). Although substantial research efforts have been made to generate explanations for individual graph instances, identifying global explaining concepts for a GNN still poses great challenges, especially when concepts are desired in a graphical form on the dataset level. While most prior works treat GNNs as black boxes, in this paper, we propose to unbox GNNs by analyzing and extracting critical subtrees incurred by the inner workings of message passing, which correspond to critical subgraphs in the datasets. By aggregating subtrees in an embedding space with an efficient algorithm, which does not require complex subgraph matching or search, we can make intuitive graphical explanations for Message-Passing GNNs on local, class and global levels. We empirically show that our proposed approach not only generates clean subgraph concepts on a dataset level in contrast to existing global explaining methods which generate non-graphical rules (e.g., language or embeddings) as explanations, but it is also capable of providing explanations for individual instances with a comparable or even superior performance as compared to leading local-level GNN explainers.
