SliceGX: Layer-wise GNN Explanation with Model-slicing
Tingting Zhu, Tingyang Chen, Yinghui Wu, Arijit Khan, Xiangyu Ke
TL;DR
The paper tackles layer-wise interpretability in graph neural networks to enable diagnosis and optimization of intermediate representations. It introduces SliceGX, which uses model slicing to construct l-sliced models {\mathcal{M}}^{l} and derives explanations {G^l_s} at target layers, guided by a bi-criteria score f(G^l_s) = γ I(V^l_s) + (1−γ) D(V^l_s) that mixes relative influence I and embedding diversity D. It proves that the explanation generation problem is NP-hard but shows that f is monotone submodular, enabling a 1/2-approximation algorithm SliceSS and scalable variants SliceMS and SliceMM. Empirical results on six benchmarks demonstrate high fidelity and efficiency, with a progressive-diagnosis case study illustrating practical usefulness for debugging and optimization in real-world GNN deployments, all while maintaining guard conditions that ensure faithful explanations across sliced and full models.
Abstract
Ensuring the trustworthiness of graph neural networks (GNNs), which are often treated as black-box models, requires effective explanation techniques. Existing GNN explanations typically apply input perturbations to identify subgraphs that are responsible for the occurrence of the final output of GNNs. However, such approaches lack finer-grained, layer-wise analysis of how intermediate representations contribute to the final result, capabilities that are crucial for model diagnosis and architecture optimization. This paper introduces SliceGX, a novel GNN explanation approach that generates explanations at specific GNN layers in a progressive manner. Given a GNN model M, a set of selected intermediate layers, and a target layer, SliceGX slices M into layer blocks("model slice") and discovers high-quality explanatory subgraphs within each block that elucidate how the model output arises at the target layer. Although finding such layer-wise explanations is computationally challenging, we develop efficient algorithms and optimization techniques that incrementally construct and maintain these subgraphs with provable approximation guarantees. Extensive experiments on synthetic and real-world benchmarks demonstrate the effectiveness and efficiency of SliceGX, and illustrate its practical utility in supporting model debugging.
