Table of Contents
Fetching ...

SliceGX: Layer-wise GNN Explanation with Model-slicing

Tingting Zhu, Tingyang Chen, Yinghui Wu, Arijit Khan, Xiangyu Ke

TL;DR

The paper tackles layer-wise interpretability in graph neural networks to enable diagnosis and optimization of intermediate representations. It introduces SliceGX, which uses model slicing to construct l-sliced models {\mathcal{M}}^{l} and derives explanations {G^l_s} at target layers, guided by a bi-criteria score f(G^l_s) = γ I(V^l_s) + (1−γ) D(V^l_s) that mixes relative influence I and embedding diversity D. It proves that the explanation generation problem is NP-hard but shows that f is monotone submodular, enabling a 1/2-approximation algorithm SliceSS and scalable variants SliceMS and SliceMM. Empirical results on six benchmarks demonstrate high fidelity and efficiency, with a progressive-diagnosis case study illustrating practical usefulness for debugging and optimization in real-world GNN deployments, all while maintaining guard conditions that ensure faithful explanations across sliced and full models.

Abstract

Ensuring the trustworthiness of graph neural networks (GNNs), which are often treated as black-box models, requires effective explanation techniques. Existing GNN explanations typically apply input perturbations to identify subgraphs that are responsible for the occurrence of the final output of GNNs. However, such approaches lack finer-grained, layer-wise analysis of how intermediate representations contribute to the final result, capabilities that are crucial for model diagnosis and architecture optimization. This paper introduces SliceGX, a novel GNN explanation approach that generates explanations at specific GNN layers in a progressive manner. Given a GNN model M, a set of selected intermediate layers, and a target layer, SliceGX slices M into layer blocks("model slice") and discovers high-quality explanatory subgraphs within each block that elucidate how the model output arises at the target layer. Although finding such layer-wise explanations is computationally challenging, we develop efficient algorithms and optimization techniques that incrementally construct and maintain these subgraphs with provable approximation guarantees. Extensive experiments on synthetic and real-world benchmarks demonstrate the effectiveness and efficiency of SliceGX, and illustrate its practical utility in supporting model debugging.

SliceGX: Layer-wise GNN Explanation with Model-slicing

TL;DR

The paper tackles layer-wise interpretability in graph neural networks to enable diagnosis and optimization of intermediate representations. It introduces SliceGX, which uses model slicing to construct l-sliced models {\mathcal{M}}^{l} and derives explanations {G^l_s} at target layers, guided by a bi-criteria score f(G^l_s) = γ I(V^l_s) + (1−γ) D(V^l_s) that mixes relative influence I and embedding diversity D. It proves that the explanation generation problem is NP-hard but shows that f is monotone submodular, enabling a 1/2-approximation algorithm SliceSS and scalable variants SliceMS and SliceMM. Empirical results on six benchmarks demonstrate high fidelity and efficiency, with a progressive-diagnosis case study illustrating practical usefulness for debugging and optimization in real-world GNN deployments, all while maintaining guard conditions that ensure faithful explanations across sliced and full models.

Abstract

Ensuring the trustworthiness of graph neural networks (GNNs), which are often treated as black-box models, requires effective explanation techniques. Existing GNN explanations typically apply input perturbations to identify subgraphs that are responsible for the occurrence of the final output of GNNs. However, such approaches lack finer-grained, layer-wise analysis of how intermediate representations contribute to the final result, capabilities that are crucial for model diagnosis and architecture optimization. This paper introduces SliceGX, a novel GNN explanation approach that generates explanations at specific GNN layers in a progressive manner. Given a GNN model M, a set of selected intermediate layers, and a target layer, SliceGX slices M into layer blocks("model slice") and discovers high-quality explanatory subgraphs within each block that elucidate how the model output arises at the target layer. Although finding such layer-wise explanations is computationally challenging, we develop efficient algorithms and optimization techniques that incrementally construct and maintain these subgraphs with provable approximation guarantees. Extensive experiments on synthetic and real-world benchmarks demonstrate the effectiveness and efficiency of SliceGX, and illustrate its practical utility in supporting model debugging.

Paper Structure

This paper contains 17 sections, 4 theorems, 8 equations, 12 figures, 2 tables, 3 algorithms.

Key Result

Lemma 1

Given a configuration ${\mathcal{C}}$ = $(G,{\mathcal{M}}, v_t, \L, l_t, k)$, and a subgraph $G_s$ of $G$, it is in $$PTIME to verify if $G_s$ is an explanation w.r.t. ${\mathcal{M}}^{l_t}(G,v_t)$ at layer $l$.

Figures (12)

  • Figure 1: Generating layer-wise explanations for GNN diagnosing for spam review detection dou2020enhancing.
  • Figure 2: Example of explanations with model-slicing. The upper part depicts the architecture of an $l$-sliced model ${\mathcal{M}}^l$, and the lower part shows factual explanations at each layer of ${\mathcal{M}}$ for diagnosing purposes.
  • Figure 3: Explanation with high influence and high embedding diversity (k=3)
  • Figure 4: A running example of SliceGX (k=4)
  • Figure 5: Impact of factors on quality of explanations
  • ...and 7 more figures

Theorems & Definitions (9)

  • Example 1
  • Example 2
  • Example 3
  • Example 4
  • Lemma 1
  • Theorem 1
  • Theorem 2
  • Example 5
  • Lemma 2