PAGE: Parametric Generative Explainer for Graph Neural Network
Yang Qiu, Wei Liu, Jun Wang, Ruixuan Li
TL;DR
The paper addresses the challenge of explaining graph neural networks with faithful, scalable explanations. It introduces PAGE, a parametric generative explainer built from an autoencoder and a discriminator that isolates latent causal features $Z_c$ to generate a graph-substructure explanation without perturbations, via a two-stage training regime that maximizes mutual information with predictions. PAGE demonstrates superior fidelity, accuracy, and efficiency on both synthetic and real-world datasets compared with GNNExplainer, PGExplainer, Gem, and OrphicX, while avoiding extensive perturbation or sampling. The approach is model-agnostic, post-hoc, and scalable, offering a practical path toward reliable interpretability in diverse GNN applications, though it currently relies on autoencoders as the interpreter for explanations.
Abstract
This article introduces PAGE, a parameterized generative interpretive framework. PAGE is capable of providing faithful explanations for any graph neural network without necessitating prior knowledge or internal details. Specifically, we train the auto-encoder to generate explanatory substructures by designing appropriate training strategy. Due to the dimensionality reduction of features in the latent space of the auto-encoder, it becomes easier to extract causal features leading to the model's output, which can be easily employed to generate explanations. To accomplish this, we introduce an additional discriminator to capture the causality between latent causal features and the model's output. By designing appropriate optimization objectives, the well-trained discriminator can be employed to constrain the encoder in generating enhanced causal features. Finally, these features are mapped to substructures of the input graph through the decoder to serve as explanations. Compared to existing methods, PAGE operates at the sample scale rather than nodes or edges, eliminating the need for perturbation or encoding processes as seen in previous methods. Experimental results on both artificially synthesized and real-world datasets demonstrate that our approach not only exhibits the highest faithfulness and accuracy but also significantly outperforms baseline models in terms of efficiency.
