Explaining Graph Neural Networks via Structure-aware Interaction Index
Ngoc Bui, Hieu Trung Nguyen, Viet Anh Nguyen, Rex Ying
TL;DR
The paper addresses explainability for graph neural networks by introducing the Myerson-Taylor interaction index, which internalizes graph structure into node and motif attributions. Building on this, the authors present MAGE, a structure-aware explainer that uses second-order MTI to compute pairwise node interactions and optimizes for multiple explanatory motifs that can be positive or negative. They provide an axiomatic justification showing MTI is unique under five natural axioms and demonstrate through extensive experiments on ten datasets that MAGE consistently outperforms state-of-the-art baselines, achieving significant improvements in explanation accuracy and motif fidelity with favorable query efficiency. The work advances graph explainability by jointly accounting for connectivity and high-order interactions, enabling concise, human-interpretable subgraph explanations with potential impact across molecular, visual, and text domains.
Abstract
The Shapley value is a prominent tool for interpreting black-box machine learning models thanks to its strong theoretical foundation. However, for models with structured inputs, such as graph neural networks, existing Shapley-based explainability approaches either focus solely on node-wise importance or neglect the graph structure when perturbing the input instance. This paper introduces the Myerson-Taylor interaction index that internalizes the graph structure into attributing the node values and the interaction values among nodes. Unlike the Shapley-based methods, the Myerson-Taylor index decomposes coalitions into components satisfying a pre-chosen connectivity criterion. We prove that the Myerson-Taylor index is the unique one that satisfies a system of five natural axioms accounting for graph structure and high-order interaction among nodes. Leveraging these properties, we propose Myerson-Taylor Structure-Aware Graph Explainer (MAGE), a novel explainer that uses the second-order Myerson-Taylor index to identify the most important motifs influencing the model prediction, both positively and negatively. Extensive experiments on various graph datasets and models demonstrate that our method consistently provides superior subgraph explanations compared to state-of-the-art methods.
