COMBINEX: A Unified Counterfactual Explainer for Graph Neural Networks via Node Feature and Structural Perturbations
Flavio Giorgi, Fabrizio Silvestri, Gabriele Tolomei
TL;DR
COMBINEX introduces a unified counterfactual explainer for Graph Neural Networks that jointly optimizes node feature and graph-structure perturbations to flip predictions with minimal changes. By using differentiable perturbation matrices and a tunable trade-off parameter α, the method balances topology and feature modifications and supports both discrete and continuous features. Empirical results across node and graph classification tasks demonstrate strong validity, competitive fidelity, and sparsity, outperforming several baselines while offering improved scalability via an edge-weight sparsification technique. The approach is versatile across multiple GNN architectures and datasets, with flexible α-scheduling policies enabling adaptation to dataset characteristics. This work advances explainability for graphs by providing realistic, actionable counterfactuals that maintain proximity to the data distribution and model decision boundaries.
Abstract
Counterfactual explanations have emerged as a powerful tool to unveil the opaque decision-making processes of graph neural networks (GNNs). However, existing techniques primarily focus on edge modifications, often overlooking the crucial role of node feature perturbations in shaping model predictions. To address this limitation, we propose COMBINEX, a novel GNN explainer that generates counterfactual explanations for both node and graph classification tasks. Unlike prior methods, which treat structural and feature-based changes independently, COMBINEX optimally balances modifications to edges and node features by jointly optimizing these perturbations. This unified approach ensures minimal yet effective changes required to flip a model's prediction, resulting in realistic and interpretable counterfactuals. Additionally, COMBINEX seamlessly handles both continuous and discrete node features, enhancing its versatility across diverse datasets and GNN architectures. Extensive experiments on real-world datasets and various GNN architectures demonstrate the effectiveness and robustness of our approach over existing baselines.
