Explaining Hypergraph Neural Networks: From Local Explanations to Global Concepts
Shiye Su, Iulia Duta, Lucie Charlotte Magister, Pietro Liò
TL;DR
This work tackles the explainability gap for hypergraph neural networks by introducing SHypX, a model-agnostic post-hoc explainer that yields local explanations as subhypergraphs and global explanations via unsupervised concept extraction. The local explainer optimizes a joint objective that balances faithfulness (via $D_{KL}$ between full and subhypergraph predictions) and concision (via $|G_{sub}|_1$) using differentiable Gumbel-Softmax sampling over potential node-hyperedge links. SHypX achieves superior fidelity and conciseness compared to baselines on four real and four synthetic hypergraphs, and enables a tunable faithfulness-concision tradeoff, while the global explainer provides class-level explanations through representative concepts. The introduction of novel synthetic hypergraph benchmarks and generalized fidelity metrics strengthens evaluation and offers practical insights for debugging and deploying hyperGNNs in structure-rich domains.
Abstract
Hypergraph neural networks are a class of powerful models that leverage the message passing paradigm to learn over hypergraphs, a generalization of graphs well-suited to describing relational data with higher-order interactions. However, such models are not naturally interpretable, and their explainability has received very limited attention. We introduce SHypX, the first model-agnostic post-hoc explainer for hypergraph neural networks that provides both local and global explanations. At the instance-level, it performs input attribution by discretely sampling explanation subhypergraphs optimized to be faithful and concise. At the model-level, it produces global explanation subhypergraphs using unsupervised concept extraction. Extensive experiments across four real-world and four novel, synthetic hypergraph datasets demonstrate that our method finds high-quality explanations which can target a user-specified balance between faithfulness and concision, improving over baselines by 25 percent points in fidelity on average.
