Faithful and Accurate Self-Attention Attribution for Message Passing Neural Networks via the Computation Tree Viewpoint
Yong-Min Shin, Siqing Li, Xin Cao, Won-Yong Shin
TL;DR
This work addresses the gap between the widespread use of self-attention in Att-GNNs and the reliability of attention as explanations. It introduces GAtt, a computation-tree–based edge attribution method, to extract faithful edge contributions from attention weights in Att-GNNs. By aligning edge attributions with the feed-forward computation tree and enforcing two principles—proximity to the target node and adjustment by path position—GAtt provides a simple, deterministic, and hyperparameter-free way to quantify edge importance. Empirical results demonstrate that GAtt substantially improves faithfulness and explanation accuracy over naive attention-averaging and compares favorably with several post-hoc explainers across real-world and synthetic datasets, highlighting the potential of attention-based explanations when properly interpreted. The method is model-agnostic within Att-GNNs and offers efficient matrix-based computation, enabling scalable explanations for large graphs, with code available for reproducibility."
Abstract
The self-attention mechanism has been adopted in various popular message passing neural networks (MPNNs), enabling the model to adaptively control the amount of information that flows along the edges of the underlying graph. Such attention-based MPNNs (Att-GNNs) have also been used as a baseline for multiple studies on explainable AI (XAI) since attention has steadily been seen as natural model interpretations, while being a viewpoint that has already been popularized in other domains (e.g., natural language processing and computer vision). However, existing studies often use naive calculations to derive attribution scores from attention, undermining the potential of attention as interpretations for Att-GNNs. In our study, we aim to fill the gap between the widespread usage of Att-GNNs and their potential explainability via attention. To this end, we propose GATT, edge attribution calculation method for self-attention MPNNs based on the computation tree, a rooted tree that reflects the computation process of the underlying model. Despite its simplicity, we empirically demonstrate the effectiveness of GATT in three aspects of model explanation: faithfulness, explanation accuracy, and case studies by using both synthetic and real-world benchmark datasets. In all cases, the results demonstrate that GATT greatly improves edge attribution scores, especially compared to the previous naive approach. Our code is available at https://github.com/jordan7186/GAtt.
