Enhancing Vision Transformer Explainability Using Artificial Astrocytes
Nicolas Echevarrieta-Catalan, Ana Ribas-Rodriguez, Francisco Cedron, Odelia Schwartz, Vanessa Aguiar-Pulido
TL;DR
Vision transformers often generate explanations that diverge from human intuition, hindering trust in automated vision systems. The authors propose ViTA, a training-free, neuroscience-inspired modification that injects artificial astrocytes into the first self-attention block to modulate activations and enhance human-aligned explanations. Across Grad-CAM and Grad-CAM++ heatmaps evaluated against the ClickMe ground truth, ViTA yields statistically significant improvements in alignment and produces more focused object-centered explanations. This biologically inspired modulation offers a general, training-free route to improve XAI for vision models and can extend to other architectures and datasets.
Abstract
Machine learning models achieve high precision, but their decision-making processes often lack explainability. Furthermore, as model complexity increases, explainability typically decreases. Existing efforts to improve explainability primarily involve developing new eXplainable artificial intelligence (XAI) techniques or incorporating explainability constraints during training. While these approaches yield specific improvements, their applicability remains limited. In this work, we propose the Vision Transformer with artificial Astrocytes (ViTA). This training-free approach is inspired by neuroscience and enhances the reasoning of a pretrained deep neural network to generate more human-aligned explanations. We evaluated our approach employing two well-known XAI techniques, Grad-CAM and Grad-CAM++, and compared it to a standard Vision Transformer (ViT). Using the ClickMe dataset, we quantified the similarity between the heatmaps produced by the XAI techniques and a (human-aligned) ground truth. Our results consistently demonstrate that incorporating artificial astrocytes enhances the alignment of model explanations with human perception, leading to statistically significant improvements across all XAI techniques and metrics utilized.
