IMPACTX: Improving Model Performance by Appropriately predicting CorrecT eXplanations
Andrea Apicella, Salvatore Giugliano, Francesco Isgrò, Roberto Prevete
TL;DR
IMPACTX addresses the challenge of using explanations to improve ML model performance in a fully automated, model-agnostic fashion. It introduces a dual-branch architecture with a top feature extractor and classifier plus a bottom latent explanation predictor and decoder that together optimize both prediction accuracy and attribution reconstruction. The approach yields consistent accuracy gains across three DL models and three image datasets (CIFAR-10/100, STL-10) and provides explanations at inference time without external XAI, with attribution maps validated via MoRF analysis. Experimental results indicate that IMPACTX can produce more reliable explanations than post-hoc methods like SHAP and ABN, suggesting a practical path toward self-explanatory, better-performing AI systems.
Abstract
The eXplainable Artificial Intelligence (XAI) research predominantly concentrates to provide explainations about AI model decisions, especially Deep Learning (DL) models. However, there is a growing interest in using XAI techniques to automatically improve the performance of the AI systems themselves. This paper proposes IMPACTX, a novel approach that leverages XAI as a fully automated attention mechanism, without requiring external knowledge or human feedback. Experimental results show that IMPACTX has improved performance respect to the standalone ML model by integrating an attention mechanism based an XAI method outputs during the model training. Furthermore, IMPACTX directly provides proper feature attribution maps for the model's decisions, without relying on external XAI methods during the inference process. Our proposal is evaluated using three widely recognized DL models (EfficientNet-B2, MobileNet, and LeNet-5) along with three standard image datasets: CIFAR-10, CIFAR-100, and STL-10. The results show that IMPACTX consistently improves the performance of all the inspected DL models across all evaluated datasets, and it directly provides appropriate explanations for its responses.
