Why do explanations fail? A typology and discussion on failures in XAI
Clara Bove, Thibault Laugel, Marie-Jeanne Lesot, Charles Tijus, Marcin Detyniecki
TL;DR
This paper introduces a holistic typology of XAI failures by separating system-specific (technical) failures from user-specific (interpretation) failures, addressing observed fragmentation in prior work. It systematically builds a taxonomy from literature (n=108 papers) using a meta-characteristic framework and ends conditions, capturing how failures arise and propagate through model, explainer, and user interactions. The typology distinguishes misleading, competing, unstable, and incompatible system-level failures, as well as mismatches, counterintuitive explanations, and biased inferences at the user level, and discusses mitigation strategies across design, evaluation, and interfaces. The authors advocate for holistic, user-centered, and transparent XAI design, stressing the need for integrative approaches that consider interactions among components and richer XUI designs to improve explainability and trust in AI systems. Overall, the work provides a structured lens to diagnose XAI limitations and map research directions toward more robust, personalized, and comprehensible explanations with practical implications for practitioners and researchers.
Abstract
As Machine Learning models achieve unprecedented levels of performance, the XAI domain aims at making these models understandable by presenting end-users with intelligible explanations. Yet, some existing XAI approaches fail to meet expectations: several issues have been reported in the literature, generally pointing out either technical limitations or misinterpretations by users. In this paper, we argue that the resulting harms arise from a complex overlap of multiple failures in XAI, which existing ad-hoc studies fail to capture. This work therefore advocates for a holistic perspective, presenting a systematic investigation of limitations of current XAI methods and their impact on the interpretation of explanations. % By distinguishing between system-specific and user-specific failures, we propose a typological framework that helps revealing the nuanced complexities of explanation failures. Leveraging this typology, we discuss some research directions to help practitioners better understand the limitations of XAI systems and enhance the quality of ML explanations.
