Why do explanations fail? A typology and discussion on failures in XAI

Clara Bove; Thibault Laugel; Marie-Jeanne Lesot; Charles Tijus; Marcin Detyniecki

Why do explanations fail? A typology and discussion on failures in XAI

Clara Bove, Thibault Laugel, Marie-Jeanne Lesot, Charles Tijus, Marcin Detyniecki

TL;DR

This paper introduces a holistic typology of XAI failures by separating system-specific (technical) failures from user-specific (interpretation) failures, addressing observed fragmentation in prior work. It systematically builds a taxonomy from literature (n=108 papers) using a meta-characteristic framework and ends conditions, capturing how failures arise and propagate through model, explainer, and user interactions. The typology distinguishes misleading, competing, unstable, and incompatible system-level failures, as well as mismatches, counterintuitive explanations, and biased inferences at the user level, and discusses mitigation strategies across design, evaluation, and interfaces. The authors advocate for holistic, user-centered, and transparent XAI design, stressing the need for integrative approaches that consider interactions among components and richer XUI designs to improve explainability and trust in AI systems. Overall, the work provides a structured lens to diagnose XAI limitations and map research directions toward more robust, personalized, and comprehensible explanations with practical implications for practitioners and researchers.

Abstract

As Machine Learning models achieve unprecedented levels of performance, the XAI domain aims at making these models understandable by presenting end-users with intelligible explanations. Yet, some existing XAI approaches fail to meet expectations: several issues have been reported in the literature, generally pointing out either technical limitations or misinterpretations by users. In this paper, we argue that the resulting harms arise from a complex overlap of multiple failures in XAI, which existing ad-hoc studies fail to capture. This work therefore advocates for a holistic perspective, presenting a systematic investigation of limitations of current XAI methods and their impact on the interpretation of explanations. % By distinguishing between system-specific and user-specific failures, we propose a typological framework that helps revealing the nuanced complexities of explanation failures. Leveraging this typology, we discuss some research directions to help practitioners better understand the limitations of XAI systems and enhance the quality of ML explanations.

Why do explanations fail? A typology and discussion on failures in XAI

TL;DR

Abstract

Paper Structure (59 sections, 2 figures, 1 table)

This paper contains 59 sections, 2 figures, 1 table.

Introduction
Context and Related Works
Explanation Process Components
Notion of Explanation Failure
Related Works
Research Questions and Methodology
Research Questions
RQ1: What are the different failures that may arise during the explanation process?
RQ2: How can these failures be avoided?
Methodology
Meta-characteristic
Ending conditions
Data collection
Overview of the Result
Proposed Typology: System-specific XAI Failures
...and 44 more sections

Figures (2)

Figure 1: Types of system-specific explanation failures: misleading explanations ascribed to the ML system and contradicting ones ascribed to the explainer. The cross and the $\neq$ symbol indicate the step at which the failure takes place.
Figure 2: Types of user-specific explanation failures in ML explanations. The ML model is supposed to give an accurate prediction and the explainer to generate faithful explanations. The cross indicates the step at which the failure takes place.

Why do explanations fail? A typology and discussion on failures in XAI

TL;DR

Abstract

Why do explanations fail? A typology and discussion on failures in XAI

Authors

TL;DR

Abstract

Table of Contents

Figures (2)