The Explanation Necessity for Healthcare AI
Michail Mamalakis, Héloïse de Vareilles, Graham Murray, Pietro Lio, John Suckling
TL;DR
The paper tackles the lack of practical guidance for explainability in healthcare AI by proposing a four-class taxonomy of explanation necessity—self-explainable, semi-explainable, non-explainable, and new-patterns discovery—parameterized by three factors: the robustness of the evaluation protocol, variability in expert observations, and the representational dimensionality $d$. It provides a formal mathematical formulation with inputs $\mathbf{x}$, outputs $\mathbf{y}$, and post-hoc explanations $\mathbf{z}$, introducing global explanations $g_z=R(\mathbf{z})$ and an alignment mechanism $A_l$ for new-pattern discovery against a ground-truth model $R_S(\mathbf{x'})$. A two-flow framework guides practical decision-making: inter-observer variability assessment and dimensionality evaluation, culminating in an $XAI$ Need Decision and, when needed, a Category Decision. The approach aims to improve trust, safety, and regulatory readiness in clinical AI by linking explanation depth to task risk and data characteristics, with domain-specific examples illustrating how thresholds and classifications shift across medical imaging and diagnostic tasks. Future work is needed to normalize thresholds for other domains and to refine risk-adaptive explanations in high-stakes settings.
Abstract
Explainability is a critical factor in enhancing the trustworthiness and acceptance of artificial intelligence (AI) in healthcare, where decisions directly impact patient outcomes. Despite advancements in AI interpretability, clear guidelines on when and to what extent explanations are required in medical applications remain lacking. We propose a novel categorization system comprising four classes of explanation necessity (self-explainable, semi-explainable, non-explainable, and new-patterns discovery), guiding the required level of explanation; whether local (patient or sample level), global (cohort or dataset level), or both. To support this system, we introduce a mathematical formulation that incorporates three key factors: (i) robustness of the evaluation protocol, (ii) variability of expert observations, and (iii) representation dimensionality of the application. This framework provides a practical tool for researchers to determine the appropriate depth of explainability needed, addressing the critical question: When does an AI medical application need to be explained, and at what level of detail?
