Position: Explain to Question not to Justify
Przemyslaw Biecek, Wojciech Samek
TL;DR
The paper argues that the XAI field suffers from competing goals, presenting a two-culture framework: BLUE XAI (human-values oriented) and RED XAI (model validation oriented). It shows that RED XAI is under-explored and outlines concrete challenges, such as constructing complementary explanations, establishing benchmarks and tooling, adopting an explorer mindset, and leveraging Rashomon sets for debugging and discovery. It contends that focusing on RED XAI can advance model robustness and safety, complementing user-centric explanations. The discussion includes a Covid-19 risk example to illustrate how BLUE and RED perspectives can converge and where specialized RED tools are needed. The work advocates for methodological diversification, education shifts, and the development of standards to accelerate RED XAI research.
Abstract
Explainable Artificial Intelligence (XAI) is a young but very promising field of research. Unfortunately, the progress in this field is currently slowed down by divergent and incompatible goals. We separate various threads tangled within the area of XAI into two complementary cultures of human/value-oriented explanations (BLUE XAI) and model/validation-oriented explanations (RED XAI). This position paper argues that the area of RED XAI is currently under-explored, i.e., more methods for explainability are desperately needed to question models (e.g., extract knowledge from well-performing models as well as spotting and fixing bugs in faulty models), and the area of RED XAI hides great opportunities and potential for important research necessary to ensure the safety of AI systems. We conclude this paper by presenting promising challenges in this area.
