In Search of Verifiability: Explanations Rarely Enable Complementary Performance in AI-Advised Decision Making
Raymond Fok, Daniel S. Weld
TL;DR
The paper tackles the inconsistent empirical findings on AI explanations in decision making by proposing a verifiability-centered theory: explanations improve team performance only when they enable verification of the AI's recommendation, with verification analogous to certificate checking in $NP$-complete problems. It reframes appropriate reliance as strategy-graded reliance, arguing that decisions should depend on expected relative performance rather than post-hoc outcomes. The discussion covers verification spectra, cognitive and interaction paradigms, and broader uses of explanations beyond immediate task performance, including collective decision making and regulatory contexts. Practically, the work guides XAI design toward verifiable, interaction-capable explanations that enable humans to detect AI errors and collaborate effectively with AI in real-world settings.
Abstract
The current literature on AI-advised decision making -- involving explainable AI systems advising human decision makers -- presents a series of inconclusive and confounding results. To synthesize these findings, we propose a simple theory that elucidates the frequent failure of AI explanations to engender appropriate reliance and complementary decision making performance. We argue explanations are only useful to the extent that they allow a human decision maker to verify the correctness of an AI's prediction, in contrast to other desiderata, e.g., interpretability or spelling out the AI's reasoning process. Prior studies find in many decision making contexts AI explanations do not facilitate such verification. Moreover, most tasks fundamentally do not allow easy verification, regardless of explanation method, limiting the potential benefit of any type of explanation. We also compare the objective of complementary performance with that of appropriate reliance, decomposing the latter into the notions of outcome-graded and strategy-graded reliance.
