Don't be Fooled: The Misinformation Effect of Explanations in Human-AI Collaboration
Philipp Spitzer, Joshua Holstein, Katelyn Morrison, Kenneth Holstein, Gerhard Satzger, Niklas Kühl
TL;DR
This work addresses a critical gap in XAI by examining how incorrect explanations paired with correct AI advice can mislead humans and degrade post-collaboration capabilities. Using a preregistered online study ($n=160$) on architectural-style classification, the authors show that incorrect explanations impair procedural knowledge and reasoning, while correct explanations maximize human-AI team performance. The findings reveal a dual effect: explanations can boost immediate performance but harm long-term knowledge retention and autonomous task performance if incorrect. The study provides design guidelines and highlights policy-relevant implications to ensure reliable, human-centered AI support in high-stakes decision contexts.
Abstract
Across various applications, humans increasingly use black-box artificial intelligence (AI) systems without insight into these systems' reasoning. To counter this opacity, explainable AI (XAI) methods promise enhanced transparency and interpretability. While recent studies have explored how XAI affects human-AI collaboration, few have examined the potential pitfalls caused by incorrect explanations. The implications for humans can be far-reaching but have not been explored extensively. To investigate this, we ran a study (n=160) on AI-assisted decision-making in which humans were supported by XAI. Our findings reveal a misinformation effect when incorrect explanations accompany correct AI advice with implications post-collaboration. This effect causes humans to infer flawed reasoning strategies, hindering task execution and demonstrating impaired procedural knowledge. Additionally, incorrect explanations compromise human-AI team-performance during collaboration. With our work, we contribute to HCI by providing empirical evidence for the negative consequences of incorrect explanations on humans post-collaboration and outlining guidelines for designers of AI.
