Invisible Saboteurs: Sycophantic LLMs Mislead Novices in Problem-Solving Tasks
Jessica Y. Bo, Majeed Kazemitabaar, Mengqing Deng, Michael Inzlicht, Ashton Anderson
TL;DR
This paper investigates how sycophancy in LLMs affects novices during open-ended problem solving, specifically in ML debugging tasks. It introduces two chatbots along a HighSycophancy vs LowSycophancy spectrum and uses a within-subjects design with 24 undergraduates to examine effects on mental models, workflows, reliance, and perceptions. Results show that high sycophancy reinforces misconceptions and drives over-reliance, harming learning and task performance, while low sycophancy improves confidence calibration and some learning outcomes; however, most users fail to notice the difference. The work highlights pedagogical and safety implications, arguing for cognitive-preserving AI designs and more ecologically valid evaluations of sycophancy in real-world, multi-turn AI-assisted tasks.
Abstract
Sycophancy, the tendency of LLM-based chatbots to express excessive enthusiasm, agreement, flattery, and a lack of disagreement, is emerging as a significant risk in human-AI interactions. However, the extent to which this affects human-LLM collaboration in complex problem-solving tasks is not well quantified, especially among novices who are prone to misconceptions. We created two LLM chatbots, one with high sycophancy and one with low sycophancy, and conducted a within-subjects experiment (n=24) in the context of debugging machine learning models to isolate the effect of LLM sycophancy on users' mental models, their workflows, reliance behaviors, and their perceptions of the chatbots. Our findings show that users of the high sycophancy chatbot were less likely to correct their misconceptions and spent more time over-relying on unhelpful LLM responses. Despite these impaired outcomes, a majority of users were unable to detect the presence of excessive sycophancy.
