People readily follow personal advice from AI but it does not improve their well-being
Lennart Luettgau, Vanessa Cheung, Magda Dubois, Keno Juechems, Jessica Bergs, Henry Davidson, Bessie O'Dell, Hannah Rose Kirk, Max Rollwage, Christopher Summerfield
TL;DR
The paper investigates whether people follow personal advice from consumer LLMs and whether such guidance impacts well-being. Using a large, representative UK sample (N=2302) in a longitudinal RCT with a 2x2x2 factorial design plus a control, participants engaged in a 20-minute GPT-4o conversation across health, career, or relationship domains, with 2–3 week follow-ups. The study employs an LLM-based harm autograder and Bayesian GLMs to examine advice content, adherence determinants, and psychological outcomes, finding high adherence but no lasting well-being benefits; personalization increases likelihood of following advice, and safety safeguards yield very low harm rates. These findings imply that while AI advice can influence real-world decisions, it offers limited sustained psychological value, underscoring the need for governance and safety considerations as LLMs become common personal-advisors.
Abstract
People increasingly seek personal advice from large language models (LLMs), yet whether humans follow their advice, and its consequences for their well-being, remains unknown. In a longitudinal randomised controlled trial with a representative UK sample (N = 2,302), 75% of participants who had a 20-minute discussion with GPT-4o about health, careers or relationships subsequently reported following its advice. Based on autograder evaluations of chat transcripts, LLM advice rarely violated safety best practice. When queried 2-3 weeks later, participants who had interacted with personalised AI (with access to detailed user information) followed its advice more often in the real world and reported higher well-being than those advised by non-personalised AI. However, while receiving personal advice from AI temporarily reduced well-being, no differential long-term effects compared to a control emerged. Our results suggest that humans readily follow LLM advice about personal issues but doing so shows no additional well-being benefit over casual conversations.
