Evaluating the Simulation of Human Personality-Driven Susceptibility to Misinformation with LLMs
Manuel Pratelli, Marinella Petrocchi
TL;DR
This study probes whether LLM agents conditioned on Big-Five personality profiles can mimic human patterns of misinformation susceptibility in news discernment. By mapping each human participant to a personality-conditioned LLM and evaluating the same 24 headlines used in human studies, the authors assess whether trait–discernment associations emerge in synthetic agents. They find that conditioning yields significant shifts in judgments and that GPT-4o closely mirrors human patterns for Agreeableness, Conscientiousness, and Open-Mindedness, though Extraversion and Negative Emotionality show model-specific biases. The work highlights both the promise of scalable, ethically lighter behavioral simulations and the limitations posed by model biases and prompt design, offering a foundation for stress-testing interventions against misinformation while urging caution and further validation. Practically, personality-aligned agents can help prototype resilience strategies and educational tools, but responsible use requires debiasing efforts and cross-cultural verification.
Abstract
Large language models (LLMs) make it possible to generate synthetic behavioural data at scale, offering an ethical and low-cost alternative to human experiments. Whether such data can faithfully capture psychological differences driven by personality traits, however, remains an open question. We evaluate the capacity of LLM agents, conditioned on Big-Five profiles, to reproduce personality-based variation in susceptibility to misinformation, focusing on news discernment, the ability to judge true headlines as true and false headlines as false. Leveraging published datasets in which human participants with known personality profiles rated headline accuracy, we create matching LLM agents and compare their responses to the original human patterns. Certain trait-misinformation associations, notably those involving Agreeableness and Conscientiousness, are reliably replicated, whereas others diverge, revealing systematic biases in how LLMs internalize and express personality. The results underscore both the promise and the limits of personality-aligned LLMs for behavioral simulation, and offer new insight into modeling cognitive diversity in artificial agents.
