Do Androids Dream of Unseen Puppeteers? Probing for a Conspiracy Mindset in Large Language Models
Francesco Corso, Francesco Pierri, Gianmarco De Francisci Morales
TL;DR
The paper investigates whether large language models exhibit conspiratorial tendencies and how sociodemographic conditioning and prompting influence such reasoning. It adapts validated psychometric conspiracy surveys into prompts and evaluates innate, demographically conditioned, and conspiracy-conditioned responses across open-weight LLMs. Baseline models show partial agreement with conspiracy beliefs, demographic conditioning biases results unevenly, and targeted conspiracy prompts robustly shift models toward conspiratorial responses, which has safety implications for deployment. The work advances computational social science by using AI as a proxy to study high-level cognitive constructs, while highlighting the risk of manipulation and the need for mitigation strategies.
Abstract
In this paper, we investigate whether Large Language Models (LLMs) exhibit conspiratorial tendencies, whether they display sociodemographic biases in this domain, and how easily they can be conditioned into adopting conspiratorial perspectives. Conspiracy beliefs play a central role in the spread of misinformation and in shaping distrust toward institutions, making them a critical testbed for evaluating the social fidelity of LLMs. LLMs are increasingly used as proxies for studying human behavior, yet little is known about whether they reproduce higher-order psychological constructs such as a conspiratorial mindset. To bridge this research gap, we administer validated psychometric surveys measuring conspiracy mindset to multiple models under different prompting and conditioning strategies. Our findings reveal that LLMs show partial agreement with elements of conspiracy belief, and that conditioning with sociodemographic attributes produces uneven effects, exposing latent demographic biases. Moreover, targeted prompts can easily shift model responses toward conspiratorial directions, underscoring both the susceptibility of LLMs to manipulation and the potential risks of their deployment in sensitive contexts. These results highlight the importance of critically evaluating the psychological dimensions embedded in LLMs, both to advance computational social science and to inform possible mitigation strategies against harmful uses.
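To make the survey-administration idea concrete, the following is a minimal sketch of how a Likert-style item could be turned into a prompt, optionally prefixed with a conditioning text, and scored. It is not the authors' pipeline: the item texts are placeholders, the 1-5 scale, the prompt wording, and the names `build_prompt`, `parse_rating`, and `administer` are all assumptions made for illustration, and `ask` stands in for whatever function queries a model.

```python
import re
from statistics import mean
from typing import Callable, List, Optional

# Placeholder items (assumption): a real study would use the exact wording
# of a validated conspiracy-mindset questionnaire.
ITEMS = ["<survey item 1>", "<survey item 2>"]

SCALE_MIN, SCALE_MAX = 1, 5  # assumed Likert range, not taken from the paper


def build_prompt(item: str, conditioning: Optional[str] = None) -> str:
    """Wrap one survey item in a prompt, optionally preceded by conditioning text."""
    lines = []
    if conditioning:
        # Could hold a sociodemographic persona or a conspiratorial framing.
        lines.append(conditioning)
    lines.append(
        f"Rate your agreement with the statement on a scale from "
        f"{SCALE_MIN} (strongly disagree) to {SCALE_MAX} (strongly agree)."
    )
    lines.append(f"Statement: {item}")
    lines.append("Reply with a single number.")
    return "\n".join(lines)


def parse_rating(reply: str) -> Optional[int]:
    """Return the first integer in the reply if it lies on the scale, else None."""
    match = re.search(r"\d+", reply)
    if match is None:
        return None
    value = int(match.group())
    return value if SCALE_MIN <= value <= SCALE_MAX else None


def administer(
    ask: Callable[[str], str],
    items: List[str],
    conditioning: Optional[str] = None,
) -> Optional[float]:
    """Ask every item once and return the mean of the valid ratings."""
    ratings = [parse_rating(ask(build_prompt(item, conditioning))) for item in items]
    valid = [r for r in ratings if r is not None]
    return mean(valid) if valid else None
```

Comparing `administer(ask, ITEMS)` with `administer(ask, ITEMS, conditioning="...")` for the same model would give a rough per-condition contrast. A real study would also need repeated sampling and a more careful treatment of refusals and malformed replies than the simple first-integer parse used here.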
