Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas
Pietro Bernardelle, Leon Fröhling, Stefano Civelli, Riccardo Lunardi, Kevin Roitero, Gianluca Demartini
TL;DR
The paper addresses how synthetic personas influence the political orientations of large language models when evaluated with the Political Compass Test (PCT). It combines PersonaHub with a two-phase prompting strategy across four open-source LLMs to map persona-driven political distributions and to test manipulation with explicit ideological prompts. Baseline results show a left-libertarian bias in persona responses, with distinct differences across models; explicit prompts shift responses toward right-authoritarian positions, with Llama showing the strongest responsiveness and Zephyr the most resistance. The work highlights significant prompt-based malleability in LLM political expressions and underscores implications for bias mitigation, safety, and the design of debiasing strategies in real-world deployments.
Abstract
The analysis of political biases in large language models (LLMs) has primarily examined these systems as single entities with fixed viewpoints. While various methods exist for measuring such biases, the impact of persona-based prompting on LLMs' political orientation remains unexplored. In this work we leverage PersonaHub, a collection of synthetic persona descriptions, to map the political distribution of persona-based prompted LLMs using the Political Compass Test (PCT). We then examine whether these initial compass distributions can be manipulated through explicit ideological prompting towards diametrically opposed political orientations: right-authoritarian and left-libertarian. Our experiments reveal that synthetic personas predominantly cluster in the left-libertarian quadrant, with models demonstrating varying degrees of responsiveness when prompted with explicit ideological descriptors. While all models demonstrate significant shifts towards right-authoritarian positions, they exhibit more limited shifts towards left-libertarian positions, suggesting an asymmetric response to ideological manipulation that may reflect inherent biases in model training.
