From Biased Chatbots to Biased Agents: Examining Role Assignment Effects on LLM Agent Robustness
Linbo Cao, Lihao Sun, Yang Yue
TL;DR
This study shows that demographic-based persona conditioning can meaningfully distort LLM agent behavior, undermining task performance across diverse domains. By evaluating three widely used LLMs on five agentic benchmarks with 23 personas spanning gender, race/origin, religion, and profession, the authors demonstrate performance degradations of up to 26.2% and reveal that biases propagate from language to action. The findings highlight an overlooked vulnerability in LLM agents that can compromise safety and reliability in real-world deployments, underscoring the need for debiasing and robustness interventions. The work provides a foundation for understanding how societal stereotypes can seep into autonomous decision-making and offers directions for designing more stable, fair, and accountable agentic systems.
Abstract
Large Language Models (LLMs) are increasingly deployed as autonomous agents capable of actions with real-world impacts beyond text generation. While persona-induced biases in text generation are well documented, their effects on agent task performance remain largely unexplored, even though such effects pose more direct operational risks. In this work, we present the first systematic case study showing that demographic-based persona assignments can alter LLM agents' behavior and degrade performance across diverse domains. Evaluating widely deployed models on agentic benchmarks spanning strategic reasoning, planning, and technical operations, we uncover substantial performance variations, with degradation of up to 26.2%, driven by task-irrelevant persona cues. These shifts appear across task types and model architectures, indicating that persona conditioning and simple prompt injections can distort an agent's decision-making reliability. Our findings reveal an overlooked vulnerability in current LLM agentic systems: persona assignments can introduce implicit biases and increase behavioral volatility, raising concerns for the safe and robust deployment of LLM agents.
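To make the measured quantity concrete, the following is a minimal sketch of how a persona could be attached to a task prompt and how a performance drop relative to a no-persona baseline could be computed. It rests on assumptions not confirmed by the abstract: the prompt template, the function names (`with_persona`, `relative_degradation`, `worst_case`), and the choice of a relative (rather than absolute) percentage drop are all hypothetical and may differ from the authors' setup.

```python
from typing import Dict, Optional, Tuple


def with_persona(task_prompt: str, persona: Optional[str]) -> str:
    """Prepend a persona line to a task prompt.

    The template is a hypothetical illustration, not the paper's wording.
    Passing None returns the unmodified (baseline) prompt.
    """
    if persona is None:
        return task_prompt
    return f"You are {persona}.\n\n{task_prompt}"


def relative_degradation(baseline: float, score: float) -> float:
    """Percent drop of `score` relative to `baseline` (positive means worse).

    Assumes a relative definition; the abstract does not say whether
    the reported figure is relative or absolute.
    """
    if baseline <= 0:
        raise ValueError("baseline score must be positive")
    return 100.0 * (baseline - score) / baseline


def worst_case(baseline: float, persona_scores: Dict[str, float]) -> Tuple[str, float]:
    """Return the persona with the largest drop and the size of that drop."""
    drops = {p: relative_degradation(baseline, s) for p, s in persona_scores.items()}
    worst = max(drops, key=drops.get)
    return worst, drops[worst]
```

In such a setup, each benchmark would be run once with the baseline prompt and once per persona, and `worst_case` would then summarize the largest persona-induced drop for a given model and task.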
