The Power of Personality: A Human Simulation Perspective to Investigate Large Language Model Agents
Yifan Duan, Yihong Tang, Xuefeng Bai, Kehai Chen, Juntao Li, Min Zhang
TL;DR
This study adopts a human psychological simulation framework to systematically examine how Big Five personality traits influence LLM behavior across closed tasks, open-ended tasks, and multi-agent collaboration, using a set of $2^5=32$ trait configurations. By validating trait expression with the BFI-2 scale and evaluating across multiple models (e.g., Qwen-32B, Qwen-14B, Llama-8B) and benchmarks (MMLU, GPQA, TTCT), the authors show that certain traits consistently modulate reasoning accuracy and creativity, while multi-agent configurations yield collective intelligence distinct from single-agent performance. The findings align with some human psychology patterns (e.g., openness enhancing originality) but also reveal model-dependent variability, underscoring the role of model size and architecture. The work suggests personality-driven prompt design as a route to tailoring capabilities and guiding safe, collaborative AI systems in real-world tasks.
Abstract
Large language models (LLMs) excel in both closed tasks (including problem-solving, and code generation) and open tasks (including creative writing), yet existing explanations for their capabilities lack connections to real-world human intelligence. To fill this gap, this paper systematically investigates LLM intelligence through the lens of ``human simulation'', addressing three core questions: (1) \textit{How do personality traits affect problem-solving in closed tasks?} (2) \textit{How do traits shape creativity in open tasks?} (3) \textit{How does single-agent performance influence multi-agent collaboration?} By assigning Big Five personality traits to LLM agents and evaluating their performance in single- and multi-agent settings, we reveal that specific traits significantly influence reasoning accuracy (closed tasks) and creative output (open tasks). Furthermore, multi-agent systems exhibit collective intelligence distinct from individual capabilities, driven by distinguishing combinations of personalities.
