Programmable Cognitive Bias in Social Agents
Xuan Liu, Haoyang Shang, Haojian Jin
TL;DR
CoBRA introduces a principled toolkit for programmable cognitive bias in LLM-based social agents, replacing opaque implicit prompts with a Cognitive Bias Index grounded in classic social experiments. Through its closed-loop, ground-truth calibration (CBI) and a three-space Behavioral Regulation Engine (Prompt Numerical Control, Representation Engineering, and LoRA-based fine-tuning), CoBRA achieves reproducible, tunable bias across models, temperatures, and reasoning modes. Technical benchmarks show robust cross-model reproducibility and controllability, while a demonstration with emotional contagion validates a clear dose-response relationship between programmed bias and emergent behavior. This approach enables rigorous, theory-driven social simulations with broad implications for research, policy testing, and ethically guided AI deployment.
Abstract
This paper introduces CoBRA, a novel toolkit for systematically specifying agent behavior in LLM-based social simulation. We found that conventional approaches that specify agent behaviors through implicit natural language descriptions cannot yield consistent behaviors across models, and the produced agent behaviors do not capture the nuances of the descriptions. In contrast, CoBRA presents a new approach to program agents' cognitive biases explicitly, by grounding agents' expected behaviors using classic social science experiments. CoBRA has two components: (1) Cognitive Bias Index that measures the cognitive bias of a social agent, by quantifying the agent's reactions in a set of validated classical social science experiments; (2) Behavioral Regulation Engine that aligns the agent's behavior to demonstrate controlled cognitive bias. We evaluated CoBRA as an HCI toolkit through demonstration and technical benchmarks. Our results suggest that CoBRA can precisely program the cognitive bias demonstrated in a social agent in a model-agnostic manner.
