MAPS: Multi-Agent Personality Shaping for Collaborative Reasoning
Jian Zhang, Zhiyuan Wang, Zhangqi Wang, Fangzhi Xu, Qika Lin, Lingling Zhang, Rui Mao, Erik Cambria, Jun Liu
TL;DR
MAPS introduces a multi-agent framework where five personality-driven agents (Interpreter, Aligner, Scholar, Solver, and Critic) collaborate on multimodal reasoning, with the Critic providing Socratic feedback to enable reflective revision. Grounded in the Big Five personality theory, the approach yields a four-stage reasoning pipeline that is iteratively refined through a feedback loop, backed by a formal information-bottleneck and free-energy perspective. Empirical results on MathVista, OlympiadBench, and EMMA show state-of-the-art performance, surpassing human experts and generalizing across diverse base models such as GPT-4o, Qwen, Gemini, and others. The study demonstrates that structured personality-driven collaboration coupled with reflective critique enhances reasoning depth, accuracy, and robustness in complex multimodal tasks, offering a scalable blueprint for advanced AI problem solving.
Abstract
Collaborative reasoning with multiple agents offers the potential for more robust and diverse problem-solving. However, existing approaches often suffer from homogeneous agent behaviors and lack of reflective and rethinking capabilities. We propose Multi-Agent Personality Shaping (MAPS), a novel framework that enhances reasoning through agent diversity and internal critique. Inspired by the Big Five personality theory, MAPS assigns distinct personality traits to individual agents, shaping their reasoning styles and promoting heterogeneous collaboration. To enable deeper and more adaptive reasoning, MAPS introduces a Critic agent that reflects on intermediate outputs, revisits flawed steps, and guides iterative refinement. This integration of personality-driven agent design and structured collaboration improves both reasoning depth and flexibility. Empirical evaluations across three benchmarks demonstrate the strong performance of MAPS, with further analysis confirming its generalizability across different large language models and validating the benefits of multi-agent collaboration.
