Cognitive Insights and Stable Coalition Matching for Fostering Multi-Agent Cooperation
Jiaqi Shao, Tianjun Yuan, Tao Lin, Bing Luo
TL;DR
This work investigates how Theory of Mind (ToM) affects cooperation in large-language-model (LLM) powered multi-agent systems and finds that ToM depth can sometimes impair collaboration. It introduces a ToM-aware team-formation framework that forms belief-aligned, stable coalitions by optimizing social welfare under stability constraints and a belief-action alignment metric, while allowing specialization-driven adaptation. The mechanism relies on iterated belief updates via a ToM function $f_{ToM}$, a stability notion against blocking coalitions, and an epsilon threshold to prune weak alignments. Across programming, debate, and reasoning tasks, experiments show that belief-aligned teams, especially with higher-ToM, achieve improved task performance and longer team lifetimes, demonstrating effective, human-like coordination in complex tasks. The findings suggest practical implications for cooperative AI in domains requiring robust, interpretable, and scalable multi-agent collaboration, while acknowledging ethical considerations and the need for safeguards in belief modeling.
Abstract
Cognitive abilities, such as Theory of Mind (ToM), play a vital role in facilitating cooperation in human social interactions. However, our study reveals that agents with higher ToM abilities may not necessarily exhibit better cooperative behavior compared to those with lower ToM abilities. To address this challenge, we propose a novel matching coalition mechanism that leverages the strengths of agents with different ToM levels by explicitly considering belief alignment and specialized abilities when forming coalitions. Our proposed matching algorithm seeks to find stable coalitions that maximize the potential for cooperative behavior and ensure long-term viability. By incorporating cognitive insights into the design of multi-agent systems, our work demonstrates the potential of leveraging ToM to create more sophisticated and human-like coordination strategies that foster cooperation and improve overall system performance.
