MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Rui Ye, Shuo Tang, Rui Ge, Yaxin Du, Zhenfei Yin, Siheng Chen, Jing Shao
TL;DR
MAS-GPT tackles the bottleneck of designing LLM-based multi-agent systems (MAS) by reframing MAS construction as a generative task whose output is executable Python code. It introduces a consistency-oriented data pipeline to create query-MAS pairs and trains an open-source 32B LLM to generate a query-specific MAS in a single inference. Across 9 benchmarks and 5 driving LLMs, MAS-GPT consistently surpasses 10+ baseline MAS methods, with notable gains on challenging tasks and reduced inference costs. Because one inference of a medium-sized model replaces manual configuration or repeated calls to advanced LLMs, MAS design becomes cheaper and more adaptable to new queries.
Abstract
LLM-based multi-agent systems (MAS) have shown significant potential in tackling diverse tasks. However, existing approaches to designing effective MAS rely heavily on manual configuration or on multiple calls to advanced LLMs, resulting in poor adaptability and high inference costs. In this paper, we simplify the process of building an MAS by reframing it as a generative language task, where the input is a user query and the output is a corresponding MAS. To address this novel task, we unify the representation of MAS as executable code and propose a consistency-oriented data construction pipeline to create a high-quality dataset of coherent and consistent query-MAS pairs. Using this dataset, we train MAS-GPT, an open-source medium-sized LLM that is capable of generating a query-adaptive MAS within a single LLM inference. The generated MAS can be directly applied to process the user query and deliver a high-quality response. Extensive experiments on 9 benchmarks and 5 LLMs show that MAS-GPT consistently outperforms 10+ baseline MAS methods across diverse settings, indicating its high effectiveness, efficiency, and strong generalization ability. Code will be available at https://github.com/rui-ye/MAS-GPT.
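
To make the idea of "MAS as executable code" concrete, the following is a minimal sketch and not the authors' implementation. It assumes a generated MAS is a Python source string defining a `forward(query)` function that reaches the driving LLM through a single helper. The names `call_llm`, `GENERATED_MAS`, `load_mas`, and `run_query`, as well as the two-solver-plus-aggregator layout, are hypothetical choices made for illustration.

```python
from typing import Callable, Dict

# Hypothetical stand-in for the driving LLM (assumption); a real system
# would call a model API here instead of echoing the prompt.
def call_llm(prompt: str) -> str:
    return f"[reply to: {prompt[:40]}]"

# Hypothetical example of the source code a MAS generator might emit for
# one query: two solver agents followed by one aggregator agent.
GENERATED_MAS = '''
def forward(query):
    roles = ("a mathematician", "a programmer")
    # Each solver agent answers the query independently.
    drafts = [call_llm(f"As {role}, answer: {query}") for role in roles]
    # The aggregator agent merges the drafts into one final answer.
    joined = " | ".join(drafts)
    return call_llm(f"Combine these drafts into one answer: {joined}")
'''

def load_mas(source: str, llm: Callable[[str], str]) -> Callable[[str], str]:
    """Execute generated MAS source and return its forward function."""
    # The generated code sees the LLM helper as a global named call_llm.
    namespace: Dict[str, object] = {"call_llm": llm}
    # exec on model-generated code should be sandboxed in any real deployment.
    exec(source, namespace)
    return namespace["forward"]

def run_query(query: str, mas_source: str = GENERATED_MAS) -> str:
    """Build the MAS from its source and apply it to the user query."""
    mas = load_mas(mas_source, call_llm)
    return mas(query)
```

In this sketch, producing the MAS costs one generation step (the string `GENERATED_MAS`), and answering the query costs three calls to the driving LLM: one per solver and one for the aggregator. Swapping in a different source string changes the agent layout without touching `load_mas` or `run_query`.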
