Compositional Coordination for Multi-Robot Teams with Large Language Models
Zhehui Huang, Guangyao Shi, Yuwei Wu, Vijay Kumar, Gaurav S. Sukhatme
TL;DR
LAN2CB introduces a language-to-coordinated-behavior framework that converts natural-language mission descriptions into executable Python code for robot teams. The system splits into Mission Analysis (NL-to-behavior-tree decomposition) and Code Generation (synthesis of robot control code using a structured knowledge base and a meta-behavior library), enabling automatic execution and replanning through an iterative execution loop. A dedicated NL mission dataset and comprehensive experiments—spanning nine missions, simulations, and real-world demos—demonstrate robust, flexible coordination across diverse tasks and highlight the importance of standardized prompting. The work promises to reduce manual engineering effort and enhance generalization for large, heterogeneous robot teams in dynamic environments.
Abstract
Multi-robot coordination has traditionally relied on a mission-specific and expert-driven pipeline, where natural language mission descriptions are manually translated by domain experts into mathematical formulation, algorithm design, and executable code. This conventional process is labor-intensive, inaccessible to non-experts, and inflexible to changes in mission requirements. Here, we propose LAN2CB (Language to Collective Behavior), a novel framework that leverages large language models (LLMs) to streamline and generalize the multi-robot coordination pipeline. LAN2CB transforms natural language (NL) mission descriptions into executable Python code for multi-robot systems through two core modules: (1) Mission Analysis, which parses mission descriptions into behavior trees, and (2) Code Generation, which leverages the behavior tree and a structured knowledge base to generate robot control code. We further introduce a dataset of natural language mission descriptions to support development and benchmarking. Experiments in both simulation and real-world environments demonstrate that LAN2CB enables robust and flexible multi-robot coordination from natural language, significantly reducing manual engineering effort and supporting broad generalization across diverse mission types. Website: https://sites.google.com/view/lan-cb
