Hierarchical LLMs In-the-Loop Optimization for Real-Time Multi-Robot Target Tracking under Unknown Hazards
Yuwei Wu, Yuezhan Tao, Peihan Li, Guangyao Shi, Gaurav S. Sukhatme, Vijay Kumar, Lifeng Zhou
TL;DR
The paper tackles real-time, risk-aware multi-robot target tracking in unknown hazardous environments where sensor and communication attacks can degrade performance. It introduces a bi-level optimization framework in which an outer Task LLM reconfigures task assignments and an inner Action LLM provides rapid performance adaptations, all guided by a centralized optimizer and optionally a human supervisor; the outer/inner loop outputs are integrated with constraints through the objective $J(A,u)$ and feasibility conditions on $H(A,u)$ and $G(A,u)$. Key contributions include a novel hierarchical LLM-in-the-loop design, prompt-design and output-verification mechanisms (e.g., $eta(A)$) to ensure constraint adherence, and thorough validation in both simulation with multiple LLMs and real hardware experiments. The results demonstrate improved tracking performance, robustness to hazards, and real-time feasibility, highlighting the potential of safety-aware LLM-assisted coordination for complex, real-world robotic teams.
Abstract
Real-time multi-robot coordination in hazardous and adversarial environments requires fast, reliable adaptation to dynamic threats. While Large Language Models (LLMs) offer strong high-level reasoning capabilities, the lack of safety guarantees limits their direct use in critical decision-making. In this paper, we propose a hierarchical optimization framework that integrates LLMs into the decision loop for multi-robot target tracking in dynamic and hazardous environments. Rather than generating control actions directly, LLMs are used to generate task configuration and adjust parameters in a bi-level task allocation and planning problem. We formulate multi-robot coordination for tracking tasks as a bi-level optimization problem, with LLMs to reason about potential hazards in the environment and the status of the robot team and modify both the inner and outer levels of the optimization. This hierarchical approach enables real-time adjustments to the robots' behavior. Additionally, a human supervisor can offer broad guidance and assessments to address unexpected dangers, model mismatches, and performance issues arising from local minima. We validate our proposed framework in both simulation and real-world experiments with comprehensive evaluations, demonstrating its effectiveness and showcasing its capability for safe LLM integration for multi-robot systems.
