When Semantics Connect the Swarm: LLM-Driven Fuzzy Control for Cooperative Multi-Robot Underwater Coverage
Jingzehua Xu, Weihang Zhang, Yangyang Li, Hongmiaoyi Zhang, Guanwen Xie, Jiwei Tang, Shuai Zhang, Yi Li
TL;DR
This work addresses cooperative underwater coverage under partial observability and GPS-denied conditions by introducing a semantics-guided fuzzy control framework that couples LLM-driven semantic abstraction with an interpretable fuzzy controller and lightweight semantic communication. Raw multimodal data are distilled into compact semantic tokens describing obstacles, unexplored regions, and OOIs, which are then mapped to smooth steering and gait commands; robots exchange intent through semantic tokens to coordinate exploration while avoiding redundancy. Key contributions include (i) LLM-guided efficient semantic abstraction, (ii) an interpretable fuzzy control module with expert-informed rules and centroid defuzzification, and (iii) semantic communication for swarm coordination, validated in Webots simulations across reef-like environments. The findings highlight robust OOI-oriented navigation and improved cooperative coverage efficiency under limited sensing and communication, suggesting a scalable path toward interpretable, semantic-driven underwater swarms in GPS-denied, map-free settings.
Abstract
Underwater multi-robot cooperative coverage remains challenging due to partial observability, limited communication, environmental uncertainty, and the lack of access to global localization. To address these issues, this paper presents a semantics-guided fuzzy control framework that couples Large Language Models (LLMs) with interpretable control and lightweight coordination. Raw multimodal observations are compressed by the LLM into compact, human-interpretable semantic tokens that summarize obstacles, unexplored regions, and Objects Of Interest (OOIs) under uncertain perception. A fuzzy inference system with pre-defined membership functions then maps these tokens into smooth and stable steering and gait commands, enabling reliable navigation without relying on global positioning. Then, we further coordinate multiple robots by introducing semantic communication that shares intent and local context in linguistic form, enabling agreement on who explores where while avoiding redundant revisits. Extensive simulations in unknown reef-like environments show that, under limited sensing and communication, the proposed framework achieves robust OOI-oriented navigation and cooperative coverage with improved efficiency and adaptability, narrowing the gap between semantic cognition and distributed underwater control in GPS-denied, map-free conditions.
