From Raw Data to Shared 3D Semantics: Task-Oriented Communication for Multi-Robot Collaboration
Ruibo Xue, Jiedan Tan, Fang Liu, Jingwen Tong, Taotao Wang, Shuoyao Wang
TL;DR
The paper tackles the data–decision bottleneck in multi-robot systems operating in unknown 3D environments by proposing a decentralized task-oriented semantic communication framework. It constructs a geometry-aware perception pipeline using PiDiNet to extract sparse structural edges, 3D anchors, and object primitives, transmitting only event-driven semantic updates to build a lightweight shared 3D semantic scene for decentralized coordination. Key contributions include the semantic message format with explicit bit budgeting, an event-driven transmission policy with priority rules, and a decentralized pipeline for semantic-aware task allocation, motion planning, and cooperative transport, all without centralized fusion or dense maps. Experiments show dramatic bandwidth reductions (over 200×) and substantial improvements in task completion speed (up to ~4×) and travel efficiency (≈3×) as team size scales, demonstrating the practical impact of aligning communication with task utility in bandwidth-constrained multi-robot collaboration.
Abstract
Multi-robot systems (MRS) rely on exchanging raw sensory data to cooperate in complex three-dimensional (3D) environments. However, this strategy often leads to severe communication congestion and high transmission latency, significantly degrading collaboration efficiency. This paper proposes a decentralized task-oriented semantic communication framework for multi-robot collaboration in unknown 3D environments. Each robot locally extracts compact, task-relevant semantics using a lightweight Pixel Difference Network (PiDiNet) with geometric processing. It shares only these semantic updates to build a task-sufficient 3D scene representation that supports cooperative perception, navigation, and object transport. Our numerical results show that the proposed method exhibits a dramatic reduction in communication overhead from $858.6$ Mb to $4.0$ Mb (over $200\times$ compression gain) while improving collaboration efficiency by shortening task completion from $1,054$ to $281$ steps.
