Decentralized Navigation of a Cable-Towed Load using Quadrupedal Robot Team via MARL
Wen-Tse Chen, Minh Nguyen, Zhongyu Li, Guo Ning Sue, Koushil Sreenath
TL;DR
This work tackles scalable, real-time collaboration for cable-towed load navigation by a team of quadrupedal robots in cluttered environments. It introduces a unified, decentralized MARL planner trained with CTDE and a multi-stage curriculum, enabling one-to-twelve robot coordination while maintaining constant inference time. A three-tier hierarchy—global load planner, decentralized MARL planners, and MPC-based locomotion—enables long-horizon planning, local coordination, and robust motion execution, respectively. Key contributions include multi-stage MARL training with knowledge distillation to prevent forgetting, domain randomization for sim-to-real transfer, and demonstration of robust, adaptive behavior across varying loads and changing team sizes, both in simulation and real-world experiments.
Abstract
This work addresses the challenge of enabling a team of quadrupedal robots to collaboratively tow a cable-connected load through cluttered and unstructured environments while avoiding obstacles. Leveraging cables allows the multi-robot system to navigate narrow spaces by maintaining slack when necessary. However, this introduces hybrid physical interactions due to alternating taut and slack states, with computational complexity that scales exponentially as the number of agents increases. To tackle these challenges, we developed a scalable and decentralized system capable of dynamically coordinating a variable number of quadrupedal robots while managing the hybrid physical interactions inherent in the load-towing task. At the core of this system is a novel multi-agent reinforcement learning (MARL)-based planner, designed for decentralized coordination. The MARL-based planner is trained using a centralized training with decentralized execution (CTDE) framework, enabling each robot to make decisions autonomously using only local (ego) observations. To accelerate learning and ensure effective collaboration across varying team sizes, we introduce a tailored training curriculum for MARL. Experimental results highlight the flexibility and scalability of the framework, demonstrating successful deployment with one to four robots in real-world scenarios and up to twelve robots in simulation. The decentralized planner maintains consistent inference times, regardless of the team size. Additionally, the proposed system demonstrates robustness to environment perturbations and adaptability to varying load weights. This work represents a step forward in achieving flexible and efficient multi-legged robotic collaboration in complex and real-world environments.
