Onboard Mission Replanning for Adaptive Cooperative Multi-Robot Systems
Elim Kwan, Rehman Qureshi, Liam Fletcher, Colin Laganier, Victoria Nockles, Richard Walters
TL;DR
This work defines the Cooperative Mission Replanning Problem (CMRP), a real-world, edge-deployable variant of the multi-TSP that includes flexible start locations, variable task times, and cooperative tasking. It introduces GATR, a Graph Attention Network–based encoder–decoder model trained with REINFORCE to generate per-agent task allocations in an on-board setting. The approach discretizes tasks into sub-tasks and uses an asymmetric cost structure to enable collaboration and fast computation on edge hardware, achieving near-optimal performance compared with LKH3 while being orders of magnitude faster on a Raspberry Pi. The results demonstrate strong generalization across problem sizes and offer a path toward multi-objective, probabilistic, and anticipatory planning for resilient multi-robot missions.
Abstract
Cooperative autonomous robotic systems have significant potential for executing complex multi-task missions across space, air, ground, and maritime domains. But they commonly operate in remote, dynamic and hazardous environments, requiring rapid in-mission adaptation without reliance on fragile or slow communication links to centralised compute. Fast, on-board replanning algorithms are therefore needed to enhance resilience. Reinforcement Learning shows strong promise for efficiently solving mission planning tasks when formulated as Travelling Salesperson Problems (TSPs), but existing methods: 1) are unsuitable for replanning, where agents do not start at a single location; 2) do not allow cooperation between agents; 3) are unable to model tasks with variable durations; or 4) lack practical considerations for on-board deployment. Here we define the Cooperative Mission Replanning Problem as a novel variant of multiple TSP with adaptations to overcome these issues, and develop a new encoder/decoder-based model using Graph Attention Networks and Attention Models to solve it effectively and efficiently. Using a simple example of cooperative drones, we show our replanner consistently (90% of the time) maintains performance within 10% of the state-of-the-art LKH3 heuristic solver, whilst running 85-370 times faster on a Raspberry Pi. This work paves the way for increased resilience in autonomous multi-agent systems.
