Decentralized Unlabeled Multi-agent Pathfinding Via Target And Priority Swapping (With Supplementary)
Stepan Dergachev, Konstantin Yakovlev
TL;DR
This work addresses anonymous multi-agent pathfinding (AMAPF) in a fully decentralized setting, where each agent chooses its own target and motion using only local communication. Building on TSWAP, the authors introduce TP-SWAP, a target and priority swapping framework that keeps goal assignments consistent through a persistent TP table and local information exchange, with TP-UPDATE ensuring progress toward completion. On the theoretical side, the authors prove completeness, with termination guaranteed by a decreasing potential function. Extensive experiments on MovingAI maps show that TP-SWAP outperforms naive decentralization and approaches centralized baselines in flowtime, with makespan benefits that improve with agent density. The approach advances scalable, complete decentralized AMAPF, with practical implications for distributed robotic swarms and autonomous systems in uncertain communication environments.
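To make the terms in this summary concrete, the following minimal Python sketch illustrates what a consistent goal assignment is and how the two cost objectives, flowtime and makespan, are commonly computed. It is an illustration only, not the authors' implementation: the function names, the agent identifiers, and the arrival times are hypothetical, and flowtime is assumed here to be the sum of the time steps at which agents reach their goals, with makespan being the latest of those time steps.

```python
from typing import Dict, Tuple

Cell = Tuple[int, int]  # hypothetical grid cell given as (row, column)


def is_consistent(assignment: Dict[str, Cell]) -> bool:
    """Return True if no two agents are heading towards the same goal."""
    goals = list(assignment.values())
    return len(goals) == len(set(goals))


def flowtime(arrival_times: Dict[str, int]) -> int:
    """Sum of the time steps at which each agent reaches its goal (assumed definition)."""
    return sum(arrival_times.values())


def makespan(arrival_times: Dict[str, int]) -> int:
    """Time step at which the last agent reaches its goal (assumed definition)."""
    return max(arrival_times.values())


# Hypothetical example: three agents, each with a distinct goal cell.
assignment = {"a1": (0, 3), "a2": (2, 2), "a3": (4, 1)}
arrival_times = {"a1": 5, "a2": 3, "a3": 7}

print(is_consistent(assignment))  # True
print(flowtime(arrival_times))    # 15
print(makespan(arrival_times))    # 7
```

An assignment in which two agents share a goal, such as `{"a1": (0, 3), "a2": (0, 3)}`, would fail the consistency check. This is the situation that a target swapping procedure has to resolve.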
Abstract
In this paper we study Anonymous MAPF (AMAPF), a challenging variant of the multi-agent pathfinding problem (MAPF) in which a set of agents must reach a set of goal locations, but it does not matter which agent reaches which goal. Current optimal and suboptimal AMAPF solvers rely on a centralized controller that is in charge of both target assignment and pathfinding. We extend the state of the art and present the first AMAPF solver capable of solving the problem in a fully decentralized fashion, in which each agent makes decisions individually and relies only on local communication with the others. The core of our method is a priority and target swapping procedure tailored to produce consistent goal assignments (i.e. making sure that no two agents are heading towards the same goal). Coupling it with an established rule-based path planning procedure yields TP-SWAP, an efficient and flexible approach to solving decentralized AMAPF. On the theoretical side, we prove that TP-SWAP is complete (i.e. TP-SWAP guarantees that each target will be reached by some agent). Empirically, we evaluate TP-SWAP across a wide range of setups and compare it to both centralized and decentralized baselines. TP-SWAP outperforms the fully-decentralized competitor and can even outperform the semi-decentralized one (i.e. the one relying on the initial consistent goal assignment) in terms of flowtime (a widespread cost objective in MAPF
