Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic
Thomy Phan, Benran Zhang, Shao-Hung Chan, Sven Koenig
TL;DR
This work tackles the scalability limits of Anytime MAPF by introducing ADDRESS, a single adaptive destroy-and-repair heuristic for MAPF-LNS. ADDRESS replaces multiple stationary destroy heuristics with a restricted Thompson Sampling strategy applied to the top-$K$ delayed agents to seed adaptive LNS neighborhoods, enabling faster improvement in solution cost and AUC, even in large-scale instances with up to 1000 agents. The authors demonstrate substantial performance gains across five MAPF benchmarks, outperforming MAPF-LNS, BALANCE, and LaCAM*, while revealing that the approach is robust across parameter choices and time budgets. The proposed method offers a simple yet effective blueprint for online adaptation in combinatorial optimization and can be extended to other domains where high-cost variables govern neighborhood generation.
Abstract
Anytime multi-agent path finding (MAPF) is a promising approach to scalable path optimization in multi-agent systems. MAPF-LNS, based on Large Neighborhood Search (LNS), is the current state-of-the-art approach where a fast initial solution is iteratively optimized by destroying and repairing selected paths of the solution. Current MAPF-LNS variants commonly use an adaptive selection mechanism to choose among multiple destroy heuristics. However, to determine promising destroy heuristics, MAPF-LNS requires a considerable amount of exploration time. As common destroy heuristics are non-adaptive, any performance bottleneck caused by these heuristics cannot be overcome via adaptive heuristic selection alone, thus limiting the overall effectiveness of MAPF-LNS in terms of solution cost. In this paper, we propose Adaptive Delay-based Destroy-and-Repair Enhanced with Success-based Self-Learning (ADDRESS) as a single-destroy-heuristic variant of MAPF-LNS. ADDRESS applies restricted Thompson Sampling to the top-K set of the most delayed agents to select a seed agent for adaptive LNS neighborhood generation. We evaluate ADDRESS in multiple maps from the MAPF benchmark set and demonstrate cost improvements by at least 50% in large-scale scenarios with up to a thousand agents, compared with the original MAPF-LNS and other state-of-the-art methods.
