Multi-Objective Multi-Agent Path Finding with Lexicographic Cost Preferences
Pulkit Rustagi, Kyle Hollins Wray, Sandhya Saisubramanian
TL;DR
The paper addresses MO-MAPF under user-defined objective preferences by introducing a lexicographic formulation that orders objectives strictly and seeks a lexicographically minimal joint cost. It presents Lexicographic Conflict-Based Search (LCBS), a two-level CBS framework where the low-level LA$^*$ performs lexicographic optimization and the high-level CBS resolves conflicts via constraint branching, thereby avoiding Pareto-front construction. The authors prove LCBS yields lexicographically optimal solutions on the Pareto frontier and demonstrate linear-in-d to runtime scaling, achieving robust performance up to $d=10$ objectives and large agent counts. Empirical results on standard and randomized MAPF benchmarks show consistently higher success rates for LCBS compared to state-of-the-art baselines, highlighting its scalability and practical impact for real-world, preference-driven multi-agent planning.
Abstract
Many real-world scenarios require multiple agents to coordinate in shared environments, while balancing trade-offs between multiple, potentially competing objectives. Current multi-objective multi-agent path finding (MO-MAPF) algorithms typically produce conflict-free plans by computing Pareto frontiers. They do not explicitly optimize for user-defined preferences, even when the preferences are available, and scale poorly with the number of objectives. We propose a lexicographic framework for modeling MO-MAPF, along with an algorithm \textit{Lexicographic Conflict-Based Search} (LCBS) that directly computes a single solution aligned with a lexicographic preference over objectives. LCBS integrates a priority-aware low-level $A^*$ search with conflict-based search, avoiding Pareto frontier construction and enabling efficient planning guided by preference over objectives. We provide insights into optimality and scalability, and empirically demonstrate that LCBS computes optimal solutions while scaling to instances with up to ten objectives -- far beyond the limits of existing MO-MAPF methods. Evaluations on standard and randomized MAPF benchmarks show consistently higher success rates against state-of-the-art baselines, especially with increasing number of objectives.
