Optimal Task Assignment and Path Planning using Conflict-Based Search with Precedence and Temporal Constraints
Yu Quan Chong, Jiaoyang Li, Katia Sycara
TL;DR
The paper addresses TAPF-PTC, a problem that unifies task assignment, path planning, and user-defined objective under precedence and temporal constraints. It proposes CBS-TA-PTC, a Conflict-Based Search extension that on-demand assigns tasks and computes collision-free paths, guided by an RL-based oracle to maximize return. The approach is theoretically complete and optimal under a single-subtask assumption and introduces a bounded-suboptimal variant for scalability; empirically, it outperforms MAPPO-based MARL and adapted TAPF methods on a complex bomb-defusing domain. The work demonstrates practical gains in solving tightly constrained multi-agent tasks, highlighting potential for integration with learning-based methods to improve efficiency and applicability in real-world scenarios.
Abstract
The Multi-Agent Path Finding (MAPF) problem entails finding collision-free paths for a set of agents, guiding them from their start to goal locations. However, MAPF does not account for several practical task-related constraints. For example, agents may need to perform actions at goal locations with specific execution times, adhering to predetermined orders and timeframes. Moreover, goal assignments may not be predefined for agents, and the optimization objective may lack an explicit definition. To incorporate task assignment, path planning, and a user-defined objective into a coherent framework, this paper examines the Task Assignment and Path Finding with Precedence and Temporal Constraints (TAPF-PTC) problem. We augment Conflict-Based Search (CBS) to simultaneously generate task assignments and collision-free paths that adhere to precedence and temporal constraints, maximizing an objective quantified by the return from a user-defined reward function in reinforcement learning (RL). Experimentally, we demonstrate that our algorithm, CBS-TA-PTC, can solve highly challenging bomb-defusing tasks with precedence and temporal constraints efficiently relative to MARL and adapted Target Assignment and Path Finding (TAPF) methods.
