Multi-Agent Route Planning as a QUBO Problem
Renáta Rusnáková, Martin Chovanec, Juraj Gazda
TL;DR
This work formalizes Multi-Agent Route Planning (MaRP) as selecting predefined vehicle routes to maximize unique network coverage while penalizing overlaps, and proves NP-hardness via a Weighted Set Packing reduction. It develops a QUBO formulation with coefficients that encode $u_i$ (unique coverage) and $c_{ij}$ (pairwise overlap), and distinguishes soft ($\lambda_{soft}$) and hard ($\lambda_{hard}$) penalty regimes to explore the coverage–overlap trade-off. A reproducible pipeline using OpenStreetMap networks and Valhalla routes demonstrates large-scale evaluation on Barcelona instances (up to $10^4$ vehicles) with solvers including Gurobi, simulated annealing, and D-Wave hybrid annealing; results show a pronounced coverage–overlap knee and that Pareto-optimal solutions largely occur under hard penalties, with hybrid quantum and classical solvers delivering essentially identical objective values at scale. The approach supports fleet sizing and scenario analysis as a practical planning tool, while acknowledging simplifications such as fixed routes and lack of temporal or capacity constraints, pointing to avenues for richer, time-aware extensions.
Abstract
Multi-Agent Route Planning considers selecting vehicles, each associated with a single predefined route, such that the spatial coverage of a road network is increased while redundant overlaps are limited. This paper gives a formal problem definition, proves NP-hardness by reduction from the Weighted Set Packing problem, and derives a Quadratic Unconstrained Binary Optimization formulation whose coefficients directly encode unique coverage rewards and pairwise overlap penalties. A single penalty parameter controls the coverage-overlap trade-off. We distinguish between a soft regime, which supports multi-objective exploration, and a hard regime, in which the penalty is strong enough to effectively enforce near-disjoint routes. We describe a practical pipeline for generating city instances, constructing candidate routes, building the QUBO matrix, and solving it with an exact mixed-integer solver (Gurobi), simulated annealing, and D-Wave hybrid quantum annealing. Experiments on Barcelona instances with up to 10 000 vehicles reveal a clear coverage-overlap knee and show that Pareto-optimal solutions are mainly obtained under the hard-penalty regime, while D-Wave hybrid solvers and Gurobi achieve essentially identical objective values with only minor differences in runtime as problem size grows.
