Deep Learning for Data-Driven Districting-and-Routing
Arthur Ferraz, Cheikh Ahmed, Quentin Cappart, Thibaut Vidal
TL;DR
This work tackles data-driven districting-and-routing under stochastic demand by learning district-cost estimates with a Graph Neural Network and integrating them into an Iterated Local Search to rapidly generate high-quality, connected district partitions. The GNN uses BU-level features and district membership indicators to predict long-term district costs, enabling scalable optimization that outperforms continuous-approximation baselines. Across five UK metropolitan areas, the approach achieves an average economic gain of $10.12\%$ over baselines and reveals that district geometry beyond simple compactness critically influences routing efficiency. The results support practical decision-support by delivering fast, accurate cost estimates, strong solution quality, and insights into how learnable geometric features shape effective districting.
Abstract
Districting-and-routing is a strategic problem aiming to aggregate basic geographical units (e.g., zip codes) into delivery districts. Its goal is to minimize the expected long-term routing cost of performing deliveries in each district separately. Solving this stochastic problem poses critical challenges since repeatedly evaluating routing costs on a set of scenarios while searching for optimal districts takes considerable time. Consequently, solution approaches usually replace the true cost estimation with continuous cost approximation formulas extending Beardwood-Halton-Hammersley and Daganzo's work. These formulas commit errors that can be magnified during the optimization step. To reconcile speed and solution quality, we introduce a supervised learning and optimization methodology leveraging a graph neural network for delivery-cost estimation. This network is trained to imitate known costs generated on a limited subset of training districts. It is used within an iterated local search procedure to produce high-quality districting plans. Our computational experiments, conducted on five metropolitan areas in the United Kingdom, demonstrate that the graph neural network predicts long-term district cost operations more accurately, and that optimizing over this oracle permits large economic gains (10.12% on average) over baseline methods that use continuous approximation formulas or shallow neural networks. Finally, we observe that having compact districts alone does not guarantee high-quality solutions and that other learnable geometrical features of the districts play an essential role.
