Demand Selection for VRP with Emission Quota
Farid Najar, Dominique Barth, Yann Strozecki
TL;DR
This work tackles demand selection under an emission quota in VRP (QVRP) by introducing Maximum Feasible Vehicle Assignment (MFVA), a two-layer problem separating assignment and routing. The routing layer is solved with classical OR techniques, while the assignment layer is explored via greedy, dynamic programming, simulated annealing, and learning-based methods including RL and multi-agent approaches. Across synthetic and real-data experiments, classical OR-based methods, especially metaheuristics, consistently outperform learning-based approaches in static MFVA/QVRP settings, with RL and decentralized learners showing limited generalization or efficiency. The findings suggest that pure learning or end-to-end approaches may be unsuitable for static combinatorial optimization, though hybrid or dynamic variants could still benefit from learning in future work.
Abstract
Combinatorial optimization (CO) problems are traditionally addressed using Operations Research (OR) methods, including metaheuristics. In this study, we introduce a demand selection problem for the Vehicle Routing Problem (VRP) with an emission quota, referred to as QVRP. The objective is to minimize the number of omitted deliveries while respecting the pollution quota. We focus on the demand selection part, called Maximum Feasible Vehicle Assignment (MFVA), while the construction of a routing for the VRP instance is solved using classical OR methods. We propose several methods for selecting the packages to omit, both from machine learning (ML) and OR. Our results show that, in this static problem setting, classical OR-based methods consistently outperform ML-based approaches.
