DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing Problems
Zhi Zheng, Shunyu Yao, Zhenkun Wang, Xialiang Tong, Mingxuan Yuan, Ke Tang
TL;DR
This work tackles the min-max VRP objective of minimizing the longest route length by proposing Decoupling-Partition-Navigation (DPN), which separates partition and navigation through a bi-part Partition-and-Navigation encoder (P&N Encoder). It introduces an agent-permutation-symmetric (APS) loss to exploit permutation symmetry and a depot-aware Rotation-based positional encoding to improve partition representations. Empirical results on four min-max VRP variants, including single- and multi-depot settings, show that DPN consistently outperforms existing learning-based solvers and competes with traditional heuristics, with notable gains in mTSP and mPDP scenarios. The approach advances neural solvers for structured combinatorial problems by embedding problem-specific decoupling and symmetry into the learning process, and it opens avenues for extending decoupled representations to general VRPs in the future.
Abstract
The min-max vehicle routing problem (min-max VRP) traverses all given customers by assigning several routes and aims to minimize the length of the longest route. Recently, reinforcement learning (RL)-based sequential planning methods have exhibited advantages in solving efficiency and optimality. However, these methods fail to exploit the problem-specific properties in learning representations, resulting in less effective features for decoding optimal routes. This paper considers the sequential planning process of min-max VRPs as two coupled optimization tasks: customer partition for different routes and customer navigation in each route (i.e., partition and navigation). To effectively process min-max VRP instances, we present a novel attention-based Partition-and-Navigation encoder (P&N Encoder) that learns distinct embeddings for partition and navigation. Furthermore, we utilize an inherent symmetry in decoding routes and develop an effective agent-permutation-symmetric (APS) loss function. Experimental results demonstrate that the proposed Decoupling-Partition-Navigation (DPN) method significantly surpasses existing learning-based methods in both single-depot and multi-depot min-max VRPs. Our code is available at
