Enhancing the Cross-Size Generalization for Solving Vehicle Routing Problems via Continual Learning
Jingwen Li, Zhiguang Cao, Yaoxin Wu, Tang Liu
TL;DR
The paper tackles cross-size generalization in VRPs by proposing a model-agnostic continual learning framework that trains on ascending problem sizes using experience replay and two exemplar-based regularization schemes (inter-task and intra-task) to preserve past knowledge while adapting to larger instances. The approach improves cross-size generalization for autoregressive VRP solvers and outperforms state-of-the-art baselines designed for generalizability, with ablations confirming the synergy of components. Its model-agnostic nature preserves inference efficiency, making it practical for real-world VRP applications. The work offers a solid pathway to scalable, generalizable learning-based VRP solvers and points to large-scale VRP extensions and cross-distribution generalization as future directions.
Abstract
Exploring machine learning techniques for addressing vehicle routing problems has attracted considerable research attention. To achieve decent and efficient solutions, existing deep models for vehicle routing problems are typically trained and evaluated using instances of a single size. This substantially limits their ability to generalize across different problem sizes and thus hampers their practical applicability. To address the issue, we propose a continual learning based framework that sequentially trains a deep model with instances of ascending problem sizes. Specifically, on the one hand, we design an inter-task regularization scheme to retain the knowledge acquired from smaller problem sizes in the model training on a larger size. On the other hand, we introduce an intra-task regularization scheme to consolidate the model by imitating the latest desirable behaviors during training on each size. Additionally, we exploit the experience replay to revisit instances of formerly trained sizes for mitigating the catastrophic forgetting. Experimental results show that our approach achieves predominantly superior performance across various problem sizes (either seen or unseen in the training), as compared to state-of-the-art deep models including the ones specialized for generalizability enhancement. Meanwhile, the ablation studies on the key designs manifest their synergistic effect in the proposed framework.
