INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer
Han Fang, Zhihao Song, Paul Weng, Yutong Ban
TL;DR
This work targets generalization gaps in neural routing solvers by diagnosing embedding aliasing and interference from irrelevant nodes. It introduces INViT, an invariant nested view Transformer that processes multiple localized views around the last visited node and uses a multi-view decoder to predict the next step, with a REINFORCE-based training regime augmented by rotations, reflections, and normalization. Empirical results on MSVDRP and public datasets show INViT achieves superior cross-size and cross-distribution generalization for TSP and CVRP, with ablations validating the roles of graph sparsification, invariance, and nested views. The approach offers fast, scalable inference and strong practical impact for large-scale routing tasks in logistics and related fields.
Abstract
Recently, deep reinforcement learning has shown promising results for learning fast heuristics to solve routing problems. Meanwhile, most of the solvers suffer from generalizing to an unseen distribution or distributions with different scales. To address this issue, we propose a novel architecture, called Invariant Nested View Transformer (INViT), which is designed to enforce a nested design together with invariant views inside the encoders to promote the generalizability of the learned solver. It applies a modified policy gradient algorithm enhanced with data augmentations. We demonstrate that the proposed INViT achieves a dominant generalization performance on both TSP and CVRP problems with various distributions and different problem scales.
