Learning to Reduce Search Space for Generalizable Neural Routing Solver
Changliang Zhou, Xi Lin, Zhenkun Wang, Qingfu Zhang
TL;DR
This work introduces Learning to Reduce (L2R), a reinforcement learning based framework that adaptively reduces the search space in constructive neural routing solvers. By combining static graph pruning with a lightweight embedding and attention based reduction plus a local solution constructor, L2R achieves scalable generalization to large scale TSP and CVRP instances while maintaining solution quality. Extensive experiments show strong cross size and cross distribution generalization, with state of the art performance among neural solvers on up to 1 million nodes, and practical speed advantages over classical solvers on large instances. The approach addresses fundamental limitations of distance based SSR by leveraging learned patterns to select promising candidate nodes, enabling robust real world routing applications.
Abstract
Constructive neural combinatorial optimization (NCO) has attracted growing research attention due to its ability to solve complex routing problems without relying on handcrafted rules. However, existing NCO methods face significant challenges in generalizing to large-scale problems due to high computational complexity and inefficient capture of structural patterns. To address this issue, we propose a novel learning-based search space reduction method that adaptively selects a small set of promising candidate nodes at each step of the constructive NCO process. Unlike traditional methods that rely on fixed heuristics, our selection model dynamically prioritizes nodes based on learned patterns, significantly reducing the search space while maintaining solution quality. Experimental results demonstrate that our method, trained solely on 100-node instances from uniform distribution, generalizes remarkably well to large-scale Traveling Salesman Problem (TSP) and Capacitated Vehicle Routing Problem (CVRP) instances with up to 1 million nodes from the uniform distribution and over 80K nodes from other distributions.
