Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization
Fu Luo, Xi Lin, Fei Liu, Qingfu Zhang, Zhenkun Wang
TL;DR
This paper tackles the challenge of generalizing neural combinatorial optimization to large-scale problems by introducing LEHD, a Light Encoder and Heavy Decoder architecture that updates node relationships at every construction step. To enable practical training, it adopts a data-efficient supervised scheme that learns to reconstruct partial solutions, supplemented by a flexible Random Re-Construct (RRC) mechanism for online improvement. Experiments on TSP and CVRP show LEHD achieves strong generalization up to 1000 nodes and competitive results on TSPLib/CVRPLib, often surpassing other purely learning-based methods and approaching classic solvers with targeted inference budgets. The results suggest that dynamic, scale-aware decoding and partial-solution training can significantly boost the practicality of learning-based NCO for real-world large-scale problems.
Abstract
Neural combinatorial optimization (NCO) is a promising learning-based approach for solving challenging combinatorial optimization problems without specialized algorithm design by experts. However, most constructive NCO methods cannot solve problems with large-scale instance sizes, which significantly diminishes their usefulness for real-world applications. In this work, we propose a novel Light Encoder and Heavy Decoder (LEHD) model with a strong generalization ability to address this critical issue. The LEHD model can learn to dynamically capture the relationships between all available nodes of varying sizes, which is beneficial for model generalization to problems of various scales. Moreover, we develop a data-efficient training scheme and a flexible solution construction mechanism for the proposed LEHD model. By training on small-scale problem instances, the LEHD model can generate nearly optimal solutions for the Travelling Salesman Problem (TSP) and the Capacitated Vehicle Routing Problem (CVRP) with up to 1000 nodes, and also generalizes well to solve real-world TSPLib and CVRPLib problems. These results confirm our proposed LEHD model can significantly improve the state-of-the-art performance for constructive NCO. The code is available at https://github.com/CIAM-Group/NCO_code/tree/main/single_objective/LEHD.
