Unsupervised Learning for Solving the Travelling Salesman Problem
Yimeng Min, Yiwei Bai, Carla P. Gomes
TL;DR
UTSP tackles the NP-hard Travelling Salesman Problem with an unsupervised framework that builds a heat map over edges using a Scattering Attention Graph Neural Network. A differentiable surrogate loss couples a short-path objective with a Hamiltonian Cycle constraint, enabling end-to-end learning without labeled solutions, followed by a non-autoregressive heat-map–guided local search. Empirically, UTSP matches or surpasses existing data-driven TSP heuristics, especially on large instances (up to $n=1000$), while using far fewer training samples and parameters, and significantly reducing the search space via a non-smooth heat map. The approach demonstrates that expressive GNNs and a carefully designed surrogate can yield strong, data-efficient combinatorial optimization heuristics with practical impact for scalable TSP solving.
Abstract
We propose UTSP, an unsupervised learning (UL) framework for solving the Travelling Salesman Problem (TSP). We train a Graph Neural Network (GNN) using a surrogate loss. The GNN outputs a heat map representing the probability for each edge to be part of the optimal path. We then apply local search to generate our final prediction based on the heat map. Our loss function consists of two parts: one pushes the model to find the shortest path and the other serves as a surrogate for the constraint that the route should form a Hamiltonian Cycle. Experimental results show that UTSP outperforms the existing data-driven TSP heuristics. Our approach is parameter efficient as well as data efficient: the model takes $\sim$ 10\% of the number of parameters and $\sim$ 0.2\% of training samples compared with reinforcement learning or supervised learning methods.
