Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

Guillem Rodríguez-Corominas; Maria J. Blesa; Christian Blum

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

Guillem Rodríguez-Corominas, Maria J. Blesa, Christian Blum

TL;DR

A hybrid approach, Construct, Merge, Solve&Adapt with Reinforcement Learning (RL-CMSA), for the symmetric single-depot min-max mTSP that consistently finds (near-)best solutions and outperforms a state-of-the-art hybrid genetic algorithm under comparable time limits, especially as instance size and the number of salesmen increase.

Abstract

The Multiple Traveling Salesman Problem (mTSP) extends the Traveling Salesman Problem to m tours that start and end at a common depot and jointly visit all customers exactly once. In the min-max variant, the objective is to minimize the longest tour, reflecting workload balance. We propose a hybrid approach, Construct, Merge, Solve & Adapt with Reinforcement Learning (RL-CMSA), for the symmetric single-depot min-max mTSP. The method iteratively constructs diverse solutions using probabilistic clustering guided by learned pairwise q-values, merges routes into a compact pool, solves a restricted set-covering MILP, and refines solutions via inter-route remove, shift, and swap moves. The q-values are updated by reinforcing city-pair co-occurrences in high-quality solutions, while the pool is adapted through ageing and pruning. This combination of exact optimization and reinforcement-guided construction balances exploration and exploitation. Computational results on random and TSPLIB instances show that RL-CMSA consistently finds (near-)best solutions and outperforms a state-of-the-art hybrid genetic algorithm under comparable time limits, especially as instance size and the number of salesmen increase.

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

TL;DR

Abstract

Paper Structure (17 sections, 10 equations, 3 figures, 4 tables, 1 algorithm)

This paper contains 17 sections, 10 equations, 3 figures, 4 tables, 1 algorithm.

Introduction
Related Work
The Proposed Algorithm
Construct
Merge
Solve
Improve
Remove
Shift: cross-route relocation (1-move)
Swap: cross-route exchange (1--1 swap)
Learn
Adapt
Experimental Evaluation
Benchmark Instances
Algorithm Tuning
...and 2 more sections

Figures (3)

Figure 1: RL-CMSA: Schematic Algorithm Overview
Figure 2: STN graphic regarding instance 17 with $n=200$ and $m=10\%$.
Figure 3: Structural distances between the 40 best-found solutions per run (and per algorithm) for the 20 problem instances with $n=200$ and $m=10\%$.

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

TL;DR

Abstract

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

Authors

TL;DR

Abstract

Table of Contents

Figures (3)