Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design
Zhi Zheng, Zhuoliang Xie, Zhenkun Wang, Bryan Hooi
TL;DR
This paper addresses a limitation of LLM-based automatic heuristic design (AHD) for complex optimization tasks: existing methods struggle to produce high-quality heuristics because they discard underperforming candidates too early. It introduces MCTS-AHD, a Monte Carlo Tree Search-based framework that preserves all generated heuristics and uses tree-structured exploration with progressive widening to better develop underperforming heuristics and avoid local optima. In experiments on NP-hard combinatorial optimization (CO) problems and cost-aware Bayesian optimization, MCTS-AHD yields higher-quality heuristics than handcrafted heuristics and prior LLM-based AHD methods across multiple solving frameworks. The approach is framework-agnostic, broadens the space of heuristics explored, and applies beyond traditional CO problems.
Abstract
Handcrafting heuristics for solving complex optimization tasks (e.g., route planning and task allocation) is a common practice but requires extensive domain knowledge. Recently, Large Language Model (LLM)-based automatic heuristic design (AHD) methods have shown promise in generating high-quality heuristics without manual intervention. Existing LLM-based AHD methods maintain a fixed-size population of the top-performing LLM-generated heuristics and apply evolutionary computation (EC) to improve the population iteratively. However, these population-based procedures cannot fully develop the potential of each heuristic and are prone to converging to local optima. To explore the space of heuristics more comprehensively, this paper proposes using Monte Carlo Tree Search (MCTS) for LLM-based heuristic evolution. The proposed MCTS-AHD method organizes all LLM-generated heuristics in a tree structure and can better develop the potential of temporarily underperforming heuristics. In experiments, MCTS-AHD delivers significantly higher-quality heuristics on various complex tasks. Our code is available.
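To make the tree-based idea concrete, the following is a minimal sketch of a generic MCTS loop with a UCT-style selection rule and progressive widening, in which every generated candidate is kept as a tree node rather than being dropped from a fixed-size population. It is not the authors' implementation. The LLM call is replaced by a hypothetical stub (`propose_variant`), a heuristic is represented by a single number, and `evaluate`, the widening constants `c` and `alpha`, and the exploration weight are all illustrative assumptions.

```python
import math
import random
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    heuristic: float                  # stand-in for LLM-generated heuristic code
    score: float                      # evaluation result (higher is better)
    parent: Optional["Node"] = None
    children: List["Node"] = field(default_factory=list)
    visits: int = 0
    best: float = float("-inf")       # best score seen in this subtree


def evaluate(heuristic: float) -> float:
    # Hypothetical objective: a local optimum near 1 and a better one near 4.
    return math.exp(-(heuristic - 1.0) ** 2) + 2.0 * math.exp(-(heuristic - 4.0) ** 2)


def propose_variant(parent: Node, rng: random.Random) -> float:
    # Hypothetical stand-in for "ask the LLM to modify the parent heuristic".
    return parent.heuristic + rng.gauss(0.0, 1.0)


def may_widen(node: Node, c: float = 1.0, alpha: float = 0.5) -> bool:
    # Progressive widening: a node may receive a new child only while
    # its child count is below ceil(c * (visits + 1) ** alpha).
    return len(node.children) < math.ceil(c * (node.visits + 1) ** alpha)


def uct(child: Node, parent: Node, explore: float = 0.5) -> float:
    # Exploitation (best score in the subtree) plus an exploration bonus
    # that favors rarely visited children.
    bonus = math.sqrt(math.log(parent.visits + 1) / (child.visits + 1))
    return child.best + explore * bonus


def search(iterations: int, seed: int = 0) -> Node:
    rng = random.Random(seed)
    root = Node(heuristic=0.0, score=evaluate(0.0))
    root.best = root.score
    for _ in range(iterations):
        node = root
        # Selection: descend while the current node is not allowed to widen.
        while node.children and not may_widen(node):
            node = max(node.children, key=lambda ch: uct(ch, node))
        # Expansion: the new candidate is always stored, whatever its score.
        h = propose_variant(node, rng)
        child = Node(heuristic=h, score=evaluate(h), parent=node)
        child.best = child.score
        node.children.append(child)
        # Backpropagation: update visit counts and subtree bests up to the root.
        cur: Optional[Node] = child
        while cur is not None:
            cur.visits += 1
            cur.best = max(cur.best, child.score)
            cur = cur.parent
    return root


def all_nodes(root: Node) -> List[Node]:
    # Collect every node in the tree (iterative depth-first traversal).
    stack, out = [root], []
    while stack:
        n = stack.pop()
        out.append(n)
        stack.extend(n.children)
    return out
```

Calling `search(50)` and then `all_nodes(root)` returns the root plus one node per iteration, which illustrates the contrast with a population-based scheme: a weak candidate stays in the tree and can still be selected and refined later through the exploration bonus.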
