Generalizable Heuristic Generation Through Large Language Models with Meta-Optimization
Yiding Shi, Jianan Zhou, Wen Song, Jieyi Bi, Yaoxin Wu, Jie Zhang
TL;DR
This paper tackles generalization and exploration limits in LLM-driven heuristic design for combinatorial optimization by introducing MoH, a two-level meta-optimization framework. An outer loop uses an LLM-based meta-optimizer to construct diverse optimizers, while an inner loop employs these optimizers to evolve task-specific heuristics across multiple COP tasks within a multi-task setting. Empirical results on TSP and online BPP show that MoH-minted meta-optimizers yield state-of-the-art performance and strong cross-size generalization, often discovering interpretable optimization strategies (EC, ACO, PSO, SA, Tabu) beyond fixed workflows. The work demonstrates the potential of LLM-driven optimizer design to broaden the search space and reduce reliance on hand-crafted EC pipelines, albeit with higher computational cost and scope for efficiency improvements.
Abstract
Heuristic design with large language models (LLMs) has emerged as a promising approach for tackling combinatorial optimization problems (COPs). However, existing approaches often rely on manually predefined evolutionary computation (EC) optimizers and single-task training schemes, which may constrain the exploration of diverse heuristic algorithms and hinder the generalization of the resulting heuristics. To address these issues, we propose Meta-Optimization of Heuristics (MoH), a novel framework that operates at the optimizer level, discovering effective optimizers through the principle of meta-learning. Specifically, MoH leverages LLMs to iteratively refine a meta-optimizer that autonomously constructs diverse optimizers through (self-)invocation, thereby eliminating the reliance on a predefined EC optimizer. These constructed optimizers subsequently evolve heuristics for downstream tasks, enabling broader heuristic exploration. Moreover, MoH employs a multi-task training scheme to promote its generalization capability. Experiments on classic COPs demonstrate that MoH constructs an effective and interpretable meta-optimizer, achieving state-of-the-art performance across various downstream tasks, particularly in cross-size settings.
