HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges
Xianliang Yang, Ling Zhang, Haolong Qian, Lei Song, Jiang Bian
TL;DR
HeurAgenix addresses the adaptability gap in combinatorial optimization by introducing a two-stage, LLM-driven framework that first evolves a diverse pool of heuristics and then adaptively selects among them at test time. The evolution phase uses contrastive analysis and an LLM to extract reusable improvement strategies, while the problem-solving phase employs a lightweight, fine-tunable selector with a dual-reward scheme (POR and CPR) and test-time scaling to robustly pick heuristics under noisy supervision. Empirical results across five canonical CO benchmarks show that HeurAgenix outperforms existing LLM-based hyper-heuristics and matches or exceeds specialized solvers, with a GitHub implementation provided. The work contributes a fully end-to-end, data-driven workflow for autonomous heuristic design and adaptive selection, highlighting practical impact for scalable CO solving in diverse domains.
Abstract
Heuristic algorithms play a vital role in solving combinatorial optimization (CO) problems, yet traditional designs depend heavily on manual expertise and struggle to generalize across diverse instances. We introduce \textbf{HeurAgenix}, a two-stage hyper-heuristic framework powered by large language models (LLMs) that first evolves heuristics and then selects among them automatically. In the heuristic evolution phase, HeurAgenix leverages an LLM to compare seed heuristic solutions with higher-quality solutions and extract reusable evolution strategies. During problem solving, it dynamically picks the most promising heuristic for each problem state, guided by the LLM's perception ability. For flexibility, this selector can be either a state-of-the-art LLM or a fine-tuned lightweight model with lower inference cost. To mitigate the scarcity of reliable supervision caused by CO complexity, we fine-tune the lightweight heuristic selector with a dual-reward mechanism that jointly exploits singals from selection preferences and state perception, enabling robust selection under noisy annotations. Extensive experiments on canonical benchmarks show that HeurAgenix not only outperforms existing LLM-based hyper-heuristics but also matches or exceeds specialized solvers. Code is available at https://github.com/microsoft/HeurAgenix.
