Autonomous Multi-Objective Optimization Using Large Language Model
Yuxiao Huang, Shenghao Wu, Wenjie Zhang, Jibin Wu, Liang Feng, Kay Chen Tan
TL;DR
This work tackles the challenge of designing effective evolutionary operators for multi-objective optimization (MOPs) without expert intervention. It introduces LLMOPT, a framework that autonomously generates, tests, and refines EA operators via Operator Initialization, Dynamic Operator Selection, Operator Crossover, Operator Mutation, Pilot Run & Repair, and Parallel Operator Evaluation. The approach is evaluated on continuous and combinatorial MOPs—CMOPs, MOKPs, and MOTSPs—using multiple baselines and several LLMs, showing competitive or superior performance and highlighting robustness through error-driven dialogue. The results indicate that automated, textually guided operator design can accelerate the development of adaptive optimization strategies, with implications for scalable, autonomous optimization toolchains.
Abstract
Multi-objective optimization problems (MOPs) are ubiquitous in real-world applications, presenting a complex challenge of balancing multiple conflicting objectives. Traditional evolutionary algorithms (EAs), though effective, often rely on domain-specific expertise and iterative fine-tuning, hindering adaptability to unseen MOPs. In recent years, the advent of Large Language Models (LLMs) has revolutionized software engineering by enabling the autonomous generation and refinement of programs. Leveraging this breakthrough, we propose a new LLM-based framework that autonomously designs EA operators for solving MOPs. The proposed framework includes a robust testing module to refine the generated EA operator through error-driven dialogue with LLMs, a dynamic selection strategy along with informative prompting-based crossover and mutation to fit textual optimization pipeline. Our approach facilitates the design of EA operators without the extensive demands for expert intervention, thereby speeding up the innovation of EA operators. Empirical studies across various MOP categories validate the robustness and superior performance of our proposed framework.
