Evolve Cost-aware Acquisition Functions Using Large Language Models
Yiming Yao, Fei Liu, Ji Cheng, Qingfu Zhang
TL;DR
This work tackles expensive optimization with heterogeneous evaluation costs by introducing EvolCAF, an automated framework that designs cost-aware acquisition functions (AFs) via large language models (LLMs) within an evolutionary computation loop. By evolving explicit AFs in the algorithmic space, EvolCAF yields interpretable, high-performing strategies that leverage historical data, GP surrogates, and budget information, outperforming hand-crafted AFs like EIpu and EI-cool on 12 synthetic problems and 3 real-world hyperparameter-tuning tasks. The approach combines prompt-driven initialization, crossover, and mutation to generate elite AFs whose best instance comprises three components: a modified EI term with historical-uncertainty awareness, a budget-aware term, and a distance-based exploration modifier, all expressed explicitly. The results demonstrate strong generalization and efficiency, suggesting broad applicability to cost-aware BO settings and potential extensions to high-dimensional, batch, and multi-objective scenarios, with practical impact in domains requiring budget-conscious optimization.
Abstract
Many real-world optimization scenarios involve expensive evaluation with unknown and heterogeneous costs. Cost-aware Bayesian optimization stands out as a prominent solution in addressing these challenges. To approach the global optimum within a limited budget in a cost-efficient manner, the design of cost-aware acquisition functions (AFs) becomes a crucial step. However, traditional manual design paradigm typically requires extensive domain knowledge and involves a labor-intensive trial-and-error process. This paper introduces EvolCAF, a novel framework that integrates large language models (LLMs) with evolutionary computation (EC) to automatically design cost-aware AFs. Leveraging the crossover and mutation in the algorithmic space, EvolCAF offers a novel design paradigm, significantly reduces the reliance on domain expertise and model training. The designed cost-aware AF maximizes the utilization of available information from historical data, surrogate models and budget details. It introduces novel ideas not previously explored in the existing literature on acquisition function design, allowing for clear interpretations to provide insights into its behavior and decision-making process. In comparison to the well-known EIpu and EI-cool methods designed by human experts, our approach showcases remarkable efficiency and generalization across various tasks, including 12 synthetic problems and 3 real-world hyperparameter tuning test sets.
