Improving LLM-based Global Optimization with Search Space Partitioning
Andrej Schwanke, Lyubomir Ivanov, David Salinas, Fabio Ferreira, Aaron Klein, Frank Hutter, Arber Zela
TL;DR
This work addresses the optimization of expensive blackbox functions with minimal prior knowledge by integrating large language models (LLMs) into a partitioned, hierarchical search framework. HOLLM adaptively partitions the input space using a KD-tree, assigns each region a bandit-inspired score that combines exploitation, geometry, and uncertainty terms, and uses an LLM to generate localized candidate points within the selected regions. Combining nonparametric bandit ideas with LLM-based sampling helps the method cope with high-dimensional, multimodal landscapes, and it performs on par with or better than leading Bayesian optimization and trust-region methods on synthetic tasks, hyperparameter tuning, and neural architecture search. The results indicate HOLLM's potential to improve LLM-driven optimization in practice. The authors acknowledge limitations: the lack of formal regret guarantees, dependence on LLM quality, and the computational cost of LLM inference. The method offers a potentially scalable path toward more reliable and efficient LLM-guided optimization in complex scientific and engineering domains.
Abstract
Large Language Models (LLMs) have recently emerged as effective surrogate models and candidate generators within global optimization frameworks for expensive blackbox functions. Despite promising results, LLM-based methods often struggle in high-dimensional search spaces or when lacking domain-specific priors, leading to sparse or uninformative suggestions. To overcome these limitations, we propose HOLLM, a novel global optimization algorithm that enhances LLM-driven sampling by partitioning the search space into promising subregions. Each subregion acts as a "meta-arm" selected via a bandit-inspired scoring mechanism that effectively balances exploration and exploitation. Within each selected subregion, an LLM then proposes high-quality candidate points, without any explicit domain knowledge. Empirical evaluation on standard optimization benchmarks shows that HOLLM consistently matches or surpasses leading Bayesian optimization and trust-region methods, while substantially outperforming global LLM-based sampling strategies.
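To make the partition, score, and sample loop concrete, the following is a minimal sketch and not the authors' implementation. It makes several assumptions that the text above does not specify: the objective is maximized, leaves are split at the median of one coordinate in round-robin order, the score is a simple additive combination with hypothetical weights `alpha` and `beta`, the highest-scoring leaf is chosen greedily, and a uniform sampler (`propose_in_region`) stands in for the LLM proposal step. All function names are illustrative.

```python
import math
import random


def box_volume(bounds):
    """Volume of an axis-aligned box given as a list of (low, high) pairs."""
    return math.prod(hi - lo for lo, hi in bounds)


def build_leaves(points, values, bounds, max_leaf_size=4, depth=0):
    """KD-tree style partition: split at the median point along one axis."""
    if len(points) <= max_leaf_size:
        return [{"bounds": bounds, "values": list(values)}]
    dim = depth % len(bounds)
    order = sorted(range(len(points)), key=lambda i: points[i][dim])
    mid = len(order) // 2
    split = points[order[mid]][dim]
    lo, hi = bounds[dim]
    left_bounds = bounds[:dim] + [(lo, split)] + bounds[dim + 1:]
    right_bounds = bounds[:dim] + [(split, hi)] + bounds[dim + 1:]
    leaves = []
    for idx, sub_bounds in ((order[:mid], left_bounds), (order[mid:], right_bounds)):
        leaves += build_leaves([points[i] for i in idx], [values[i] for i in idx],
                               sub_bounds, max_leaf_size, depth + 1)
    return leaves


def score(leaf, total_evals, total_volume, alpha=1.0, beta=1.0):
    """Assumed score: best value seen + relative region size + count-based bonus."""
    n = len(leaf["values"])
    exploit = max(leaf["values"]) if n else 0.0
    rel_volume = box_volume(leaf["bounds"]) / total_volume
    geometry = rel_volume ** (1.0 / len(leaf["bounds"]))
    uncertainty = math.sqrt(math.log(total_evals + 1) / (n + 1))
    return exploit + alpha * geometry + beta * uncertainty


def propose_in_region(bounds, k, rng):
    """Placeholder for the LLM: draw k uniform points inside the region."""
    return [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(k)]


def optimize(f, bounds, n_init=8, n_rounds=10, k=2, seed=0):
    """Repeatedly partition, pick the best-scoring leaf, and sample inside it."""
    rng = random.Random(seed)
    bounds = list(bounds)
    points = propose_in_region(bounds, n_init, rng)
    values = [f(p) for p in points]
    total_volume = box_volume(bounds)
    for _ in range(n_rounds):
        leaves = build_leaves(points, values, bounds)
        chosen = max(leaves, key=lambda leaf: score(leaf, len(points), total_volume))
        for p in propose_in_region(chosen["bounds"], k, rng):
            points.append(p)
            values.append(f(p))
    best = max(range(len(values)), key=values.__getitem__)
    return points[best], values[best], len(values)


def toy_objective(p):
    """Smooth 2-D test function with its maximum at (0.3, 0.7)."""
    return -((p[0] - 0.3) ** 2 + (p[1] - 0.7) ** 2)


best_point, best_value, n_evals = optimize(toy_objective, [(0.0, 1.0), (0.0, 1.0)])
```

In a real setup, `propose_in_region` would be replaced by a prompt that gives the LLM the region bounds and the evaluations observed so far and asks for new candidates inside those bounds. The tree is rebuilt from scratch each round here for simplicity.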
