Illuminating the Diversity-Fitness Trade-Off in Black-Box Optimization
Maria Laura Santoni, Elena Raponi, Aneta Neumann, Frank Neumann, Mike Preuss, Carola Doerr
TL;DR
This work reframes black-box optimization as selecting a batch of $k$ solutions with $d_{ ext{min}}$-level diversity while maximizing average fitness, rather than returning a single optimum. It investigates the utility of off-the-shelf trajectories (random, Sobol', CMA-ES, MMO methods) by extracting high-quality diverse batches via a greedy subset selection from a portfolio of $T$ evaluated points, benchmarking on the 24 BBOB functions across $D=2,5,10$. Key findings show that a naive uniform random baseline is surprisingly strong, that greedy extraction closely approximates optimal subsets, and that MMO and CMA-ES offer benefits mainly at small $d_{ ext{min}}$ while random/Sobol can outperform them as diversity constraints tighten; these results motivate developing algorithms that actively enforce diversity without sacrificing quality. The study provides a diagnostic framework for understanding the trade-off between diversity and fitness and highlights the need for diversity-aware optimization methods in real-world design problems where post-hoc qualitative criteria matter. Overall, the work underlines the practical value of generating diverse, high-quality solution batches and points to future directions for integrating diversity mechanisms into global optimizers.
Abstract
In real-world applications, users often favor structurally diverse design choices over one high-quality solution. It is hence important to consider more solutions that decision makers can compare and further explore based on additional criteria. Alongside the existing approaches of evolutionary diversity optimization, quality diversity, and multimodal optimization, this paper presents a fresh perspective on this challenge by considering the problem of identifying a fixed number of solutions with a pairwise distance above a specified threshold while maximizing their average quality. We obtain first insight into these objectives by performing a subset selection on the search trajectories of different well-established search heuristics, whether they have been specifically designed with diversity in mind or not. We emphasize that the main goal of our work is not to present a new algorithm but to understand the capability of off-the-shelf algorithms to quantify the trade-off between the minimum pairwise distance within batches of solutions and their average quality. We also analyze how this trade-off depends on the properties of the underlying optimization problem. A possibly surprising outcome of our empirical study is the observation that naive uniform random sampling establishes a very strong baseline for our problem, hardly ever outperformed by the search trajectories of the considered heuristics. We interpret these results as a motivation to develop algorithms tailored to produce diverse solutions of high average quality.
