Soft Quality-Diversity Optimization
Saeed Hedayatian, Stefanos Nikolaidis
TL;DR
The paper reframes Quality-Diversity (QD) by introducing Soft QD, a discretization-free objective that uses a Gaussian illumination model over the continuous behavior space. It derives SQUAD, a differentiable algorithm that optimizes a lower bound of the Soft QD Score, balancing high quality with spread in behavior space through pairwise repulsion terms. The approach is analyzed theoretically (monotonicity, submodularity, and limiting connections to traditional QD) and validated empirically across high-dimensional benchmarks, showing strong scalability and competitive performance against state-of-the-art baselines. It also demonstrates a practical mechanism to control the quality-diversity trade-off via a kernel bandwidth and discusses efficiency techniques (batches, nearest neighbors) and bounded-space handling via logit transforms. Overall, Soft QD broadens the applicability of QD to differentiable, high-dimensional domains and provides a scalable, tuneable framework for producing diverse, high-quality solution sets.
Abstract
Quality-Diversity (QD) algorithms constitute a branch of optimization that is concerned with discovering a diverse and high-quality set of solutions to an optimization problem. Current QD methods commonly maintain diversity by dividing the behavior space into discrete regions, ensuring that solutions are distributed across different parts of the space. The QD problem is then solved by searching for the best solution in each region. This approach to QD optimization poses challenges in large solution spaces, where storing many solutions is impractical, and in high-dimensional behavior spaces, where discretization becomes ineffective due to the curse of dimensionality. We present an alternative framing of the QD problem, called \emph{Soft QD}, that sidesteps the need for discretizations. We validate this formulation by demonstrating its desirable properties, such as monotonicity, and by relating its limiting behavior to the widely used QD Score metric. Furthermore, we leverage it to derive a novel differentiable QD algorithm, \emph{Soft QD Using Approximated Diversity (SQUAD)}, and demonstrate empirically that it is competitive with current state of the art methods on standard benchmarks while offering better scalability to higher dimensional problems.
