Multi-Objective Covariance Matrix Adaptation MAP-Annealing
Shihan Zhao, Stefanos Nikolaidis
TL;DR
MO-CMA-MAE introduces a CMA-ES-based MOQD algorithm that optimizes Hypervolume Improvement within thresholded cell fronts to jointly promote exploration of under-explored measure-space cells and improvement of discovered Pareto Sets. By coupling CMA-ES with a threshold-annealing mechanism, the method maintains a dynamic trade-off between exploring new cells and refining existing ones, achieving superior MOQD-scores and broader archive coverage in several domains, including Overcooked map generation. The approach outperforms baselines such as MOME, NSGA-II, SMS-EMOA, and COMO-CMA-ES on multiple tasks and demonstrates strong performance even in 3-objective variants, while highlighting the computational cost of exact hypervolume calculations as a limitation and a direction for future work.
Abstract
Quality-Diversity (QD) optimization is an emerging field that focuses on finding a set of behaviorally diverse and high-quality solutions. While the quality is typically defined w.r.t. a single objective function, recent work on Multi-Objective Quality-Diversity (MOQD) extends QD optimization to simultaneously optimize multiple objective functions. This opens up multi-objective applications for QD, such as generating a diverse set of game maps that maximize difficulty, realism, or other properties. Existing MOQD algorithms use non-adaptive methods such as mutation and crossover to search for non-dominated solutions and construct an archive of Pareto Sets (PS). However, recent work in QD has demonstrated enhanced performance through the use of covariance-based evolution strategies for adaptive solution search. We propose bringing this insight into the MOQD problem, and introduce MO-CMA-MAE, a new MOQD algorithm that leverages Covariance Matrix Adaptation-Evolution Strategies (CMA-ES) to optimize the hypervolume associated with every PS within the archive. We test MO-CMA-MAE on three MOQD domains, and for generating maps of a co-operative video game, showing significant improvements in performance.
