Improved Runtime Analysis of a Multi-Valued Compact Genetic Algorithm on Two Generalized OneMax Problems
Sumit Adak, Carsten Witt
TL;DR
This work extends runtime analysis of univariate EDAs to multi-valued settings by addressing the $r$-cGA on generalized OneMax problems. It proves a first theoretical bound of $O(n r^3 \log^2 n \log r)$ for G-OneMax and improves the $r$-OneMax bound to $O(n r \log n \log r)$, while incorporating frequency borders into the analysis. The authors develop a negative-drift framework to control genetic drift and employ additive drift with tail bounds across frequency-phase transitions, both with and without borders, and validate findings with experimentation. These results broaden the understanding of EDAs on multi-valued spaces and suggest potential for further runtime improvements in complex problem settings.
Abstract
Recent research in the runtime analysis of estimation of distribution algorithms (EDAs) has focused on univariate EDAs for multi-valued decision variables. In particular, the runtime of the multi-valued cGA (r-cGA) and UMDA on multi-valued functions has been a significant area of study. Adak and Witt (PPSN 2024) and Hamano et al. (ECJ 2024) independently performed a first runtime analysis of the r-cGA on the r-valued OneMax function (r-OneMax). Adak and Witt also introduced a different r-valued OneMax function called G-OneMax. However, for that function, only empirical results were provided so far due to the increased complexity of its runtime analysis, since r-OneMax involves categorical values of two types only, while G-OneMax encompasses all possible values. In this paper, we present the first theoretical runtime analysis of the r-cGA on the G-OneMax function. We demonstrate that the runtime is O(nr^3 log^2 n log r) with high probability. Additionally, we refine the previously established runtime analysis of the r-cGA on r-OneMax, improving the previous bound to O(nr log n log r), which improves the state of the art by an asymptotic factor of log n and is tight for the binary case. Moreover, we for the first time include the case of frequency borders.
