Faster Optimization Through Genetic Drift
Cella Florescu, Marc Kaufmann, Johannes Lengler, Ulysse Schaller
TL;DR
This paper analyzes the compact Genetic Algorithm (cGA) with hypothetical population size $K$ on Dynamic BinVal (DynBV), a hard dynamic linear benchmark, contrasting its performance with OneMax. It establishes a clear trade-off: the conservative regime (large step sizes) suffers a substantial slowdown on DynBV, yielding a lower bound of $oxed{ ext{Ω}(K ext{min}\{K,n ight floor)}$ iterations and, for $K= ext{ω}(n)$, a runtime of $oxed{ ext{ω}(n^2)}$, while an aggressive regime (small $K$) preserves a quasi-linear runtime, with $oxed{Oig(n ext{polylog}(n)ig)}$ on DynBV and a conjectured $oxed{O(n ext{log} n)}$ under standard boundaries. The analysis employs drift theorems, Chernoff bounds, and hypergeometric tails to characterize signal steps, sampling variance, and frequency dynamics, and is complemented by simulations that confirm the phase transition and practical speedups for small $K$. The results highlight that embracing genetic drift can dramatically accelerate optimization on hard dynamic functions, informing parameter choices for EDAs in drift-prone settings. Overall, the paper provides tight lower bounds for the conservative regime, a conservative upper bound with polynomial $K$, and nontrivial upper bounds for the aggressive regime, supported by empirical validation.
Abstract
The compact Genetic Algorithm (cGA), parameterized by its hypothetical population size $K$, offers a low-memory alternative to evolving a large offspring population of solutions. It evolves a probability distribution, biasing it towards promising samples. For the classical benchmark OneMax, the cGA has to two different modes of operation: a conservative one with small step sizes $Θ(1/(\sqrt{n}\log n))$, which is slow but prevents genetic drift, and an aggressive one with large step sizes $Θ(1/\log n)$, in which genetic drift leads to wrong decisions, but those are corrected efficiently. On OneMax, an easy hill-climbing problem, both modes lead to optimization times of $Θ(n\log n)$ and are thus equally efficient. In this paper we study how both regimes change when we replace OneMax by the harder hill-climbing problem DynamicBinVal. It turns out that the aggressive mode is not affected and still yields quasi-linear runtime $O(n\cdot polylog (n))$. However, the conservative mode becomes substantially slower, yielding a runtime of $Ω(n^2)$, since genetic drift can only be avoided with smaller step sizes of $O(1/n)$. We complement our theoretical results with simulations.
