Abnormal Mutations: Evolution Strategies Don't Require Gaussianity
Jacob de Nobel, Diederick Vermetten, Hao Wang, Anna V. Kononova, Günter Rudolph, Thomas Bäck
TL;DR
Problem: Gaussian mutation has long been presumed essential for Evolution Strategies (ES). Approach: the paper empirically benchmarks multiple ES variants ((1+1)-ES, $\mu/\mu$, $\lambda$-SA-ES, CMA-ES) with alternative mutation distributions on the sphere model and 24 BBOB problems, and analyzes effective step length $||\boldsymbol{z}||_2$ and angle isotropy $\theta$; path-length normalization is adapted for non-Gaussian cases. Findings: performance is largely distribution-agnostic; local convergence slows with the Cauchy distribution in non-elitist setups but can aid in multimodal/neutral landscapes; CMA-ES and elitist variants remain robust under non-Gaussian mutations; isotropy differences largely dissipate when step sizes are matched. Significance: Gaussianity is not a strict prerequisite for ES, enabling flexible mutation design, potential efficiency gains, and avenues for dynamic switching or constraint-aware mutation strategies in practical optimization.
Abstract
The mutation process in evolution strategies has been interlinked with the normal distribution since its inception. Many lines of reasoning have been given for this strong dependency, ranging from maximum entropy arguments to the need for isotropy. However, some theoretical results suggest that other distributions might lead to similar local convergence properties. This paper empirically shows that a wide range of evolutionary strategies, from the (1+1)-ES to CMA-ES, show comparable optimization performance when using a mutation distribution other than the standard Gaussian. Replacing it with, e.g., uniformly distributed mutations, does not deteriorate the performance of ES, when using the default adaptation mechanism for the strategy parameters. We observe that these results hold not only for the sphere model but also for a wider range of benchmark problems.
