Complexity and accessibility of random landscapes
Sakshi Pahujani, Joachim Krug
TL;DR
This work analyzes high-dimensional fitness landscapes by contrasting uncorrelated (HoC) models with structured landscapes, deriving exact expressions for peak statistics and accessibility thresholds. It shows that in HoC, the expected number of peaks grows as $\mathbb{E}(N_L)=\dfrac{a^L}{(a-1)L+1}$ while the probability a random genotype is a peak vanishes as $L$ increases, revealing a tension between ruggedness and accessibility. The study then connects epistasis to submodularity via UNE, demonstrating how submodular landscapes guarantee a subset–superset accessibility property (AP) and yield large adaptive basins, with implications for evolutionary dynamics and optimization in complex systems. The results illuminate when accessible paths emerge (direct or indirect) and reveal that structured landscapes can organize accessible paths into expansive basins, offering insights relevant to biology and related fields, including statistical physics and machine learning.
Abstract
These notes introduce probabilistic landscape models defined on high-dimensional discrete sequence spaces. The models are motivated primarily by fitness landscapes in evolutionary biology, but links to statistical physics and computer science are mentioned where appropriate. Elementary and advanced results on the structure of landscapes are described with a focus on features that are relevant to evolutionary searches, such as the number of local maxima and the existence of fitness-monotonic paths. The recent discovery of submodularity as a biologically meaningful property of fitness landscapes and its consequences for their accessibility is discussed in detail.
