Trackable Agent-based Evolution Models at Wafer Scale
Matthew Andres Moreno, Connor Yang, Emily Dolson, Luis Zaman
TL;DR
This work tackles scaling agent-based evolution to wafer-scale hardware while preserving phylogenetic observability. It introduces an asynchronous island-model GA tailored for the Cerebras Wafer-Scale Engine and a surface-based hereditary stratigraphy framework for post hoc phylogeny tracking, implemented in Cerebras Software Language. Benchmarking shows significant runtime improvements and detectable phylometric signals under adaptive dynamics, with emulated and on-hardware experiments validating end-state genome reconstructions. The results suggest the potential to reach quadrillions of replications per day at wafer scale, enabling new inquiries in evolutionary biology and artificial life on emerging HPC platforms.
Abstract
Continuing improvements in computing hardware are poised to transform capabilities for in silico modeling of cross-scale phenomena underlying major open questions in evolutionary biology and artificial life, such as transitions in individuality, eco-evolutionary dynamics, and rare evolutionary events. Emerging ML/AI-oriented hardware accelerators, like the 850,000 processor Cerebras Wafer Scale Engine (WSE), hold particular promise. However, practical challenges remain in conducting informative evolution experiments that efficiently utilize these platforms' large processor counts. Here, we focus on the problem of extracting phylogenetic information from agent-based evolution on the WSE platform. This goal drove significant refinements to decentralized in silico phylogenetic tracking, reported here. These improvements yield order-of-magnitude performance improvements. We also present an asynchronous island-based genetic algorithm (GA) framework for WSE hardware. Emulated and on-hardware GA benchmarks with a simple tracking-enabled agent model clock upwards of 1 million generations a minute for population sizes reaching 16 million agents. We validate phylogenetic reconstructions from these trials and demonstrate their suitability for inference of underlying evolutionary conditions. In particular, we demonstrate extraction, from wafer-scale simulation, of clear phylometric signals that differentiate runs with adaptive dynamics enabled versus disabled. Together, these benchmark and validation trials reflect strong potential for highly scalable agent-based evolution simulation that is both efficient and observable. Developed capabilities will bring entirely new classes of previously intractable research questions within reach, benefiting further explorations within the evolutionary biology and artificial life communities across a variety of emerging high-performance computing platforms.
