Lifelong Evolution of Swarms
Lorenzo Leuzzi, Simon Jones, Sabine Hauert, Davide Bacciu, Andrea Cossu
TL;DR
This work integrates lifelong learning with swarm evolution by introducing a lifelong evolutionary framework where a population of swarm controllers evolves under dynamically changing tasks. The key idea is that population diversity can retain knowledge of previous tasks and enable faster adaptation when tasks reappear, while the top individual may forget prior tasks unless regularization is applied. To address forgetting, the authors introduce a genetic distance regularization term, inspired by Elastic Weight Consolidation, that ties the current population to a reference model from the previous task. Empirical results on a NEAT-based swarm foraging task show rapid adaptation and partial retention at the population level, with model-specific regularization significantly reducing forgetting for the top individual, albeit with some task-performance trade-offs. The findings illuminate the potential of combining evolutionary search with lifelong learning principles to build robust, adaptive swarm systems in dynamic environments and open avenues for replay-based or archive-informed lifelong strategies.
Abstract
Adapting to task changes without forgetting previous knowledge is a key skill for intelligent systems, and a crucial aspect of lifelong learning. Swarm controllers, however, are typically designed for specific tasks, lacking the ability to retain knowledge across changing tasks. Lifelong learning, on the other hand, focuses on individual agents with limited insights into the emergent abilities of a collective like a swarm. To address this gap, we introduce a lifelong evolutionary framework for swarms, where a population of swarm controllers is evolved in a dynamic environment that incrementally presents novel tasks. This requires evolution to find controllers that quickly adapt to new tasks while retaining knowledge of previous ones, as they may reappear in the future. We discover that the population inherently preserves information about previous tasks, and it can reuse it to foster adaptation and mitigate forgetting. In contrast, the top-performing individual for a given task catastrophically forgets previous tasks. To mitigate this phenomenon, we design a regularization process for the evolutionary algorithm, reducing forgetting in top-performing individuals. Evolving swarms in a lifelong fashion raises fundamental questions on the current state of deep lifelong learning and on the robustness of swarm controllers in dynamic environments.
