More complex environments may be required to discover benefits of lifetime learning in evolving robots
Ege de Bruin, Kyrre Glette, Kai Olav Ellefsen
TL;DR
The paper investigates whether intra-life learning enhances evolving robot locomotion by adding a Bayesian Optimization-based control-learning phase to morphology evolution and comparing performance in flat versus hills environments. Using a Revolve2 modular framework and MuJoCo simulation, it tests learning budgets of 1, 30, and 50 iterations to assess how control optimization interacts with environmental complexity. The key finding is that lifetime learning yields more pronounced gains in the hills environment, with significant performance differences observed after substantial function evaluations, underscoring the importance of environmental difficulty for observing lifelong learning benefits. The work suggests that environment design is crucial for evaluating learning-enabled evolution and points to future directions including no-learning baselines and Lamarckian extensions to control.
Abstract
It is well known that intra-life learning, defined as an additional controller optimization loop, is beneficial for evolving robot morphologies for locomotion. In this work, we investigate this further by comparing it in two different environments: an easy flat environment and a more challenging hills environment. We show that learning is significantly more beneficial in a hilly environment than in a flat environment and that it might be needed to evaluate robots in a more challenging environment to see the benefits of learning.
