Accelerated co-design of robots through morphological pretraining
Luke Strgar, Sam Kriegman
TL;DR
This work tackles the long-standing challenge of co-designing robot morphology and control by introducing large-scale morphological pretraining to learn a universal, morphology-agnostic controller via differentiable simulation. The pretrained controller enables zero-shot exploration of non-differentiable body changes and highlights a diversity-collapse issue when evolving morphology with a fixed controller. For mitigation, few-shot evolution with generational finetuning preserves and enhances diversity while achieving high performance, whereas simultaneous co-design from scratch without pretraining suffers from rapid diversity loss and slower progress. The approach dramatically accelerates co-design across massive morphospaces and uncovers important considerations for crossover, diversity maintenance, and potential pathways toward self-reconfigurable robots, with implications for real-world transfer and multi-task expansion.
Abstract
The co-design of robot morphology and neural control typically requires using reinforcement learning to approximate a unique control policy gradient for each body plan, demanding massive amounts of training data to measure the performance of each design. Here we show that a universal, morphology-agnostic controller can be rapidly and directly obtained by gradient-based optimization through differentiable simulation. This process of morphological pretraining allows the designer to explore non-differentiable changes to a robot's physical layout (e.g. adding, removing and recombining discrete body parts) and immediately determine which revisions are beneficial and which are deleterious using the pretrained model. We term this process "zero-shot evolution" and compare it with the simultaneous co-optimization of a universal controller alongside an evolving design population. We find the latter results in diversity collapse, a previously unknown pathology whereby the population -- and thus the controller's training data -- converges to similar designs that are easier to steer with a shared universal controller. We show that zero-shot evolution with a pretrained controller quickly yields a diversity of highly performant designs, and by fine-tuning the pretrained controller on the current population throughout evolution, diversity is not only preserved but significantly increased as superior performance is achieved.
