Meta-Evolve: Continuous Robot Evolution for One-to-many Policy Transfer
Xingyu Liu, Deepak Pathak, Ding Zhao
TL;DR
This paper tackles scalable policy transfer from a single source robot to multiple targets by proposing Meta-Evolve, which uses continuous robot evolution organized as an evolution tree with meta robots. By matching morphologies and interpolating parameters, it constructs shared training pathways and leverages a $p$-Steiner-tree framework to minimize total transfer cost in the evolution space. Across Hand Manipulation Suite and agile locomotion tasks, Meta-Evolve achieves significant reductions in both training and simulation budget compared to independent transfers, illustrating improved scalability when transferring policies to related robotic morphologies. The approach offers a principled, geometry-inspired method for cross-robot imitation learning with practical implications for deploying learned policies on a family of robots.
Abstract
We investigate the problem of transferring an expert policy from a source robot to multiple different robots. To solve this problem, we propose a method named $Meta$-$Evolve$ that uses continuous robot evolution to efficiently transfer the policy to each target robot through a set of tree-structured evolutionary robot sequences. The robot evolution tree allows the robot evolution paths to be shared, so our approach can significantly outperform naive one-to-one policy transfer. We present a heuristic approach to determine an optimized robot evolution tree. Experiments have shown that our method is able to improve the efficiency of one-to-three transfer of manipulation policy by up to 3.2$\times$ and one-to-six transfer of agile locomotion policy by 2.4$\times$ in terms of simulation cost over the baseline of launching multiple independent one-to-one policy transfers.
