MeMo: Meaningful, Modular Controllers via Noise Injection
Megan Tjandrasuwita, Jie Xu, Armando Solar-Lezama, Wojciech Matusik
TL;DR
MeMo proposes a modular control framework that learns assembly-specific modules coordinated by a boss controller, trained with a novel modularity objective and noise-injection to encourage robust, low-dimensional coordination. By pretraining modules on a single robot and reusing them for morphologies with the same assemblies, MeMo reduces training time for structure and task transfer, outperforming or matching strong baselines like NerveNet and MetaMorph in various locomotion and grasping scenarios. The approach combines behavior cloning with an invariance-to-noise objective, and the authors provide ablations showing the critical role of noise injection in enabling positive transfer. The work highlights interpretable, assembly-aligned control representations and suggests paths toward broader transfer, multi-robot settings, and real-world deployment, while noting sim-to-real and methodological limitations as areas for future work.
Abstract
Robots are often built from standardized assemblies, (e.g. arms, legs, or fingers), but each robot must be trained from scratch to control all the actuators of all the parts together. In this paper we demonstrate a new approach that takes a single robot and its controller as input and produces a set of modular controllers for each of these assemblies such that when a new robot is built from the same parts, its control can be quickly learned by reusing the modular controllers. We achieve this with a framework called MeMo which learns (Me)aningful, (Mo)dular controllers. Specifically, we propose a novel modularity objective to learn an appropriate division of labor among the modules. We demonstrate that this objective can be optimized simultaneously with standard behavior cloning loss via noise injection. We benchmark our framework in locomotion and grasping environments on simple to complex robot morphology transfer. We also show that the modules help in task transfer. On both structure and task transfer, MeMo achieves improved training efficiency to graph neural network and Transformer baselines.
