Knowledge Diversion for Efficient Morphology Control and Policy Transfer

Fu Feng; Ruixiao Shi; Yucheng Xie; Jianlu Shen; Jing Wang; Xin Geng

Knowledge Diversion for Efficient Morphology Control and Policy Transfer

Fu Feng, Ruixiao Shi, Yucheng Xie, Jianlu Shen, Jing Wang, Xin Geng

TL;DR

This work tackles the difficulty of transferring policies across diverse agent morphologies and tasks by introducing DivMorph, a modular learning framework that decomposes Transformer weights using SVD into shared learngenes and morphology-/task-specific tailors. It advances policy reuse and efficiency through dynamic soft gating based on morphology and task embeddings, enabling zero-shot generalization and substantial deployment savings. Empirical results on the UNIMAL benchmark show DivMorph delivers state-of-the-art cross-task transfer with roughly 3× faster sample efficiency and up to 17× model-size reduction for deployment, while maintaining or surpassing performance on training morphologies. Overall, DivMorph demonstrates that explicit knowledge disentanglement and modular routing can yield scalable, adaptable universal morphology control suitable for real-world robotic systems.

Abstract

Universal morphology control aims to learn a universal policy that generalizes across heterogeneous agent morphologies, with Transformer-based controllers emerging as a popular choice. However, such architectures incur substantial computational costs, resulting in high deployment overhead, and existing methods exhibit limited cross-task generalization, necessitating training from scratch for each new task. To this end, we propose \textbf{DivMorph}, a modular training paradigm that leverages knowledge diversion to learn decomposable controllers. DivMorph factorizes randomly initialized Transformer weights into factor units via SVD prior to training and employs dynamic soft gating to modulate these units based on task and morphology embeddings, separating them into shared \textit{learngenes} and morphology- and task-specific \textit{tailors}, thereby achieving knowledge disentanglement. By selectively activating relevant components, DivMorph enables scalable and efficient policy deployment while supporting effective policy transfer to novel tasks. Extensive experiments demonstrate that DivMorph achieves state-of-the-art performance, achieving a 3$\times$ improvement in sample efficiency over direct finetuning for cross-task transfer and a 17$\times$ reduction in model size for single-agent deployment.

Knowledge Diversion for Efficient Morphology Control and Policy Transfer

TL;DR

Abstract

Knowledge Diversion for Efficient Morphology Control and Policy Transfer

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)