LEGATO: Cross-Embodiment Imitation Using a Grasping Tool

Mingyo Seo; H. Andy Park; Shenli Yuan; Yuke Zhu; Luis Sentis

LEGATO: Cross-Embodiment Imitation Using a Grasping Tool

Mingyo Seo, H. Andy Park, Shenli Yuan, Yuke Zhu, Luis Sentis

TL;DR

LEGATO addresses the challenge of transferring visuomotor skills across robots with different morphologies by introducing a handheld gripper that unifies action and observation spaces for demonstrations and deployment. A high-level visuomotor policy outputs gripper trajectories in SE(3), which are then retargeted to diverse robots through an IK-based quadratic program with eSNS optimization, underpinned by a motion-invariant regularization via the Denavit-Hartenberg Bidirectional transform. The core technical contributions are the two-tier policy design, the motion-invariant loss that reduces embodiment bias, and the LEGATO Gripper design enabling hardware-agnostic demonstrations. Experimental results in simulation and on real robots demonstrate improved cross-embodiment transfer and practical viability for scalable imitation learning across heterogeneous robotic platforms.

Abstract

Cross-embodiment imitation learning enables policies trained on specific embodiments to transfer across different robots, unlocking the potential for large-scale imitation learning that is both cost-effective and highly reusable. This paper presents LEGATO, a cross-embodiment imitation learning framework for visuomotor skill transfer across varied kinematic morphologies. We introduce a handheld gripper that unifies action and observation spaces, allowing tasks to be defined consistently across robots. We train visuomotor policies on task demonstrations using this gripper through imitation learning, applying transformation to a motion-invariant space for computing the training loss. Gripper motions generated by the policies are retargeted into high-degree-of-freedom whole-body motions using inverse kinematics for deployment across diverse embodiments. Our evaluations in simulation and real-robot experiments highlight the framework's effectiveness in learning and transferring visuomotor skills across various robots. More information can be found on the project page: https://ut-hcrl.github.io/LEGATO.

LEGATO: Cross-Embodiment Imitation Using a Grasping Tool

TL;DR

Abstract

LEGATO: Cross-Embodiment Imitation Using a Grasping Tool

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)