DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning

Zhenyu Jiang; Yuqi Xie; Kevin Lin; Zhenjia Xu; Weikang Wan; Ajay Mandlekar; Linxi Fan; Yuke Zhu

DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning

Zhenyu Jiang, Yuqi Xie, Kevin Lin, Zhenjia Xu, Weikang Wan, Ajay Mandlekar, Linxi Fan, Yuke Zhu

TL;DR

DexMimicGen addresses the data bottleneck in training policies for bimanual dexterous robots by transforming a handful of human demonstrations into thousands of simulation trajectories. It introduces asynchronous per-arm execution, coordination synchronization, and sequential ordering to synthesize realistic multi-arm trajectories from limited data. Nine simulation environments across three embodiments demonstrate versatility, generating 21K demos from 60 source demonstrations and enabling a real2sim2real pipeline that achieves high success in a can-sorting task. The results, including comparisons to baselines and real-world tests with a digital twin, highlight the practical viability and potential for advancing data-efficient dexterous manipulation research.

Abstract

Imitation learning from human demonstrations is an effective means to teach robots manipulation skills. But data acquisition is a major bottleneck in applying this paradigm more broadly, due to the amount of cost and human effort involved. There has been significant interest in imitation learning for bimanual dexterous robots, like humanoids. Unfortunately, data collection is even more challenging here due to the challenges of simultaneously controlling multiple arms and multi-fingered hands. Automated data generation in simulation is a compelling, scalable alternative to fuel this need for data. To this end, we introduce DexMimicGen, a large-scale automated data generation system that synthesizes trajectories from a handful of human demonstrations for humanoid robots with dexterous hands. We present a collection of simulation environments in the setting of bimanual dexterous manipulation, spanning a range of manipulation behaviors and different requirements for coordination among the two arms. We generate 21K demos across these tasks from just 60 source human demos and study the effect of several data generation and policy learning decisions on agent performance. Finally, we present a real-to-sim-to-real pipeline and deploy it on a real-world humanoid can sorting task. Generated datasets, simulation environments and additional results are at https://dexmimicgen.github.io/

DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning

TL;DR

Abstract

DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)