GenDexHand: Generative Simulation for Dexterous Hands
Feng Chen, Zhuxiu Xu, Tianzhe Chu, Xunzhe Zhou, Li Sun, Zewen Wu, Shenghua Gao, Zhongyu Li, Yanchao Yang, Yi Ma
TL;DR
GenDexHand tackles data scarcity in dexterous hand manipulation by presenting a fully automated generative simulation pipeline that creates diverse tasks and environments in simulation. It combines task proposal by foundation models, multimodal model refinement, and hierarchical trajectory generation to produce high-quality dexterous hand data, achieving a substantial average improvement in task success via a motion-planning–RL hybrid and subtask decomposition. The approach yields richer task diversity and scalable data generation, addressing a key bottleneck in dexterous embodied intelligence. This work empowers scalable, simulation-based training for dexterous hands and lays groundwork for broader adoption of generative models in complex robotic manipulation.
Abstract
Data scarcity remains a fundamental bottleneck for embodied intelligence. Existing approaches use large language models (LLMs) to automate gripper-based simulation generation, but they transfer poorly to dexterous manipulation, which demands more specialized environment design. Meanwhile, dexterous manipulation tasks are inherently more difficult due to their higher degrees of freedom. Massively generating feasible and trainable dexterous hand tasks remains an open challenge. To this end, we present GenDexHand, a generative simulation pipeline that autonomously produces diverse robotic tasks and environments for dexterous manipulation. GenDexHand introduces a closed-loop refinement process that adjusts object placements and scales based on vision-language model (VLM) feedback, substantially improving the average quality of generated environments. Each task is further decomposed into sub-tasks to enable sequential reinforcement learning, reducing training time and increasing success rates. Our work provides a viable path toward scalable training of diverse dexterous hand behaviors in embodied intelligence by offering a simulation-based solution to synthetic data generation. Our website: https://winniechen2002.github.io/GenDexHand/.
