Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation
Jianglong Ye, Keyi Wang, Chengjing Yuan, Ruihan Yang, Yiquan Li, Jiyue Zhu, Yuzhe Qin, Xueyan Zou, Xiaolong Wang
TL;DR
Dex1B presents a scalable data-generation framework that blends optimization-based seed data with a conditional CVAE to produce 1B demonstrations for dexterous grasping and articulation. The DexSimple baseline, empowered by geometric constraints and post-optimization, achieves state-of-the-art performance and demonstrates strong sim-to-real transfer. The work provides a comprehensive benchmark, extensive analyses of data diversity and scaling, and a practical pipeline for high-volume, diverse dexterous demonstrations with real-world applicability. Overall, it advances data-centric approaches in dexterous manipulation and offers practical pathways for training robust, transferable policies.
Abstract
Generating large-scale demonstrations for dexterous hand manipulation remains challenging, and several approaches have been proposed in recent years to address this. Among them, generative models have emerged as a promising paradigm, enabling the efficient creation of diverse and physically plausible demonstrations. In this paper, we introduce Dex1B, a large-scale, diverse, and high-quality demonstration dataset produced with generative models. The dataset contains one billion demonstrations for two fundamental tasks: grasping and articulation. To construct it, we propose a generative model that integrates geometric constraints to improve feasibility and applies additional conditions to enhance diversity. We validate the model on both established and newly introduced simulation benchmarks, where it significantly outperforms prior state-of-the-art methods. Furthermore, we demonstrate its effectiveness and robustness through real-world robot experiments. Our project page is at https://jianglongye.com/dex1b
