Dexterous Functional Pre-Grasp Manipulation with Diffusion Policy
Tianhao Wu, Yunchong Gan, Mingdong Wu, Jingbo Cheng, Yaodong Yang, Yixin Zhu, Hao Dong
TL;DR
This work tackles dexterous functional pre-grasp manipulation, where objects must be repositioned and reoriented to achieve functional grasp poses. It introduces a teacher-student framework that uses a novel mutual reward, a mixture-of-experts policy, and a diffusion policy to model complex, high-DOF manipulation and generalize across diverse objects and goal poses. Through offline imitation learning from multiple experts, the diffusion-based student can achieve teacher-level performance and robustly leverage extrinsic dexterity, reporting 72.6% success across 30+ object categories. The approach advances generalizable pre-grasp manipulation with practical potential for real-world functional grasping, while noting challenges with irregular geometries and sim-to-real transfer.
Abstract
In real-world scenarios, objects often require repositioning and reorientation before they can be grasped, a process known as pre-grasp manipulation. Learning universal dexterous functional pre-grasp manipulation requires precise control over the relative position, orientation, and contact between the hand and object while generalizing to diverse dynamic scenarios with varying objects and goal poses. To address this challenge, we propose a teacher-student learning approach that utilizes a novel mutual reward, incentivizing agents to optimize three key criteria jointly. Additionally, we introduce a pipeline that employs a mixture-of-experts strategy to learn diverse manipulation policies, followed by a diffusion policy to capture complex action distributions from these experts. Our method achieves a success rate of 72.6\% across more than 30 object categories by leveraging extrinsic dexterity and adjusting from feedback.
