Dexterous Manipulation Based on Prior Dexterous Grasp Pose Knowledge
Hengxu Yan, Haoshu Fang, Cewu Lu
TL;DR
This work tackles dexterous manipulation by integrating prior dexterous grasp pose knowledge into a two-phase framework: first generate an initial grasp pose targeting the object's functional part using segmentation and anygrasp-based proposals, then refine the grasp through PPO-based reinforcement learning with partial-view observations. The method decomposes rewards into interaction, completion, and restriction components to guide safe and efficient manipulation, and maps two-finger grasps to a full dexterous hand, enabling realistic control of a high-DoF system. Extensive simulation on four tasks and real-world tests demonstrate substantial gains in learning efficiency and success rates over baselines, with successful transfer across robotic platforms. The results suggest that leveraging structured prior knowledge can dramatically improve sample efficiency and robustness in complex dexterous manipulation, paving the way for practical deployment in varied environments.
Abstract
Dexterous manipulation has received considerable attention in recent research. Predominantly, existing studies have concentrated on reinforcement learning methods to address the substantial degrees of freedom in hand movements. Nonetheless, these methods typically suffer from low efficiency and accuracy. In this work, we introduce a novel reinforcement learning approach that leverages prior dexterous grasp pose knowledge to enhance both efficiency and accuracy. Unlike previous work, they always make the robotic hand go with a fixed dexterous grasp pose, We decouple the manipulation process into two distinct phases: initially, we generate a dexterous grasp pose targeting the functional part of the object; after that, we employ reinforcement learning to comprehensively explore the environment. Our findings suggest that the majority of learning time is expended in identifying the appropriate initial position and selecting the optimal manipulation viewpoint. Experimental results demonstrate significant improvements in learning efficiency and success rates across four distinct tasks.
