DiPGrasp: Parallel Local Searching for Efficient Differentiable Grasp Planning
Wenqiang Xu, Jieyi Zhang, Tutian Tang, Zhenjun Yu, Yutong Li, Cewu Lu
TL;DR
DiPGrasp is a fast, differentiable grasp planner for high-DOF dexterous hands. It couples a differentiable grasp quality metric based on force-closure geometric surface matching with gradient-based optimization and parallel sampling. It supports grippers with different DOFs, handles collisions during optimization through a barrier term and simple collision checks, and uses a gripper weighting map to bias contact toward the palmar and fingertip regions. The authors validate the design in three applications (grasp dataset construction, mask-conditioned planning, and pose refinement), reporting faster search and higher-quality grasps than baseline planners, along with real-world tests on the Barrett hand and the Schunk SVH 5-finger hand. The work is relevant to fast grasp generation, dataset production, and integration with learning-based perception systems, and it outlines directions toward greater robustness and broader differentiable manipulation frameworks.
Abstract
Grasp planning is an important task for robotic manipulation. Although it is a richly studied area, a standalone, fast, and differentiable grasp planner that works with robot grippers of different DOFs has not been reported. In this work, we present DiPGrasp, a grasp planner that satisfies all of these goals. DiPGrasp adopts a force-closure geometric surface matching metric as its grasp quality measure. It applies a gradient-based optimization scheme to this metric, together with parallel sampling and collision handling. This not only drastically accelerates the grasp search over the object surface but also makes the process differentiable. We apply DiPGrasp to three applications: grasp dataset construction, mask-conditioned planning, and pose refinement. For dataset construction, as a standalone planner, DiPGrasp has clear advantages in speed and quality over several classic planners. For mask-conditioned planning, it can turn a 3D perception model into a 3D grasp detection model instantly. As a pose refiner, it can optimize both the coarse grasp prediction from a neural network and the network's parameters. Finally, we conduct real-world experiments with the Barrett hand and the Schunk SVH 5-finger hand. Video and supplementary materials are available on our website: https://dipgrasp.robotflow.ai.
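
To make the idea of parallel local search concrete, the following is a minimal toy sketch, not the DiPGrasp implementation. It assumes a unit-sphere object, a "gripper" reduced to three weighted contact points that can only translate, a squared surface-distance energy standing in for the surface matching metric, and a quadratic penetration penalty standing in for the barrier term. All names (`energy_and_grad`, `parallel_local_search`, `mu`, the contact layout) are hypothetical and chosen only for illustration.

```python
import numpy as np


def energy_and_grad(t, contacts, weights, radius=1.0, mu=5.0):
    """Toy energy and gradient for N candidate translations t of shape (N, 3).

    Assumption: the object is a sphere of the given radius centred at the origin,
    so the signed distance of a point p is ||p|| - radius.
    """
    pts = t[:, None, :] + contacts[None, :, :]       # (N, K, 3) contact points in world frame
    r = np.linalg.norm(pts, axis=-1)                 # (N, K) distance to the sphere centre
    d = r - radius                                   # signed distance to the surface
    pen = np.maximum(-d, 0.0)                        # penetration depth (0 when outside)
    # Weighted surface-matching term plus an extra penalty on penetration.
    energy = np.sum(weights * (d**2 + mu * pen**2), axis=1)   # (N,)
    dE_dd = 2.0 * weights * (d - mu * pen)           # derivative of the energy w.r.t. d
    normals = pts / r[..., None]                     # gradient of d w.r.t. the point
    grad = np.sum(dE_dd[..., None] * normals, axis=1)          # (N, 3)
    return energy, grad


def parallel_local_search(n_starts=64, steps=200, lr=0.05, seed=0):
    """Run batched gradient descent from many sampled starts at once."""
    rng = np.random.default_rng(seed)
    # Hypothetical gripper: three contact points with per-point weights,
    # loosely playing the role of a weighting map over the hand surface.
    contacts = np.array([[0.3, 0.0, 0.0], [-0.3, 0.0, 0.0], [0.0, 0.3, 0.0]])
    weights = np.array([0.5, 0.25, 0.25])
    # Sample start positions on a sphere of radius 2 around the object.
    dirs = rng.normal(size=(n_starts, 3))
    dirs /= np.linalg.norm(dirs, axis=1, keepdims=True)
    t = 2.0 * dirs
    initial_energy, _ = energy_and_grad(t, contacts, weights)
    for _ in range(steps):
        _, grad = energy_and_grad(t, contacts, weights)
        t = t - lr * grad                            # all candidates updated in one step
    final_energy, _ = energy_and_grad(t, contacts, weights)
    return t, initial_energy, final_energy


if __name__ == "__main__":
    t, e0, e1 = parallel_local_search()
    best = int(np.argmin(e1))
    print("best candidate:", best, "energy:", float(e1[best]))
```

Because every candidate is a row of the same array, one vectorized update advances all local searches together. A real planner would additionally optimize the hand rotation and joint angles, and would evaluate the metric against the actual object geometry rather than an analytic sphere.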
