PCF-Grasp: Converting Point Completion to Geometry Feature to Enhance 6-DoF Grasp
Yaofeng Cheng, Fusheng Zha, Wei Guo, Pengfei Wang, Chao Zeng, Lining Sun, Chenguang Yang
TL;DR
The paper tackles the challenge of incomplete geometry in single-view depth-point clouds for 6-DoF grasping. It introduces Completion Feature Grasp (CFG), which converts point completion into a hidden-space shape feature (via a PCF-Layer) that augments the grasp network while retaining reliance on original points. A Score Filter then selects executable grasps by considering robot motion feasibility, bridging the gap between network output and real-world execution. Real-world experiments show notable gains over state-of-the-art methods and demonstrate robustness across viewpoints and clutter, with 89% success in real robot grasping and around an 18% improvement over prior best results.
Abstract
The 6-Degree of Freedom (DoF) grasp method based on point clouds has shown significant potential in enabling robots to grasp target objects. However, most existing methods are based on the point clouds (2.5D points) generated from single-view depth images. These point clouds only have one surface side of the object providing incomplete geometry information, which mislead the grasping algorithm to judge the shape of the target object, resulting in low grasping accuracy. Humans can accurately grasp objects from a single view by leveraging their geometry experience to estimate object shapes. Inspired by humans, we propose a novel 6-DoF grasping framework that converts the point completion results as object shape features to train the 6-DoF grasp network. Here, point completion can generate approximate complete points from the 2.5D points similar to the human geometry experience, and converting it as shape features is the way to utilize it to improve grasp efficiency. Furthermore, due to the gap between the network generation and actual execution, we integrate a score filter into our framework to select more executable grasp proposals for the real robot. This enables our method to maintain a high grasp quality in any camera viewpoint. Extensive experiments demonstrate that utilizing complete point features enables the generation of significantly more accurate grasp proposals and the inclusion of a score filter greatly enhances the credibility of real-world robot grasping. Our method achieves a 17.8\% success rate higher than the state-of-the-art method in real-world experiments.
