SegGrasp: Zero-Shot Task-Oriented Grasping via Semantic and Geometric Guided Segmentation

Haosheng Li; Weixin Mao; Weipeng Deng; Chenyu Meng; Rui Zhang; Fan Jia; Tiancai Wang; Haoqiang Fan; Hongan Wang; Xiaoming Deng

SegGrasp: Zero-Shot Task-Oriented Grasping via Semantic and Geometric Guided Segmentation

Haosheng Li, Weixin Mao, Weipeng Deng, Chenyu Meng, Rui Zhang, Fan Jia, Tiancai Wang, Haoqiang Fan, Hongan Wang, Xiaoming Deng

TL;DR

A training-free framework that incorporates both semantic and geometric priors for zero-shot task-oriented grasp generation, SegGrasp, which first leverages the vision-language models like GLIP for coarse segmentation and uses detailed geometric information from convex decomposition to improve segmentation quality through a fusion policy named GeoFusion.

Abstract

Task-oriented grasping, which involves grasping specific parts of objects based on their functions, is crucial for developing advanced robotic systems capable of performing complex tasks in dynamic environments. In this paper, we propose a training-free framework that incorporates both semantic and geometric priors for zero-shot task-oriented grasp generation. The proposed framework, SegGrasp, first leverages the vision-language models like GLIP for coarse segmentation. It then uses detailed geometric information from convex decomposition to improve segmentation quality through a fusion policy named GeoFusion. An effective grasp pose can be generated by a grasping network with improved segmentation. We conducted the experiments on both segmentation benchmark and real-world robot grasping. The experimental results show that SegGrasp surpasses the baseline by more than 15\% in grasp and segmentation performance.

SegGrasp: Zero-Shot Task-Oriented Grasping via Semantic and Geometric Guided Segmentation

TL;DR

Abstract

SegGrasp: Zero-Shot Task-Oriented Grasping via Semantic and Geometric Guided Segmentation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)