PreAfford: Universal Affordance-Based Pre-Grasping for Diverse Objects and Environments
Kairui Ding, Boyuan Chen, Ruihai Wu, Yuyang Li, Zongzheng Zhang, Huan-ang Gao, Siqi Li, Guyue Zhou, Yixin Zhu, Hao Dong, Hao Zhao
TL;DR
PreAfford addresses the challenge of grasping in diverse objects and environments with two-finger grippers by coupling a relay-trained, two-module framework (pre-grasping and grasping) to a point-level affordance representation. Each module contains affordance, proposal, and critic networks, operating on RGB-D inputs processed by PointNet++ to produce action proposals evaluated in a closed-loop fashion, with a data-driven reward signal guiding pre-grasping. Trained offline on ShapeNet-v2 and validated through both simulations and real-world experiments, PreAfford achieves up to a 69% improvement in grasping success on unseen categories and demonstrates practical deployability across multiple setups. The work advances scene-aware, geometry-conscious manipulation for robust handling of a wide range of objects and environments, while highlighting avenues for improving robustness to irregular shapes and dynamic contexts.
Abstract
Robotic manipulation with two-finger grippers is challenged by objects lacking distinct graspable features. Traditional pre-grasping methods, which typically involve repositioning objects or utilizing external aids like table edges, are limited in their adaptability across different object categories and environments. To overcome these limitations, we introduce PreAfford, a novel pre-grasping planning framework incorporating a point-level affordance representation and a relay training approach. Our method significantly improves adaptability, allowing effective manipulation across a wide range of environments and object types. When evaluated on the ShapeNet-v2 dataset, PreAfford not only enhances grasping success rates by 69% but also demonstrates its practicality through successful real-world experiments. These improvements highlight PreAfford's potential to redefine standards for robotic handling of complex manipulation tasks in diverse settings.
