Table of Contents
Fetching ...

Supermarket-6DoF: A Real-World Grasping Dataset and Grasp Pose Representation Analysis

Jason Toskov, Akansel Cosgun

TL;DR

The paper tackles the lack of real-world benchmarking for 6-DoF robotic grasping by introducing Supermarket-6DoF, a dataset of 1,500 real-world grasp attempts across 20 supermarket objects with ground-truth success and stability labels. It provides rich sensory data (RGB, depth, and point clouds) and exact 6-DoF grasp poses, enabling analysis of grasp representations beyond traditional top-down or analytic metrics. The authors compare three gripper-pose representations and demonstrate that modeling the gripper as a point cloud yields the best grasp-success prediction performance, with stability prediction remaining more challenging, especially for heavier or more rigid objects. The dataset and accompanying tools support reproducible benchmarking and are poised to advance learning-based grasping methods in real-world manipulation tasks.

Abstract

We present Supermarket-6DoF, a real-world dataset of 1500 grasp attempts across 20 supermarket objects with publicly available 3D models. Unlike most existing grasping datasets that rely on analytical metrics or simulation for grasp labeling, our dataset provides ground-truth outcomes from physical robot executions. Among the few real-world grasping datasets, wile more modest in size, Supermarket-6DoF uniquely features full 6-DoF grasp poses annotated with both initial grasp success and post-grasp stability under external perturbation. We demonstrate the dataset's utility by analyzing three grasp pose representations for grasp success prediction from point clouds. Our results show that representing the gripper geometry explicitly as a point cloud achieves higher prediction accuracy compared to conventional quaternion-based grasp pose encoding.

Supermarket-6DoF: A Real-World Grasping Dataset and Grasp Pose Representation Analysis

TL;DR

The paper tackles the lack of real-world benchmarking for 6-DoF robotic grasping by introducing Supermarket-6DoF, a dataset of 1,500 real-world grasp attempts across 20 supermarket objects with ground-truth success and stability labels. It provides rich sensory data (RGB, depth, and point clouds) and exact 6-DoF grasp poses, enabling analysis of grasp representations beyond traditional top-down or analytic metrics. The authors compare three gripper-pose representations and demonstrate that modeling the gripper as a point cloud yields the best grasp-success prediction performance, with stability prediction remaining more challenging, especially for heavier or more rigid objects. The dataset and accompanying tools support reproducible benchmarking and are poised to advance learning-based grasping methods in real-world manipulation tasks.

Abstract

We present Supermarket-6DoF, a real-world dataset of 1500 grasp attempts across 20 supermarket objects with publicly available 3D models. Unlike most existing grasping datasets that rely on analytical metrics or simulation for grasp labeling, our dataset provides ground-truth outcomes from physical robot executions. Among the few real-world grasping datasets, wile more modest in size, Supermarket-6DoF uniquely features full 6-DoF grasp poses annotated with both initial grasp success and post-grasp stability under external perturbation. We demonstrate the dataset's utility by analyzing three grasp pose representations for grasp success prediction from point clouds. Our results show that representing the gripper geometry explicitly as a point cloud achieves higher prediction accuracy compared to conventional quaternion-based grasp pose encoding.

Paper Structure

This paper contains 16 sections, 1 equation, 8 figures, 4 tables.

Figures (8)

  • Figure 1: We present a real-world grasping dataset with 1500 grasp attempts on 20 supermarket objects, featuring 6-DoF grasp attempts.
  • Figure 2: Example grasps from the Supermarket-6DoF dataset. Green grasps (top) are stable successes, orange grasps (middle) are successes that aren't stable, and red grasps (bottom) are failures.
  • Figure 3: Point cloud pre-processing steps. Red color indicates points to be removed. (a) original point cloud, (b) workspace crop, (c) normal computation, (d) plane removal, (e) outlier removal, (f) downsampling.
  • Figure 5: Success rate vs stable success rate of grasped objects.
  • Figure : Append grasp pose
  • ...and 3 more figures