A Benchmark Grocery Dataset of Realworld Point Clouds From Single View
Shivanand Venkanna Sheshappanavar, Tejas Anvekar, Shivanand Kundargi, Yufan Wang, Chandra Kambhamettu
TL;DR
3DGrocery100 introduces the largest real-world 3D grocery benchmark built from RGB-D single-view captures, yielding 87,898 point clouds across 100 fine-grained classes. The dataset enables rigorous evaluation of 3D point-cloud classifiers, few-shot generalization via the 63-class subset 3DGrocery63, and class-incremental learning with a LWF-based baseline, across color and no-color variants. Across multiple architectures (PointNet, PointNet++, DGCNN, PCT, PointMLP, PointNeXt), the work demonstrates strong color-based discriminability and highlights challenges in cross-domain and continual learning scenarios, providing valuable baselines and insights for real-world grocery recognition. The work also offers a pre-training subset (Packages) and extensive supplementary materials to support researchers in 3D grocery perception, with project resources at the provided page, underscoring its practical impact for automatic checkout, robotic navigation, and assistive technologies.
Abstract
Fine-grained grocery object recognition is an important computer vision problem with broad applications in automatic checkout, in-store robotic navigation, and assistive technologies for the visually impaired. Existing datasets on groceries are mainly 2D images. Models trained on these datasets are limited to learning features from the regular 2D grids. While portable 3D sensors such as Kinect were commonly available for mobile phones, sensors such as LiDAR and TrueDepth, have recently been integrated into mobile phones. Despite the availability of mobile 3D sensors, there are currently no dedicated real-world large-scale benchmark 3D datasets for grocery. In addition, existing 3D datasets lack fine-grained grocery categories and have limited training samples. Furthermore, collecting data by going around the object versus the traditional photo capture makes data collection cumbersome. Thus, we introduce a large-scale grocery dataset called 3DGrocery100. It constitutes 100 classes, with a total of 87,898 3D point clouds created from 10,755 RGB-D single-view images. We benchmark our dataset on six recent state-of-the-art 3D point cloud classification models. Additionally, we also benchmark the dataset on few-shot and continual learning point cloud classification tasks. Project Page: https://bigdatavision.org/3DGrocery100/.
