Find Any Part in 3D

Ziqi Ma; Yisong Yue; Georgia Gkioxari

Find Any Part in 3D

Ziqi Ma, Yisong Yue, Georgia Gkioxari

TL;DR

Find3D tackles data scarcity in 3D part segmentation by building a scalable data engine that leverages 2D foundation models to annotate 3D assets, producing a dataset with $2.1$ million part annotations across 761 object categories and 124,615 unique part types. A transformer-based 3D part model is trained with a simple contrastive objective to map per-point features into a CLIP-like embedding space, enabling open-world, text-driven segmentation for any object part. The approach yields a $260\%$ improvement in mIoU and speeds inference by $6\times$ to $300\times$ over existing open-world methods, and generalizes to unseen objects without dataset-specific finetuning. The authors release a new open-world 3D part benchmark and demonstrate strong scaling effects, suggesting data scale is the key driver of generalization in 3D segmentation.

Abstract

Why don't we have foundation models in 3D yet? A key limitation is data scarcity. For 3D object part segmentation, existing datasets are small in size and lack diversity. We show that it is possible to break this data barrier by building a data engine powered by 2D foundation models. Our data engine automatically annotates any number of object parts: 1755x more unique part types than existing datasets combined. By training on our annotated data with a simple contrastive objective, we obtain an open-world model that generalizes to any part in any object based on any text query. Even when evaluated zero-shot, we outperform existing methods on the datasets they train on. We achieve 260% improvement in mIoU and boost speed by 6x to 300x. Our scaling analysis confirms that this generalization stems from the data scale, which underscores the impact of our data engine. Finally, to advance general-category open-world 3D part segmentation, we release a benchmark covering a wide range of objects and parts. Project website: https://ziqi-ma.github.io/find3dsite/

Find Any Part in 3D

TL;DR

Abstract

Find Any Part in 3D

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (15)