Name That Part: 3D Part Segmentation and Naming
Soumava Paul, Prakhar Kaushik, Ankit Vaidya, Anand Bhattad, Alan Yuille
TL;DR
ALIGN-Parts reframes semantic 3D part segmentation as a direct set-alignment problem, producing named part decompositions in a single forward pass. It introduces Partlets—shape-conditioned, part-level representations—that are jointly learned with multi-modal geometry, appearance, and affordance-based text embeddings, then matched to candidate descriptions via differentiable optimal transport. The approach enables open-vocabulary naming, permutation-consistent labeling, and efficient annotation at scale, demonstrated by strong improvements over baselines and the creation of TexParts with human-in-the-loop verification. The work also provides new metrics for named 3D part segmentation and shows potential for scalable data generation and downstream 3D asset annotation tasks.
Abstract
We address semantic 3D part segmentation: decomposing objects into parts with meaningful names. While datasets exist with part annotations, their definitions are inconsistent across datasets, limiting robust training. Previous methods produce unlabeled decompositions or retrieve single parts without complete shape annotations. We propose ALIGN-Parts, which formulates part naming as a direct set alignment task. Our method decomposes shapes into partlets - implicit 3D part representations - matched to part descriptions via bipartite assignment. We combine geometric cues from 3D part fields, appearance from multi-view vision features, and semantic knowledge from language-model-generated affordance descriptions. Text-alignment loss ensures partlets share embedding space with text, enabling a theoretically open-vocabulary matching setup, given sufficient data. Our efficient and novel, one-shot, 3D part segmentation and naming method finds applications in several downstream tasks, including serving as a scalable annotation engine. As our model supports zero-shot matching to arbitrary descriptions and confidence-calibrated predictions for known categories, with human verification, we create a unified ontology that aligns PartNet, 3DCoMPaT++, and Find3D, consisting of 1,794 unique 3D parts. We also show examples from our newly created Tex-Parts dataset. We also introduce 2 novel metrics appropriate for the named 3D part segmentation task.
