3x2: 3D Object Part Segmentation by 2D Semantic Correspondences
Anh Thai, Weiyao Wang, Hao Tang, Stefan Stojanov, Matt Feiszli, James M. Rehg
TL;DR
3-By-2 addresses 3D object part segmentation under limited 3D annotations by transferring labels from richly annotated 2D datasets through diffusion-model–based semantic correspondences across multi-view renders, without training. The method renders the object in multiple views, performs 2D segmentation from a 2D database, and aggregates predictions with a novel mask-consistency module before back-projecting to 3D, all in a language-free, training-free framework. It introduces non-overlapping 2D mask generation and mask-level label transfer to preserve boundary precision across granularities and demonstrates strong cross-category transfer with state-of-the-art performance in zero-shot and few-shot settings on PartNetE and PartNet. The work provides extensive ablations, analyzes database composition, and includes qualitative results on real and synthetic objects, underscoring the practical impact of leveraging visual semantic correspondences for rapid 3D part segmentation without task-specific training.
Abstract
3D object part segmentation is essential in computer vision applications. While substantial progress has been made in 2D object part segmentation, the 3D counterpart has received less attention, in part due to the scarcity of annotated 3D datasets, which are expensive to collect. In this work, we propose to leverage a few annotated 3D shapes or richly annotated 2D datasets to perform 3D object part segmentation. We present our novel approach, termed 3-By-2 that achieves SOTA performance on different benchmarks with various granularity levels. By using features from pretrained foundation models and exploiting semantic and geometric correspondences, we are able to overcome the challenges of limited 3D annotations. Our approach leverages available 2D labels, enabling effective 3D object part segmentation. Our method 3-By-2 can accommodate various part taxonomies and granularities, demonstrating interesting part label transfer ability across different object categories. Project website: \url{https://ngailapdi.github.io/projects/3by2/}.
