PrEditor3D: Fast and Precise 3D Shape Editing
Ziya Erkoç, Can Gümeli, Chaoyang Wang, Matthias Nießner, Angela Dai, Peter Wonka, Hsin-Ying Lee, Peiye Zhuang
TL;DR
PrEditor3D tackles the challenge of fast, precise 3D editing without retraining diffusion models. It couples synchronized multi-view 2D editing with a 3D lifting and merging mechanism to confine edits to user-specified regions while preserving the rest of the shape. The method uses four orthogonal views and a color-coded 3D segmentation via GTR to identify edited regions and apply a robust averaging-based merge, enabling high fidelity and smooth boundaries. Quantitative results across GPTEval3D and directional CLIP metrics, plus user studies, show substantial improvements over state-of-the-art baselines in both visual quality and region preservation, with runtimes suitable for iterative artistic workflows.
Abstract
We propose a training-free approach to 3D editing that enables the editing of a single shape within a few minutes. The edited 3D mesh aligns well with the prompts, and remains identical for regions that are not intended to be altered. To this end, we first project the 3D object onto 4-view images and perform synchronized multi-view image editing along with user-guided text prompts and user-provided rough masks. However, the targeted regions to be edited are ambiguous due to projection from 3D to 2D. To ensure precise editing only in intended regions, we develop a 3D segmentation pipeline that detects edited areas in 3D space, followed by a merging algorithm to seamlessly integrate edited 3D regions with the original input. Extensive experiments demonstrate the superiority of our method over previous approaches, enabling fast, high-quality editing while preserving unintended regions.
