GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts
Zhenyu Shu, Junlong Yu, Kai Chao, Shiqing Xin, Ligang Liu
TL;DR
GaussEdit introduces a three‑stage framework for adaptive 3D scene editing driven by text and image prompts, anchored in 3D Gaussian Splatting. It combines fast ROI‑based Gaussian initialization, an Adaptive Global‑Local Optimization loop with category‑guided regularization to mitigate the Janus problem, and a texture refinement stage using image‑to‑image diffusion to achieve realistic, prompt‑aligned edits. Empirical results show superior editing accuracy, visual fidelity, and speed compared with prior work, across diverse real‑world and synthetic scenes. This approach enables precise, multi‑view‑consistent manipulation of 3D scenes while maintaining coherence with the surrounding environment, offering a practical tool for content creation and modeling workflows.
Abstract
This paper presents GaussEdit, a framework for adaptive 3D scene editing guided by text and image prompts. GaussEdit leverages 3D Gaussian Splatting as its backbone for scene representation, enabling convenient Region of Interest selection and efficient editing through a three-stage process. The first stage involves initializing the 3D Gaussians to ensure high-quality edits. The second stage employs an Adaptive Global-Local Optimization strategy to balance global scene coherence and detailed local edits and a category-guided regularization technique to alleviate the Janus problem. The final stage enhances the texture of the edited objects using a sophisticated image-to-image synthesis technique, ensuring that the results are visually realistic and align closely with the given prompts. Our experimental results demonstrate that GaussEdit surpasses existing methods in editing accuracy, visual fidelity, and processing speed. By successfully embedding user-specified concepts into 3D scenes, GaussEdit is a powerful tool for detailed and user-driven 3D scene editing, offering significant improvements over traditional methods.
