Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Yansong Qu, Dian Chen, Xinyang Li, Xiaofan Li, Shengchuan Zhang, Liujuan Cao, Rongrong Ji
TL;DR
This work tackles the challenge of geometric editing in 3D Gaussian Splatting (3DGS) by introducing Drag Your Gaussian (DYG), a drag-based method that uses 3D masks and control-point prompts to steer edits. It couples an implicit Multi-resolution Triplane Positional Encoder with a Region-Specific Positional Decoder and a Soft Local Edit strategy to overcome sparse Gaussian distributions and ensure localized, smooth geometry changes. Guided by an enhanced Drag-SDS loss that fuses a 2D Latent Diffusion Model with a composite noise prediction, DYG achieves multi-view consistent, fine-grained edits while preserving non-edited regions. Extensive experiments on real and generative scenes demonstrate state-of-the-art editing quality, including multi-round dragging and cross-domain generalization, with practical runtime efficiency relative to prior 3DGS editing methods. Limitations stem from the dependence on 2D priors, suggesting future work to speed up interaction and extend to dynamic 4D editing.
Abstract
Recent advancements in 3D scene editing have been propelled by the rapid development of generative models. Existing methods typically utilize generative models to perform text-guided editing on 3D representations, such as 3D Gaussian Splatting (3DGS). However, these methods are often limited to texture modifications and fail when addressing geometric changes, such as editing a character's head to turn around. Moreover, such methods lack accurate control over the spatial position of editing results, as language struggles to precisely describe the extent of edits. To overcome these limitations, we introduce DYG, an effective 3D drag-based editing method for 3D Gaussian Splatting. It enables users to conveniently specify the desired editing region and the desired dragging direction through the input of 3D masks and pairs of control points, thereby enabling precise control over the extent of editing. DYG integrates the strengths of the implicit triplane representation to establish the geometric scaffold of the editing results, effectively overcoming suboptimal editing outcomes caused by the sparsity of 3DGS in the desired editing regions. Additionally, we incorporate a drag-based Latent Diffusion Model into our method through the proposed Drag-SDS loss function, enabling flexible, multi-view consistent, and fine-grained editing. Extensive experiments demonstrate that DYG conducts effective drag-based editing guided by control point prompts, surpassing other baselines in terms of editing effect and quality, both qualitatively and quantitatively. Visit our project page at https://quyans.github.io/Drag-Your-Gaussian.
