TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation
Chaofan Luo, Donglin Di, Xun Yang, Yongjia Ma, Zhou Xue, Chen Wei, Yebin Liu
TL;DR
This work tackles the challenge of preserving multi-view consistency in text-guided 3D editing. It introduces Trajectory-Anchored Multi-View Editing (TrAME), which couples 2D view edits with 3D updates via the Trajectory-Anchored Scheme (TAS) and enforces cross-view coherence with the View-Consistent Attention Control (VCAC) module. A theoretical bridge is drawn between optimization-based SDS and reconstruction-based DDIM/DDCM approaches, offering a unified perspective for design choices. Empirical results show improved editing quality and view consistency over state-of-the-art methods, with extensive ablations validating the contributions. The approach enables more reliable, progressively updated 3D scene edits, and the code will be released publicly.
Abstract
Despite significant strides in the field of 3D scene editing, current methods face substantial challenges, particularly in preserving 3D consistency during the multi-view editing process. To tackle this challenge, we propose a progressive 3D editing strategy that ensures multi-view consistency via a Trajectory-Anchored Scheme (TAS) with a dual-branch editing mechanism. Specifically, TAS facilitates a tightly coupled iterative process between 2D view editing and 3D updating, preventing the accumulation of errors introduced by the text-to-image process. Additionally, we explore the relationship between optimization-based methods and reconstruction-based methods, offering a unified perspective for selecting superior design choices, which supports the rationale behind the design of TAS. We further present a tuning-free View-Consistent Attention Control (VCAC) module that leverages cross-view semantic and geometric references from the source branch to yield aligned views from the target branch during the editing of 2D views. To validate the effectiveness of our method, we analyze 2D examples to demonstrate the improved consistency achieved with the VCAC module. Extensive quantitative and qualitative results in text-guided 3D scene editing further indicate that our method achieves superior editing quality compared to state-of-the-art methods. We will make the complete codebase publicly available following the conclusion of the review process.
