View-consistent Object Removal in Radiance Fields
Yiren Lu, Jing Ma, Yu Yin
TL;DR
This work tackles cross-view inconsistency in editing radiance fields by proposing a single-reference inpainting pipeline that propagates edits to all training views through depth-based projection. It augments realism under varying lighting with directional appearance variants and enforces consistency with depth-aware occlusion handling, while yielding a fast, robust multi-view segmentation as a byproduct. The method is demonstrated on NeRF and 3D-Gaussian Splatting backends, achieving superior cross-view coherence and image quality compared to strong baselines, as evidenced by quantitative metrics including PSNR, LPIPS, and FID. Training relies on a reconstruction loss over projected views, and the approach promises practical gains for RF editing in VR/AR and related applications.
Abstract
Radiance Fields (RFs) have emerged as a crucial technology for 3D scene representation, enabling the synthesis of novel views with remarkable realism. However, as RFs become more widely used, the need for effective editing techniques that maintain coherence across different perspectives becomes evident. Current methods primarily depend on per-frame 2D image inpainting, which often fails to maintain consistency across views, thus compromising the realism of edited RF scenes. In this work, we introduce a novel RF editing pipeline that significantly enhances consistency by requiring the inpainting of only a single reference image. This image is then projected across multiple views using a depth-based approach, effectively reducing the inconsistencies observed with per-frame inpainting. However, projections typically assume photometric consistency across views, which is often impractical in real-world settings. To accommodate realistic variations in lighting and viewpoint, our pipeline adjusts the appearance of the projected views by generating multiple directional variants of the inpainted image, thereby adapting to different photometric conditions. Additionally, we present an effective and robust multi-view object segmentation approach as a valuable byproduct of our pipeline. Extensive experiments demonstrate that our method significantly surpasses existing frameworks in maintaining content consistency across views and enhancing visual quality. More results are available at https://vulab-ai.github.io/View-consistent_Object_Removal_in_Radiance_Fields.
