AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting
Chung-Ho Wu, Yang-Jung Chen, Ying-Huan Chen, Jie-Ying Lee, Bo-Hsu Ke, Chun-Wei Tuan Mu, Yi-Chuan Huang, Chin-Yang Lin, Min-Hung Chen, Yen-Yu Lin, Yu-Lun Liu
TL;DR
AuraFusion360 tackles 360° unbounded scene inpainting by marrying explicit 3D Gaussian Splatting with diffusion-based 2D inpainting guided by a reference view. The approach introduces depth-aware unseen mask generation, Adaptive Guided Depth Diffusion (AGDD), and SDEdit-based RGB guidance to ensure multi-view consistency and geometric fidelity across large viewpoint changes. A new 360-USID dataset with ground-truth novel views enables rigorous evaluation, and experiments show superior perceptual quality (lower LPIPS) and higher PSNR compared with state-of-the-art methods. This framework enables robust, reference-guided 3D inpainting for VR/AR and architectural visualization, with potential extensions to efficiency, dynamic scenes, and language-guided editing.
Abstract
Three-dimensional scene inpainting is crucial for applications from virtual reality to architectural visualization, yet existing methods struggle with view consistency and geometric accuracy in 360° unbounded scenes. We present AuraFusion360, a novel reference-based method that enables high-quality object removal and hole filling in 3D scenes represented by Gaussian Splatting. Our approach introduces (1) depth-aware unseen mask generation for accurate occlusion identification, (2) Adaptive Guided Depth Diffusion, a zero-shot method for accurate initial point placement without requiring additional training, and (3) SDEdit-based detail enhancement for multi-view coherence. We also introduce 360-USID, the first comprehensive dataset for 360° unbounded scene inpainting with ground truth. Extensive experiments demonstrate that AuraFusion360 significantly outperforms existing methods, achieving superior perceptual quality while maintaining geometric accuracy across dramatic viewpoint changes.
