BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling
Cheng Peng, Yutao Tang, Yifan Zhou, Nengyu Wang, Xijun Liu, Deming Li, Rama Chellappa
TL;DR
This work tackles the fragility of 3D Gaussian Splatting under real-world blur by introducing Blur Agnostic Gaussian Splatting (BAGS). BAGS adds a Blur Proposal Network to estimate per-pixel blur kernels and a per-pixel mask, and employs a coarse-to-fine optimization across scales to stabilize joint 3D scene optimization with 2D degradation modeling. The method leverages RGBD-aware, multi-modal features to disentangle blur from geometry, achieving state-of-the-art photorealistic renderings across camera motion, defocus, and mixed-resolution scenarios, including unbounded 360 drone data. The approach yields interpretable blur kernels and region masks, enabling both robust reconstruction in degraded imagery and practical analysis of blur patterns.
Abstract
Recent efforts in using 3D Gaussians for scene reconstruction and novel view synthesis can achieve impressive results on curated benchmarks; however, images captured in real life are often blurry. In this work, we analyze the robustness of Gaussian-Splatting-based methods against various image blur, such as motion blur, defocus blur, downscaling blur, \etc. Under these degradations, Gaussian-Splatting-based methods tend to overfit and produce worse results than Neural-Radiance-Field-based methods. To address this issue, we propose Blur Agnostic Gaussian Splatting (BAGS). BAGS introduces additional 2D modeling capacities such that a 3D-consistent and high quality scene can be reconstructed despite image-wise blur. Specifically, we model blur by estimating per-pixel convolution kernels from a Blur Proposal Network (BPN). BPN is designed to consider spatial, color, and depth variations of the scene to maximize modeling capacity. Additionally, BPN also proposes a quality-assessing mask, which indicates regions where blur occur. Finally, we introduce a coarse-to-fine kernel optimization scheme; this optimization scheme is fast and avoids sub-optimal solutions due to a sparse point cloud initialization, which often occurs when we apply Structure-from-Motion on blurry images. We demonstrate that BAGS achieves photorealistic renderings under various challenging blur conditions and imaging geometry, while significantly improving upon existing approaches.
