Deblur Gaussian Splatting SLAM
Francesco Girlanda, Denys Rozumnyi, Marc Pollefeys, Martin R. Oswald
TL;DR
Deblur-SLAM addresses the challenge of reconstructing sharp 3D scenes from motion-blurred RGB sequences by explicitly modeling the image formation during exposure. It unifies frame-to-frame tracking with a frame-to-model tracker on a deformable 3D Gaussian representation, rendering sharp sub-frames and matching their average blur to observations. Online loop closure and global bundle adjustment ensure global consistency, aided by a monocular depth estimator for robust map initialization. Empirically, it achieves state-of-the-art sub-frame trajectory recovery and sharp map estimates on synthetic Replica data and real-world datasets (ScanNet, TUM-RGBD, ETH3D), outperforming baselines in PSNR, SSIM, LPIPS, and ATE. This approach broadens RGB-only SLAM to highly blurred sequences, enabling robust, high-fidelity mapping for robotics and AR/VR in motion-rich environments.
Abstract
We present Deblur-SLAM, a robust RGB SLAM pipeline designed to recover sharp reconstructions from motion-blurred inputs. The proposed method bridges the strengths of both frame-to-frame and frame-to-model approaches to model sub-frame camera trajectories that lead to high-fidelity reconstructions in motion-blurred settings. Moreover, our pipeline incorporates techniques such as online loop closure and global bundle adjustment to achieve a dense and precise global trajectory. We model the physical image formation process of motion-blurred images and minimize the error between the observed blurry images and rendered blurry images obtained by averaging sharp virtual sub-frame images. Additionally, by utilizing a monocular depth estimator alongside the online deformation of Gaussians, we ensure precise mapping and enhanced image deblurring. The proposed SLAM pipeline integrates all these components to improve the results. We achieve state-of-the-art results for sharp map estimation and sub-frame trajectory recovery both on synthetic and real-world blurry input data.
