VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM
Zihan Zhu, Wei Zhang, Norbert Haala, Marc Pollefeys, Daniel Barath
TL;DR
VIGS-SLAM targets robust dense SLAM by tightly integrating high-frequency IMU data with a learning-enhanced, Gaussian Splatting representation. The approach couples visual residuals with inertial terms, uses staged IMU initialization, and maintains loop closures via a Sim(3) pose graph, followed by efficient Gaussian map updates. Extensive experiments across indoor/outdoor, handheld/drone, and diverse datasets show state-of-the-art tracking accuracy and high-quality novel-view rendering, including resilience under motion blur, low texture, and frame-drop conditions. This work advances dense VI-SLAM by enabling real-time photorealistic mapping with robust loop closure and online parameter adaptation, making it particularly suitable for AR/VR and robotics applications.
Abstract
We present VIGS-SLAM, a visual-inertial 3D Gaussian Splatting SLAM system that achieves robust real-time tracking and high-fidelity reconstruction. Although recent 3DGS-based SLAM methods achieve dense and photorealistic mapping, their purely visual design degrades under motion blur, low texture, and exposure variations. Our method tightly couples visual and inertial cues within a unified optimization framework, jointly refining camera poses, depths, and IMU states. It features robust IMU initialization, time-varying bias modeling, and loop closure with consistent Gaussian updates. Experiments on four challenging datasets demonstrate our superiority over state-of-the-art methods. Project page: https://vigs-slam.github.io
