MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping

Zhihao Cao; Hanyu Wu; Li Wa Tang; Zizhou Luo; Zihan Zhu; Wei Zhang; Marc Pollefeys; Martin R. Oswald

MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping

Zhihao Cao, Hanyu Wu, Li Wa Tang, Zizhou Luo, Zihan Zhu, Wei Zhang, Marc Pollefeys, Martin R. Oswald

TL;DR

MCGS-SLAM addresses the limitations of monocular dense SLAM by leveraging synchronized RGB inputs from a calibrated multi-camera rig and a 3D Gaussian Splatting map. It introduces MCBA to jointly optimize camera poses and dense depths across views, and JDSA to enforce metric-scale consistency, all within a differentiable Gaussian-mapping and rendering framework that includes an offline global refinement. The approach yields high-fidelity, photorealistic reconstructions and accurate trajectories, benefiting from wide-field observations that reveal side-view structures otherwise occluded in single-camera setups. Evaluations on Waymo, Oxford Spires, and AirSim demonstrate robust real-time performance, superior geometry and appearance fidelity, and improved coverage essential for safe autonomous operation.

Abstract

Recent progress in dense SLAM has primarily targeted monocular setups, often at the expense of robustness and geometric coverage. We present MCGS-SLAM, the first purely RGB-based multi-camera SLAM system built on 3D Gaussian Splatting (3DGS). Unlike prior methods relying on sparse maps or inertial data, MCGS-SLAM fuses dense RGB inputs from multiple viewpoints into a unified, continuously optimized Gaussian map. A multi-camera bundle adjustment (MCBA) jointly refines poses and depths via dense photometric and geometric residuals, while a scale consistency module enforces metric alignment across views using low-rank priors. The system supports RGB input and maintains real-time performance at large scale. Experiments on synthetic and real-world datasets show that MCGS-SLAM consistently yields accurate trajectories and photorealistic reconstructions, usually outperforming monocular baselines. Notably, the wide field of view from multi-camera input enables reconstruction of side-view regions that monocular setups miss, critical for safe autonomous operation. These results highlight the promise of multi-camera Gaussian Splatting SLAM for high-fidelity mapping in robotics and autonomous driving.

MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping

TL;DR

Abstract

MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)