RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting
Zhenzhong Cao, Chenyang Zhao, Qianyi Zhang, Jinzheng Guang, Yinuo Song Jingtai Liu
TL;DR
RGBDS-SLAM tackles the challenge of obtaining high-fidelity RGB-D semantic reconstructions in dense SLAM by coupling 3D Multi-Level Pyramid Gaussian Splatting with a tightly integrated multi-feature optimization. The method extends ORB-SLAM3 with a four-thread pipeline and represents the scene using 3D Gaussian primitives projected via learned pyramids to jointly refine RGB, depth, and semantic maps. Experiments on Replica and ScanNet show state-of-the-art RGB-D reconstruction quality and semantic accuracy while maintaining real-time performance, validated by ablations that confirm the contribution of the ML-P-GS module and the cross-feature optimization. The approach advances dense SLAM by enabling detailed, semantically aware reconstructions in real time, with open-source code to facilitate adoption, though dynamic scenes remain an area for future work.
Abstract
High-quality reconstruction is crucial for dense SLAM. Recent popular approaches utilize 3D Gaussian Splatting (3D GS) techniques for RGB, depth, and semantic reconstruction of scenes. However, these methods often overlook issues of detail and consistency in different parts of the scene. To address this, we propose RGBDS-SLAM, a RGB-D semantic dense SLAM system based on 3D multi-level pyramid gaussian splatting, which enables high-quality dense reconstruction of scene RGB, depth, and semantics.In this system, we introduce a 3D multi-level pyramid gaussian splatting method that restores scene details by extracting multi-level image pyramids for gaussian splatting training, ensuring consistency in RGB, depth, and semantic reconstructions. Additionally, we design a tightly-coupled multi-features reconstruction optimization mechanism, allowing the reconstruction accuracy of RGB, depth, and semantic maps to mutually enhance each other during the rendering optimization process. Extensive quantitative, qualitative, and ablation experiments on the Replica and ScanNet public datasets demonstrate that our proposed method outperforms current state-of-the-art methods. The open-source code will be available at: https://github.com/zhenzhongcao/RGBDS-SLAM.
