SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM
Siting Zhu, Renjie Qin, Guangming Wang, Jiuming Liu, Hesheng Wang
TL;DR
SemGauss-SLAM addresses the challenge of dense semantic SLAM by embedding semantic features into a 3D Gaussian splatting framework. It introduces a feature-level loss and semantic-informed bundle adjustment to enable robust multi-view optimization of both camera poses and the 3D semantic map. The method achieves superior mapping, tracking, and semantic reconstruction on Replica and ScanNet compared with radiance-field-based SLAM baselines, and delivers high-quality novel-view semantic segmentation. This work advances dense semantic understanding in unbounded 3D spaces with efficient differentiable rendering and multi-frame semantic constraints.
Abstract
We propose SemGauss-SLAM, a dense semantic SLAM system utilizing 3D Gaussian representation, that enables accurate 3D semantic mapping, robust camera tracking, and high-quality rendering simultaneously. In this system, we incorporate semantic feature embedding into 3D Gaussian representation, which effectively encodes semantic information within the spatial layout of the environment for precise semantic scene representation. Furthermore, we propose feature-level loss for updating 3D Gaussian representation, enabling higher-level guidance for 3D Gaussian optimization. In addition, to reduce cumulative drift in tracking and improve semantic reconstruction accuracy, we introduce semantic-informed bundle adjustment. By leveraging multi-frame semantic associations, this strategy enables joint optimization of 3D Gaussian representation and camera poses, resulting in low-drift tracking and accurate semantic mapping. Our SemGauss-SLAM demonstrates superior performance over existing radiance field-based SLAM methods in terms of mapping and tracking accuracy on Replica and ScanNet datasets, while also showing excellent capabilities in high-precision semantic segmentation and dense semantic mapping.
