Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM
Shouhe Zhang, Dayong Ren, Sensen Song, Yurong Qian, Zhenhong Jia
TL;DR
This work tackles illumination sensitivity in 3D Gaussian Splatting SLAM by introducing a proactive Intrinsic Appearance Normalization (IAN) that quantizes albedo to a canonical palette, and a reactive Dynamic Radiance Balancing Loss (DRB-Loss) that adaptively corrects extreme exposure frames. By jointly optimizing geometry, appearance, and semantics with differentiable Gaussian rendering, the approach achieves robust camera tracking, high-fidelity maps, and accurate semantic labeling under varied lighting. Experiments on Replica and ScanNet demonstrate state-of-the-art tracking, rendering quality, and mIoU, highlighting the method's practical impact for real-world robotics and AR. The proposed combination of proactive normalization and reactive correction provides a significant advance toward illumination-invariant 3D semantic mapping.
Abstract
Extreme exposure degrades both the 3D map reconstruction and semantic segmentation accuracy, which is particularly detrimental to tightly-coupled systems. To achieve illumination invariance, we propose a novel semantic SLAM framework with two designs. First, the Intrinsic Appearance Normalization (IAN) module proactively disentangles the scene's intrinsic properties, such as albedo, from transient lighting. By learning a standardized, illumination-invariant appearance model, it assigns a stable and consistent color representation to each Gaussian primitive. Second, the Dynamic Radiance Balancing Loss (DRB-Loss) reactively handles frames with extreme exposure. It activates only when an image's exposure is poor, operating directly on the radiance field to guide targeted optimization. This prevents error accumulation from extreme lighting without compromising performance under normal conditions. The synergy between IAN's proactive invariance and DRB-Loss's reactive correction endows our system with unprecedented robustness. Evaluations on public datasets demonstrate state-of-the-art performance in camera tracking, map quality, and semantic and geometric accuracy.
