OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions
Maxim Popov, Regina Kurkova, Mikhail Iumanov, Jaafar Mahmoud, Sergey Kolyubin
TL;DR
OSMa-Bench tackles robustness evaluation of open semantic mapping under indoor lighting variability. It introduces a dynamic, automated benchmarking pipeline powered by LLM/LVLMs and Habitat-Sim simulations, and extends ReplicaCAD and HM3D with per-instance semantics and lighting variations. The study benchmarks OpenScene, ConceptGraphs, and BBQ, using semantic segmentation metrics and a novel scene-graph VQA evaluation to quantify performance and reasoning under different lighting. Findings reveal distinct strengths among methods and reveal systematic failure modes under low light and dynamic illumination, guiding future development of resilient OSM systems. The framework is designed for scalability and extension to additional capabilities and environments.
Abstract
Open Semantic Mapping (OSM) is a key technology in robotic perception, combining semantic segmentation and SLAM techniques. This paper introduces a dynamically configurable and highly automated LLM/LVLM-powered pipeline for evaluating OSM solutions called OSMa-Bench (Open Semantic Mapping Benchmark). The study focuses on evaluating state-of-the-art semantic mapping algorithms under varying indoor lighting conditions, a critical challenge in indoor environments. We introduce a novel dataset with simulated RGB-D sequences and ground truth 3D reconstructions, facilitating the rigorous analysis of mapping performance across different lighting conditions. Through experiments on leading models such as ConceptGraphs, BBQ, and OpenScene, we evaluate the semantic fidelity of object recognition and segmentation. Additionally, we introduce a Scene Graph evaluation method to analyze the ability of models to interpret semantic structure. The results provide insights into the robustness of these models, forming future research directions for developing resilient and adaptable robotic systems. Project page is available at https://be2rlab.github.io/OSMa-Bench/.
