Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar
Wouter Jansen, Jan Steckel
TL;DR
Approach addresses robust landmark-based SLAM in harsh environments where optical sensing fails. It uses a CNN that ingests cochleogram representations of echoes from the in-air eRTIS, jointly performing ten-way landmark classification and orientation regression. Key results show high test accuracy for landmark classification and an RMSE around 9.15 degrees for orientation, with near-100% detection of empty scenes. The work demonstrates that semantically defined acoustic landmarks can significantly improve SLAM reliability and autonomous navigation in challenging conditions.
Abstract
In challenging environments where traditional sensing modalities struggle, in-air sonar offers resilience to optical interference. Placing a priori known landmarks in these environments can eliminate accumulated errors in autonomous mobile systems such as Simultaneous Localization and Mapping (SLAM) and autonomous navigation. We present a novel approach using a convolutional neural network to detect and classify ten different reflector landmarks with varying radii using in-air 3D sonar. Additionally, the network predicts the orientation angle of the detected landmarks. The neural network is trained on cochleograms, representing echoes received by the sensor in a time-frequency domain. Experimental results in cluttered indoor settings show promising performance. The CNN achieves a 97.3% classification accuracy on the test dataset, accurately detecting both the presence and absence of landmarks. Moreover, the network predicts landmark orientation angles with an RMSE lower than 10 degrees, enhancing the utility in SLAM and autonomous navigation applications. This advancement improves the robustness and accuracy of autonomous systems in challenging environments.
