SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization
Hanjun Kim, Minwoo Jung, Wooseong Yang, Ayoung Kim
TL;DR
SHeRLoc tackles cross-modal localization across heterogeneous radar types by transforming data into synchronized RCS polar BEV representations and learning rotation-invariant, cross-modal embeddings. It introduces HOLMES, a hierarchical optimal-transport-based descriptor that fuses local RCS patterns with global context under an adaptive entropy-regularized Sinkhorn framework, and couples it with FoV-aware FFT-based data mining and an adaptive margin triplet loss. The approach yields dramatic gains on a public heterogeneous radar dataset, raising recall@1 from below $0.1$ to $0.9$, and demonstrates strong zero-shot generalization and cross-modal applicability to LiDAR. This work enables robust cross-modal place recognition and paves the way for heterogeneous sensor SLAM, with open-source code to accelerate community adoption.
Abstract
Despite the growing adoption of radar in robotics, the majority of research has been confined to homogeneous sensor types, overlooking the integration and cross-modality challenges inherent in heterogeneous radar technologies. This leads to significant difficulties in generalizing across diverse radar data types, with modality-aware approaches that could leverage the complementary strengths of heterogeneous radar remaining unexplored. To bridge these gaps, we propose SHeRLoc, the first deep network tailored for heterogeneous radar, which utilizes RCS polar matching to align multimodal radar data. Our hierarchical optimal transport-based feature aggregation method generates rotationally robust multi-scale descriptors. By employing FFT-similarity-based data mining and adaptive margin-based triplet loss, SHeRLoc enables FOV-aware metric learning. SHeRLoc achieves an order of magnitude improvement in heterogeneous radar place recognition, increasing recall@1 from below 0.1 to 0.9 on a public dataset and outperforming state of-the-art methods. Also applicable to LiDAR, SHeRLoc paves the way for cross-modal place recognition and heterogeneous sensor SLAM. The supplementary materials and source code are available at https://sites.google.com/view/radar-sherloc.
