UM3: Unsupervised Map to Map Matching
Chaolong Ying, Yinan Zhang, Lei Zhang, Jiazhuang Wang, Shujun Jia, Tianshu Yu
TL;DR
UM3 tackles map-to-map matching without labeled data by introducing pseudo coordinates that capture the relative spatial layout of nodes, and by unifying feature-based and geometry-based similarities within an unsupervised optimization framework. A geometric-consistent loss together with a tile-based extension enables scalable, boundary-coherent matching on large-scale maps. The approach employs a GNN backbone to learn node embeddings and uses Sinkhorn normalization with a Hungarian post-processing step to produce hard correspondences, achieving state-of-the-art accuracy and robust performance under noise. This work offers a practical, scalable solution for integrating heterogeneous geospatial datasets and has significant implications for map alignment, navigation, and urban analytics.
Abstract
Map-to-map matching is a critical task for aligning spatial data across heterogeneous sources, yet it remains challenging due to the lack of ground truth correspondences, sparse node features, and scalability demands. In this paper, we propose an unsupervised graph-based framework that addresses these challenges through three key innovations. First, our method is an unsupervised learning approach that requires no training data, which is crucial for large-scale map data where obtaining labeled training samples is challenging. Second, we introduce pseudo coordinates that capture the relative spatial layout of nodes within each map, which enhances feature discriminability and enables scale-invariant learning. Third, we design an mechanism to adaptively balance feature and geometric similarity, as well as a geometric-consistent loss function, ensuring robustness to noisy or incomplete coordinate data. At the implementation level, to handle large-scale maps, we develop a tile-based post-processing pipeline with overlapping regions and majority voting, which enables parallel processing while preserving boundary coherence. Experiments on real-world datasets demonstrate that our method achieves state-of-the-art accuracy in matching tasks, surpassing existing methods by a large margin, particularly in high-noise and large-scale scenarios. Our framework provides a scalable and practical solution for map alignment, offering a robust and efficient alternative to traditional approaches.
