Scale-adaptive UAV Geo-localization via Height-aware Partition Learning
Quan Chen, Tingyu Wang, Rongfeng Lu, Yu Liu, Bolun Zheng, Zhedong Zheng
TL;DR
This work tackles scale mismatch in UAV-to-satellite geo-localization by introducing SaLPN, a scale-adaptive partition framework that uses a height-derived factor $\theta$ to adjust drone-view partitions via HAAS, paired with a saliency-guided refinement (SGRS) to produce robust global, salient, and background descriptors. A square partition strategy (SPS) enables simultaneous capture of fine-grained and global information, and three classifier branches supervise the part-level features in a shared embedding space. Extensive experiments on University-1652 and SUES-200 show state-of-the-art performance and strong robustness to cross-view scale variations, with ablations confirming the effectiveness of SPS, HAAS, and SGRS across ResNet-50 and ViT backbones. The approach advances GNSS-denied UAV geo-localization by enabling explicit semantic alignment across views under varying drone heights, with practical impact for reliable cross-view retrieval in real-world deployment.
Abstract
UAV Geo-Localization faces significant challenges due to the drastic appearance discrepancy between dronecaptured images and satellite views. Existing methods typically assume a consistent scaling factor across views and rely on predefined partition alignment to extract viewpoint-invariant representations through part-level feature construction. However, this scaling assumption often fails in real-world scenarios, where variations in drone flight states lead to scale mismatches between cross-view images, resulting in severe performance degradation. To address this issue, we propose a scale-adaptive partition learning framework that leverages known drone flight height to predict scale factors and dynamically adjust feature extraction. Our key contribution is a height-aware adjustment strategy, which calculates the relative height ratio between drone and satellite views, dynamically adjusting partition sizes to explicitly align semantic information between partition pairs. This strategy is integrated into a Scale-adaptive Local Partition Network (SaLPN), building upon an existing square partition strategy to extract both finegrained and global features. Additionally, we propose a saliencyguided refinement strategy to enhance part-level features, further improving retrieval accuracy. Extensive experiments validate that our height-aware, scale-adaptive approach achieves stateof-the-art geo-localization accuracy in various scale-inconsistent scenarios and exhibits strong robustness against scale variations. The code will be made publicly available.
