View Distribution Alignment with Progressive Adversarial Learning for UAV Visual Geo-Localization

Cuiwei Liu; Jiahao Liu; Huaijun Qiu; Zhaokui Li; Xiangbin Shi

TL;DR

The paper introduces PVDA, an end-to-end framework for UAV visual geo-localization that directly addresses the distribution gap between UAV-view and satellite-view images. It combines a shared ResNet-50 feature encoder, a multi-branch location classifier, and a view discriminator, trained with a progressive adversarial strategy that gradually emphasizes view-invariance while preserving location-discriminative power. The method achieves state-of-the-art results on the University-1652 dataset for both UAV-to-satellite and satellite-to-UAV tasks, with lower inference time than prior state-of-the-art methods and good scalability to unseen locations. The results suggest that distribution alignment is a practical tool for cross-view image retrieval in geo-localization applications.

Abstract

Unmanned Aerial Vehicle (UAV) visual geo-localization aims to match images of the same geographic target captured from different views, i.e., the UAV view and the satellite view. The task is challenging because of the large appearance differences between UAV and satellite images of the same target. Previous works map images captured by UAVs and satellites to a shared feature space and employ a classification framework to learn location-dependent features, but neglect the overall distribution shift between the UAV view and the satellite view. In this paper, we address this limitation by aligning the distributions of the two views to reduce their distance in a common space. Specifically, we propose an end-to-end network called PVDA (Progressive View Distribution Alignment). During training, the feature encoder, location classifier, and view discriminator are jointly optimized by a novel progressive adversarial learning strategy. Competition between the feature encoder and the view discriminator drives both to become stronger. The adversarial learning is progressively emphasized until UAV-view images are indistinguishable from satellite-view images. As a result, PVDA learns location-dependent yet view-invariant features that scale well to unseen images of new locations. Compared with state-of-the-art methods, PVDA requires less inference time while achieving superior performance on the University-1652 dataset.
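
The abstract says the adversarial learning is "progressively emphasized" but does not give the schedule. As a minimal sketch of the idea, the snippet below ramps an adversarial loss weight from 0 toward 1 over training. The sigmoid-shaped ramp is borrowed from domain-adversarial training (DANN) and is an assumption, not the paper's actual schedule. The names `adversarial_weight`, `encoder_objective`, `gamma`, and `max_weight` are hypothetical.

```python
import math


def adversarial_weight(step, total_steps, gamma=10.0, max_weight=1.0):
    """Ramp the adversarial weight smoothly from 0 toward max_weight.

    ASSUMPTION: DANN-style ramp 2 / (1 + exp(-gamma * p)) - 1, where p is the
    training progress in [0, 1]. The paper's own schedule may differ.
    total_steps must be positive.
    """
    progress = min(max(step / total_steps, 0.0), 1.0)
    return max_weight * (2.0 / (1.0 + math.exp(-gamma * progress)) - 1.0)


def encoder_objective(location_loss, view_loss, weight):
    """Combine the two loss terms seen by the feature encoder.

    The encoder minimizes the location classification loss while trying to
    increase the view discriminator's loss, so the view term enters with a
    negative sign scaled by the current adversarial weight.
    """
    return location_loss - weight * view_loss


if __name__ == "__main__":
    total = 1000
    for step in (0, 250, 500, 1000):
        w = adversarial_weight(step, total)
        print(f"step={step:4d}  adversarial weight={w:.3f}")
```

Early in training the weight is close to 0, so the encoder is driven almost entirely by location classification; later the view-confusion term dominates more strongly. The discriminator itself would be trained to minimize `view_loss` in the usual adversarial fashion.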

Paper Structure (17 sections, 4 equations, 4 figures, 3 tables)

Introduction
Related works
Method
Architecture of the proposed PVDA
Feature encoder.
Location classifier.
View discriminator.
Progressive adversarial learning strategy
Cross-view image matching
Experiments
Dataset and experimental settings
Experimental results
Comparison to the state-of-the-arts.
Multi-query image matching.
Visualization of UAV visual geo-localization results.
...and 2 more sections

Figures (4)

Figure 1: Overall framework of our method. In this exemplar, a feature encoder, a 701-way location classifier, and a 2-way view discriminator are trained on images of 701 buildings. The feature encoder takes images resized to $256 \times 256 \times 3$ as input and outputs four 512-dimensional vectors for each image.
Figure 2: Architecture of the view discriminator.
Figure 3: Top-5 retrieved satellite-view images in the UAV-view target localization task. The first and second rows display the matching results with a single query, while the third row employs multiple UAV-view images as queries. The true-matched satellite-view images are annotated with green borders.
Figure 4: Top-5 retrieved UAV-view images in the UAV navigation task. The true-matched UAV-view images are annotated with green borders.
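
The captions above describe top-5 retrieval with single and multiple queries, and Figure 1 states that the encoder outputs four 512-dimensional vectors per image. The following sketch shows one plausible way such a retrieval step could work. It assumes that the part vectors are concatenated into one descriptor, that similarity is the cosine similarity, and that multiple queries are fused by averaging their descriptors. None of these choices is confirmed by the text here. All function names are hypothetical, and the toy vectors are 2-dimensional only to keep the example short.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def build_descriptor(part_vectors):
    """ASSUMPTION: concatenate the per-branch vectors into one unit descriptor."""
    flat = [x for part in part_vectors for x in part]
    return l2_normalize(flat)


def mean_descriptor(descriptors):
    """ASSUMPTION: fuse multiple query images by averaging their descriptors."""
    n = len(descriptors)
    averaged = [sum(column) / n for column in zip(*descriptors)]
    return l2_normalize(averaged)


def rank_gallery(query, gallery, top_k=5):
    """Return the top_k gallery names sorted by cosine similarity to the query.

    Descriptors are unit length, so the dot product equals cosine similarity.
    """
    scored = [
        (sum(q * g for q, g in zip(query, descriptor)), name)
        for name, descriptor in gallery.items()
    ]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [name for _, name in scored[:top_k]]


# Toy satellite-view gallery: two branches of 2-d vectors per image.
gallery = {
    "building_a": build_descriptor([[1.0, 0.0], [0.0, 0.0]]),
    "building_b": build_descriptor([[0.0, 1.0], [0.0, 0.0]]),
    "building_c": build_descriptor([[0.0, 0.0], [1.0, 0.0]]),
}

# Single UAV-view query, then a fused multi-query descriptor.
uav_query_1 = build_descriptor([[0.9, 0.1], [0.0, 0.0]])
uav_query_2 = build_descriptor([[0.8, 0.0], [0.2, 0.0]])
single_ranking = rank_gallery(uav_query_1, gallery)
multi_ranking = rank_gallery(mean_descriptor([uav_query_1, uav_query_2]), gallery)

if __name__ == "__main__":
    print("single query:", single_ranking)
    print("multi query: ", multi_ranking)
```

Swapping the roles of query and gallery gives the satellite-to-UAV direction shown in Figure 4.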
