Table of Contents
Fetching ...

UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization

Wenjia Xu, Yaxuan Yao, Jiaqi Cao, Zhiwei Wei, Chunbo Liu, Jiuniu Wang, Mugen Peng

TL;DR

This work addresses UAV localization under GNSS-denied conditions by introducing UAV-VisLoc, a large-scale dataset that links ground-down drone imagery to high-resolution ortho-satellite maps. The dataset comprises 6,742 drone images and 11 Google Earth satellite maps from 11 locations in China, captured by fixed-wing and multi-terrain drones at altitudes $h \in [400,2000]$ m and headings $\Phi \in [-180\circ,180\circ]$, with rich per-image metadata. It fills gaps in prior benchmarks that were limited in geographic scope, altitude, or scene diversity, enabling robust cross-view localization research across varied terrains. The dataset aims to facilitate training and evaluation for GNSS-denied UAV navigation by providing diverse, large-scale ground-to-satellite map correspondences that support learning-based localization methods.

Abstract

The application of unmanned aerial vehicles (UAV) has been widely extended recently. It is crucial to ensure accurate latitude and longitude coordinates for UAVs, especially when the global navigation satellite systems (GNSS) are disrupted and unreliable. Existing visual localization methods achieve autonomous visual localization without error accumulation by matching the ground-down view image of UAV with the ortho satellite maps. However, collecting UAV ground-down view images across diverse locations is costly, leading to a scarcity of large-scale datasets for real-world scenarios. Existing datasets for UAV visual localization are often limited to small geographic areas or are focused only on urban regions with distinct textures. To address this, we define the UAV visual localization task by determining the UAV's real position coordinates on a large-scale satellite map based on the captured ground-down view. In this paper, we present a large-scale dataset, UAV-VisLoc, to facilitate the UAV visual localization task. This dataset comprises images from diverse drones across 11 locations in China, capturing a range of topographical features. The dataset features images from fixed-wing drones and multi-terrain drones, captured at different altitudes and orientations. Our dataset includes 6,742 drone images and 11 satellite maps, with metadata such as latitude, longitude, altitude, and capture date. Our dataset is tailored to support both the training and testing of models by providing a diverse and extensive data.

UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization

TL;DR

This work addresses UAV localization under GNSS-denied conditions by introducing UAV-VisLoc, a large-scale dataset that links ground-down drone imagery to high-resolution ortho-satellite maps. The dataset comprises 6,742 drone images and 11 Google Earth satellite maps from 11 locations in China, captured by fixed-wing and multi-terrain drones at altitudes m and headings , with rich per-image metadata. It fills gaps in prior benchmarks that were limited in geographic scope, altitude, or scene diversity, enabling robust cross-view localization research across varied terrains. The dataset aims to facilitate training and evaluation for GNSS-denied UAV navigation by providing diverse, large-scale ground-to-satellite map correspondences that support learning-based localization methods.

Abstract

The application of unmanned aerial vehicles (UAV) has been widely extended recently. It is crucial to ensure accurate latitude and longitude coordinates for UAVs, especially when the global navigation satellite systems (GNSS) are disrupted and unreliable. Existing visual localization methods achieve autonomous visual localization without error accumulation by matching the ground-down view image of UAV with the ortho satellite maps. However, collecting UAV ground-down view images across diverse locations is costly, leading to a scarcity of large-scale datasets for real-world scenarios. Existing datasets for UAV visual localization are often limited to small geographic areas or are focused only on urban regions with distinct textures. To address this, we define the UAV visual localization task by determining the UAV's real position coordinates on a large-scale satellite map based on the captured ground-down view. In this paper, we present a large-scale dataset, UAV-VisLoc, to facilitate the UAV visual localization task. This dataset comprises images from diverse drones across 11 locations in China, capturing a range of topographical features. The dataset features images from fixed-wing drones and multi-terrain drones, captured at different altitudes and orientations. Our dataset includes 6,742 drone images and 11 satellite maps, with metadata such as latitude, longitude, altitude, and capture date. Our dataset is tailored to support both the training and testing of models by providing a diverse and extensive data.
Paper Structure (6 sections, 2 figures, 2 tables)

This paper contains 6 sections, 2 figures, 2 tables.

Figures (2)

  • Figure 1: The dataset collecting process. The red point in the coordinate system represents the projection of the drone's current location on the ground, i.e., the center point of the image taken by the drone. The yellow points represent the satellite map boundaries of the entire flight range.
  • Figure 2: An example of drone images and satellite map. The red dots in satellite map represent the center points of drone images. The satellite map encompasses various terrains such as cities, towns, farms, and rivers. We also show the drone images of these terrains.