UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization
Rouwan Wu, Xiaoya Cheng, Juelin Zhu, Xuxiang Liu, Maojun Zhang, Shen Yan
TL;DR
UAVD4L provides a large-scale, GPS-denied UAV localization benchmark with a world-aligned textured $3$D model and accurate $6$-DoF ground truth, enabling offline synthetic data generation and online visual localization. The authors introduce a two-stage UAVLoc pipeline that uses synthetic renders and rotation priors to constrain retrieval and a gravity-guided PnP RANSAC for robust pose estimation, followed by a hierarchical system for ground-target tracking using a wide-angle and a zoom camera with DEM-based projection. Empirical results show strong performance in image retrieval, 6-DoF localization, and target tracking, with ablations confirming benefits from multi-layer rendering and sensor priors. The dataset and code are released to advance research in GPS-denied UAV perception, navigation, and 3D target tracking. Overall, UAVD4L bridges gaps in scale, viewpoint diversity, GT accuracy, and sensor integration for airborne visual localization."
Abstract
Despite significant progress in global localization of Unmanned Aerial Vehicles (UAVs) in GPS-denied environments, existing methods remain constrained by the availability of datasets. Current datasets often focus on small-scale scenes and lack viewpoint variability, accurate ground truth (GT) pose, and UAV build-in sensor data. To address these limitations, we introduce a large-scale 6-DoF UAV dataset for localization (UAVD4L) and develop a two-stage 6-DoF localization pipeline (UAVLoc), which consists of offline synthetic data generation and online visual localization. Additionally, based on the 6-DoF estimator, we design a hierarchical system for tracking ground target in 3D space. Experimental results on the new dataset demonstrate the effectiveness of the proposed approach. Code and dataset are available at https://github.com/RingoWRW/UAVD4L
