GL-DT: Multi-UAV Detection and Tracking with Global-Local Integration
Juanqin Liu, Leonardo Plotegher, Eloy Roura, Shaoming He
TL;DR
GL-DT tackles real-time multi-UAV tracking in challenging aerial scenes by jointly modeling motion and appearance through a Spatio-Temporal Feature Fusion (STFF) and a frame-level global–local detection strategy. The framework combines Global Detection with an enhanced AM-YOLO backbone and Local Detection on ROI trajectories, coupled with the JPTrack tracker that uses JCMA for robust data association and PMR to recover trajectories after short-term occlusions. Experiments on MOT-FLY and FT datasets show state-of-the-art performance in ID continuity and localization accuracy (e.g., high IDF1 and MOTA) while maintaining real-time speed on both PC (≈124 FPS with TensorRT) and embedded platforms (≈25 FPS on Xavier NX). These results underscore GL-DT’s practical viability for UAV-based surveillance and autonomous navigation, especially in scenarios with small, cluttered, and interacting drones.
Abstract
The extensive application of unmanned aerial vehicles (UAVs) in military reconnaissance, environmental monitoring, and related domains has created an urgent need for accurate and efficient multi-object tracking (MOT) technologies, which are also essential for UAV situational awareness. However, complex backgrounds, small-scale targets, and frequent occlusions and interactions continue to challenge existing methods in terms of detection accuracy and trajectory continuity. To address these issues, this paper proposes the Global-Local Detection and Tracking (GL-DT) framework. It employs a Spatio-Temporal Feature Fusion (STFF) module to jointly model motion and appearance features, combined with a global-local collaborative detection strategy, effectively enhancing small-target detection. Building upon this, the JPTrack tracking algorithm is introduced to mitigate common issues such as ID switches and trajectory fragmentation. Experimental results demonstrate that the proposed approach significantly improves the continuity and stability of MOT while maintaining real-time performance, providing strong support for the advancement of UAV detection and tracking technologies.
