View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV
Deyi Ji, Lanyun Zhu, Siqi Gao, Qi Zhu, Yiru Zhao, Peng Xu, Yue Ding, Hongtao Lu, Jieping Ye, Feng Wu, Feng Zhao
TL;DR
HomView-MOT tackles multi-object tracking in moving UAV scenarios by integrating scene homography with a fast estimation step, view-centric identity learning, and homographic cross-frame matching. The Fast Homography Estimation provides efficient view projection, while View-Centric ID Learning and the Homographic Matching Filter robustly handle view changes and camera motion, yielding state-of-the-art results on VisDrone2019 and UAVDT. The approach demonstrates strong generalization through geometry-based compensation and cross-view ID features, offering practical gains for UAV-based tracking in dynamic environments. Overall, the framework combines geometric view alignment with learned cross-view ID representations to advance MOT under moving-camera conditions.
Abstract
In this paper, we address the challenge of Multi-Object Tracking (MOT) in moving Unmanned Aerial Vehicle (UAV) scenarios, where irregular flight trajectories, such as hovering, turning left/right, and moving up/down, lead to significantly greater complexity compared to fixed-camera MOT. Specifically, changes in the scene background not only render traditional frame-to-frame object IoU association methods ineffective but also introduce significant view shifts in the objects, which complicates tracking. To overcome these issues, we propose a novel HomView-MOT framework, which for the first time, harnesses the view homography inherent in changing scenes to solve MOT challenges in moving environments, incorporating homographic matching and view-centric concepts. We introduce a Fast Homography Estimation (FHE) algorithm for rapid computation of homography matrices between video frames, enabling object View-Centric ID Learning (VCIL) and leveraging multi-view homography to learn cross-view ID features. Concurrently, our Homographic Matching Filter (HMF) maps object bounding boxes from different frames onto a common view plane for a more realistic physical IoU association. Extensive experiments have proven that these innovations allow HomView-MOT to achieve state-of-the-art performance on prominent UAV MOT datasets VisDrone and UAVDT.
