MM-Tracker: Motion Mamba with Margin Loss for UAV-platform Multiple Object Tracking
Mufeng Yao, Jinlong Peng, Qingdong He, Bo Peng, Hao Chen, Mingmin Chi, Chao Liu, Jon Atli Benediktsson
TL;DR
MM-Tracker tackles UAV-based multi-object tracking by addressing both local object motion and global camera motion, as well as motion blur. It introduces Motion Mamba, a lightweight module that fuses local cross-correlation with bi-directional global scanning via vertical and horizontal state-space models to generate a motion map from bi-temporal detections. It also proposes Motion Margin Loss to create motion-aware decision boundaries, improving detection of fast-moving, motion-blurred objects using ground-truth motion maps derived from optical flow. Together, Motion Mamba and Motion Margin Loss yield state-of-the-art MOTA and IDF1 on Visdrone and UAVDT with fast inference, demonstrating practical benefits for UAV-MOT applications. The work advances UAV tracking by integrating efficient global motion modeling with motion-aware detector training, offering a compelling approach for real-time, robust object tracking from moving platforms.
Abstract
Multiple object tracking (MOT) from unmanned aerial vehicle (UAV) platforms requires efficient motion modeling. This is because UAV-MOT faces both local object motion and global camera motion. Motion blur also increases the difficulty of detecting large moving objects. Previous UAV motion modeling approaches either focus only on local motion or ignore motion blurring effects, thus limiting their tracking performance and speed. To address these issues, we propose the Motion Mamba Module, which explores both local and global motion features through cross-correlation and bi-directional Mamba Modules for better motion modeling. To address the detection difficulties caused by motion blur, we also design motion margin loss to effectively improve the detection accuracy of motion blurred objects. Based on the Motion Mamba module and motion margin loss, our proposed MM-Tracker surpasses the state-of-the-art in two widely open-source UAV-MOT datasets. Code will be available.
