FairMOT: On the Fairness of Detection and Re-Identification in Multiple Object Tracking
Yifu Zhang, Chunyu Wang, Xinggang Wang, Wenjun Zeng, Wenyu Liu
TL;DR
FairMOT tackles the fairness conflict between object detection and re-ID in MOT by introducing an anchor-free, center-based single-shot network built on CenterNet. By identifying and mitigating three unfairness factors—anchor-based sampling, shared features, and high re-ID dimensionality—the approach achieves balanced, high-accuracy detection and tracking. It uses two homogeneous branches (detection and re-ID) trained with uncertainty-based loss balancing and a lightweight single-image pre-training strategy for data efficiency. Empirical results on MOT Challenge datasets show state-of-the-art performance and real-time inference, highlighting the practical benefits of carefully balancing multi-task learning in MOT.
Abstract
Multi-object tracking (MOT) is an important problem in computer vision which has a wide range of applications. Formulating MOT as multi-task learning of object detection and re-ID in a single network is appealing since it allows joint optimization of the two tasks and enjoys high computation efficiency. However, we find that the two tasks tend to compete with each other which need to be carefully addressed. In particular, previous works usually treat re-ID as a secondary task whose accuracy is heavily affected by the primary detection task. As a result, the network is biased to the primary detection task which is not fair to the re-ID task. To solve the problem, we present a simple yet effective approach termed as FairMOT based on the anchor-free object detection architecture CenterNet. Note that it is not a naive combination of CenterNet and re-ID. Instead, we present a bunch of detailed designs which are critical to achieve good tracking results by thorough empirical studies. The resulting approach achieves high accuracy for both detection and tracking. The approach outperforms the state-of-the-art methods by a large margin on several public datasets. The source code and pre-trained models are released at https://github.com/ifzhang/FairMOT.
