A Comprehensive Study of Object Tracking in Low-Light Environments
Anqi Yi, Nantheera Anantrasirichai
TL;DR
The paper tackles object tracking in low-light conditions by studying how distortions such as noise, color imbalance, and low contrast degrade performance and proposing a preprocessing pipeline that combines SUNet denoising and EnlightenGAN enhancement within a transformer-based MixFormer tracker. By training on synthetically dark data and leveraging a Mixed Attention Module (MAM) with an online Score Prediction Module (SPM), the approach achieves superior tracking accuracy over vanilla MixFormer and Siam R-CNN. Key findings show that denoising yields larger gains than enhancement and that training with diverse low-light features improves robustness across varying conditions, with quantified AUC gains when applying the preprocessing at train/test stages. The results have practical impact for surveillance, ethology, and real-time tracking in challenging lighting, and suggest future work toward more diverse distortions and temporal modeling.
Abstract
Accurate object tracking in low-light environments is crucial, particularly in surveillance and ethology applications. However, achieving this is significantly challenging due to the poor quality of captured sequences. Factors such as noise, color imbalance, and low contrast contribute to these challenges. This paper presents a comprehensive study examining the impact of these distortions on automatic object trackers. Additionally, we propose a solution to enhance tracking performance by integrating denoising and low-light enhancement methods into the transformer-based object tracking system. Experimental results show that the proposed tracker, trained with low-light synthetic datasets, outperforms both the vanilla MixFormer and Siam R-CNN.
