Multi-Object Tracking in the Dark
Xinzhe Wang, Kang Ma, Qiankun Liu, Yunhao Zou, Ying Fu
TL;DR
This work tackles multi-object tracking in dark scenes by introducing the LMOT dataset, built with a dual-camera RAW system that yields aligned low-light and well-lit video pairs along with high-quality MOT annotations. It then presents LTrack, a tracking method that avoids heavy low-light enhancement and instead learns invariant, noise-robust features through adaptive low-pass downsampling and degradation suppression learning. Key contributions include the LMOT dataset design and statistics, the ALD and DSL components, and extensive experiments showing state-of-the-art performance in real night scenes. The approach offers practical impact for night-time autonomous driving and surveillance by providing a robust MOT pipeline that leverages RAW data and targeted feature regularization.
Abstract
Low-light scenes are prevalent in real-world applications (e.g. autonomous driving and surveillance at night). Recently, multi-object tracking in various practical use cases have received much attention, but multi-object tracking in dark scenes is rarely considered. In this paper, we focus on multi-object tracking in dark scenes. To address the lack of datasets, we first build a Low-light Multi-Object Tracking (LMOT) dataset. LMOT provides well-aligned low-light video pairs captured by our dual-camera system, and high-quality multi-object tracking annotations for all videos. Then, we propose a low-light multi-object tracking method, termed as LTrack. We introduce the adaptive low-pass downsample module to enhance low-frequency components of images outside the sensor noises. The degradation suppression learning strategy enables the model to learn invariant information under noise disturbance and image quality degradation. These components improve the robustness of multi-object tracking in dark scenes. We conducted a comprehensive analysis of our LMOT dataset and proposed LTrack. Experimental results demonstrate the superiority of the proposed method and its competitiveness in real night low-light scenes. Dataset and Code: https: //github.com/ying-fu/LMOT
