Unified Low-Light Traffic Image Enhancement via Multi-Stage Illumination Recovery and Adaptive Noise Suppression
Siddiqua Namrah
TL;DR
The paper tackles the challenge of reliable perception in nighttime traffic imagery by proposing a fully unsupervised, multi-stage LLIE framework that decomposes images into illumination and reflectance via Retinex-inspired networks and refines them with Channel-Guidance, Color Enhancement, and Over-Exposure Correction modules. It introduces a comprehensive loss design—Projection, Consistency, Retinex, and Perceptual losses—to learn physically valid decomposition and perceptually faithful enhancements without ground-truth pairs. Across BDD100K-Night, LoLI-Street, and MVLT, the method achieves state-of-the-art or near-state-of-the-art performance in full-reference and no-reference metrics, while delivering visually coherent results suitable for real-time traffic applications. The approach significantly improves visibility, texture preservation, and downstream perception reliability in real-world low-light urban environments, with potential for video extension and edge-device deployment.
Abstract
Enhancing low-light traffic images is crucial for reliable perception in autonomous driving, intelligent transportation, and urban surveillance systems. Nighttime and dimly lit traffic scenes often suffer from poor visibility due to low illumination, noise, motion blur, non-uniform lighting, and glare from vehicle headlights or street lamps, which hinder tasks such as object detection and scene understanding. To address these challenges, we propose a fully unsupervised multi-stage deep learning framework for low-light traffic image enhancement. The model decomposes images into illumination and reflectance components, progressively refined by three specialized modules: (1) Illumination Adaptation, for global and local brightness correction; (2) Reflectance Restoration, for noise suppression and structural detail recovery using spatial-channel attention; and (3) Over-Exposure Compensation, for reconstructing saturated regions and balancing scene luminance. The network is trained using self-supervised reconstruction, reflectance smoothness, perceptual consistency, and domain-aware regularization losses, eliminating the need for paired ground-truth images. Experiments on general and traffic-specific datasets demonstrate superior performance over state-of-the-art methods in both quantitative metrics (PSNR, SSIM, LPIPS, NIQE) and qualitative visual quality. Our approach enhances visibility, preserves structure, and improves downstream perception reliability in real-world low-light traffic scenarios.
