Latent Disentanglement for Low Light Image Enhancement
Zhihao Zheng, Mooi Choo Chuah
TL;DR
This work introduces LDE-Net, a transformer-based latent-disentanglement framework for low-light image enhancement that separates content and illumination in latent space to avoid corruption from pixel-space Retinex decompositions. A Content-Aware Embedding (CAE) module guides a light-weight illumination enhancer, producing strong LLIE results and enabling efficient downstream tasks such as nighttime UAV tracking and low-light object detection. The approach achieves state-of-the-art performance on public LLIE benchmarks (LOL-v1, LOL-v2, LDIS) and demonstrates transferability and practicality through ablations and downstream task experiments. By combining robust latent disentanglement with cross-attention-based fusion, the method delivers sharper details, better lighting consistency, and competitive or superior performance with significantly reduced computational cost.
Abstract
Many learning-based low-light image enhancement (LLIE) algorithms are based on the Retinex theory. However, the Retinex-based decomposition techniques in such models introduce corruptions which limit their enhancement performance. In this paper, we propose a Latent Disentangle-based Enhancement Network (LDE-Net) for low light vision tasks. The latent disentanglement module disentangles the input image in latent space such that no corruption remains in the disentangled Content and Illumination components. For LLIE task, we design a Content-Aware Embedding (CAE) module that utilizes Content features to direct the enhancement of the Illumination component. For downstream tasks (e.g. nighttime UAV tracking and low-light object detection), we develop an effective light-weight enhancer based on the latent disentanglement framework. Comprehensive quantitative and qualitative experiments demonstrate that our LDE-Net significantly outperforms state-of-the-art methods on various LLIE benchmarks. In addition, the great results obtained by applying our framework on the downstream tasks also demonstrate the usefulness of our latent disentanglement design.
