Learning Physics-Informed Color-Aware Transforms for Low-Light Image Enhancement
Xingxing Yang, Jie Chen, Zaifeng Yang
TL;DR
This work addresses the instability and color distortion observed when enhancing low-light images via direct sRGB mappings, especially under varied SPD. It introduces PiCat, a physics-informed pipeline comprising a Color-aware Transform (CAT) that yields illumination-invariant descriptors under a Lambertian prior, and a Content-Noise Decomposition Network (CNDN) that explicitly estimates and separates noise to preserve content during enhancement; a downstream restoration network (MST) completes the process. The approach delivers state-of-the-art performance across five LLIE datasets with lower computational cost, and demonstrates robustness to SPD perturbations as well as plug-and-play compatibility with other restoration models. Overall, PiCat provides a principled, efficient, and robust LLIE solution that better handles complex lighting and SPD variations in practical scenarios.
Abstract
Image decomposition offers deep insights into the imaging factors of visual data and significantly enhances various advanced computer vision tasks. In this work, we introduce a novel approach to low-light image enhancement based on decomposed physics-informed priors. Existing methods that directly map low-light to normal-light images in the sRGB color space suffer from inconsistent color predictions and high sensitivity to spectral power distribution (SPD) variations, resulting in unstable performance under diverse lighting conditions. To address these challenges, we introduce a Physics-informed Color-aware Transform (PiCat), a learning-based framework that converts low-light images from the sRGB color space into deep illumination-invariant descriptors via our proposed Color-aware Transform (CAT). This transformation enables robust handling of complex lighting and SPD variations. Complementing this, we propose the Content-Noise Decomposition Network (CNDN), which refines the descriptor distributions to better align with well-lit conditions by mitigating noise and other distortions, thereby effectively restoring content representations to low-light images. The CAT and the CNDN collectively act as a physical prior, guiding the transformation process from low-light to normal-light domains. Our proposed PiCat framework demonstrates superior performance compared to state-of-the-art methods across five benchmark datasets.
