CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement
Wei Wang, Zhi Jin
TL;DR
This work tackles the challenge of low-light image enhancement when JPEG compression degrades dark-region information. It introduces CAPformer, a compression-aware pre-trained Transformer with a Brightness-Guided Self-Attention (BGSA) mechanism, designed to learn lossless information from uncompressed low-light data and suppress unreliable information from very dark regions during enhancement. The model employs a U-shaped encoder–decoder with a Transformer bottleneck and a pre-training–fine-tuning strategy, achieving state-of-the-art PSNR/SSIM on JPEG LLIE benchmarks and strong qualitative results with fewer artifacts and better color fidelity. The approach enables robust LLIE in resource-constrained settings, benefiting mobile photography where storage and transmission constraints are common.
Abstract
Low-Light Image Enhancement (LLIE) has advanced with the surge in phone photography demand, yet many existing methods neglect compression, a crucial concern for resource-constrained phone photography. Most LLIE methods overlook this, hindering their effectiveness. In this study, we investigate the effects of JPEG compression on low-light images and reveal substantial information loss caused by JPEG due to widespread low pixel values in dark areas. Hence, we propose the Compression-Aware Pre-trained Transformer (CAPformer), employing a novel pre-training strategy to learn lossless information from uncompressed low-light images. Additionally, the proposed Brightness-Guided Self-Attention (BGSA) mechanism enhances rational information gathering. Experiments demonstrate the superiority of our approach in mitigating compression effects on LLIE, showcasing its potential for improving LLIE in resource-constrained scenarios.
