Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior
Baiang Li, Sizhuo Ma, Yanhong Zeng, Xiaogang Xu, Youqing Fang, Zhao Zhang, Jian Wang, Kai Chen
TL;DR
The paper tackles the challenge of recovering high dynamic range content from a single 8-bit LDR image. It introduces a two-stage LS-Sagiri pipeline: Stage 1 Latent-SwinIR_c performs color restoration and brightness adjustment, and Stage 2 Sagiri employs a diffusion-prior conditioned on the restored image to generate plausible details in dynamic-range extremes. The training deploys specialized losses for color ($L_{color}$) and content (multi-term $L_{content}$) and a plug-in training strategy to enable compatibility with existing LDR methods, while using adaptive regional prompts during sampling. Empirical results on HDR-Real, HDR-Eye, and NTIRE datasets show consistent improvements in non-reference quality metrics and demonstrate Sagiri’s ability to generate realistic details in saturated and dark regions, with efficient inference via 30 DDPM steps.
Abstract
Capturing High Dynamic Range (HDR) scenery using 8-bit cameras often suffers from over-/underexposure, loss of fine details due to low bit-depth compression, skewed color distributions, and strong noise in dark areas. Traditional LDR image enhancement methods primarily focus on color mapping, which enhances the visual representation by expanding the image's color range and adjusting the brightness. However, these approaches fail to effectively restore content in dynamic range extremes, which are regions with pixel values close to 0 or 255. To address the full scope of challenges in HDR imaging and surpass the limitations of current models, we propose a novel two-stage approach. The first stage maps the color and brightness to an appropriate range while keeping the existing details, and the second stage utilizes a diffusion prior to generate content in dynamic range extremes lost during capture. This generative refinement module can also be used as a plug-and-play module to enhance and complement existing LDR enhancement models. The proposed method markedly improves the quality and details of LDR images, demonstrating superior performance through rigorous experimental validation. The project page is at https://sagiri0208.github.io
