LEDiff: Latent Exposure Diffusion for HDR Generation
Chao Wang, Zhihao Xia, Thomas Leimkuehler, Karol Myszkowski, Xuaner Zhang
TL;DR
LEDiff tackles the limitation of 8-bit LDR content by enabling HDR generation within pre-trained latent diffusion models through latent space exposure fusion. It preserves the original latent space and uses a small HDR dataset to train a learnable fusion module and a fine-tuned HDR decoder, complemented by highlight and shadow denoisers to hallucinate clipped regions. The approach supports both HDR content generation (text-to-HDR, panorama, and video) and LDR-to-HDR reconstruction, showing competitive HDR quality across objective metrics and user studies. This work provides a practical, plug-and-play pathway to photorealistic HDR outputs for generative content and HDR-enabled downstream tasks like image-based lighting and depth-of-field rendering. It also demonstrates robust performance with limited HDR data, suggesting strong potential for broad adoption and further improvements in HDR-aware generative modeling.
Abstract
While consumer displays increasingly support more than 10 stops of dynamic range, most image assets such as internet photographs and generative AI content remain limited to 8-bit low dynamic range (LDR), constraining their utility across high dynamic range (HDR) applications. Currently, no generative model can produce high-bit, high-dynamic range content in a generalizable way. Existing LDR-to-HDR conversion methods often struggle to produce photorealistic details and physically-plausible dynamic range in the clipped areas. We introduce LEDiff, a method that enables a generative model with HDR content generation through latent space fusion inspired by image-space exposure fusion techniques. It also functions as an LDR-to-HDR converter, expanding the dynamic range of existing low-dynamic range images. Our approach uses a small HDR dataset to enable a pretrained diffusion model to recover detail and dynamic range in clipped highlights and shadows. LEDiff brings HDR capabilities to existing generative models and converts any LDR image to HDR, creating photorealistic HDR outputs for image generation, image-based lighting (HDR environment map generation), and photographic effects such as depth of field simulation, where linear HDR data is essential for realistic quality.
