Intrinsic Single-Image HDR Reconstruction
Sebastian Dille, Chris Careaga, Yağız Aksoy
TL;DR
Problem: Reconstructing HDR from a single LDR image is challenging due to saturation and nonlinear camera processing. Approach: A physically-mounded intrinsic HDR pipeline decomposes the image into shading $S$ and albedo $A$, learning separate HDR extensions in the shading domain via $D_H$ and in the albedo domain via $A_H$, with an additional refinement stage; shading uses the inverse domain $D_L = 1/(S_L+1)$ to handle long-tail illumination, and a soft-albedo guidance mask aids color recovery. Contributions: (i) a modular two-subtask HDR framework in the intrinsic domain, (ii) tailored loss terms on implied intrinsic components and an inverse-HDR representation, (iii) a dedicated refinement module, and (iv) strong quantitative and qualitative results across multiple in-the-wild datasets. Significance: enables high-fidelity recovery of bright-luminance details and accurate color from a single image, enhancing HDR display, editing, and downstream computational photography applications.
Abstract
The low dynamic range (LDR) of common cameras fails to capture the rich contrast in natural scenes, resulting in loss of color and details in saturated pixels. Reconstructing the high dynamic range (HDR) of luminance present in the scene from single LDR photographs is an important task with many applications in computational photography and realistic display of images. The HDR reconstruction task aims to infer the lost details using the context present in the scene, requiring neural networks to understand high-level geometric and illumination cues. This makes it challenging for data-driven algorithms to generate accurate and high-resolution results. In this work, we introduce a physically-inspired remodeling of the HDR reconstruction problem in the intrinsic domain. The intrinsic model allows us to train separate networks to extend the dynamic range in the shading domain and to recover lost color details in the albedo domain. We show that dividing the problem into two simpler sub-tasks improves performance in a wide variety of photographs.
