Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging
Mahmoud Afifi, Zhenhua Hu, Liang Liang
TL;DR
This paper tackles illuminant estimation in dual-exposure HDR imaging by introducing a compact dual-exposure feature (DEF) computed from long ($I_l$) and short ($I_s$) exposure frames. DEF guides two lightweight estimators: an exposure-based MLP (EMLP) that takes DEF as input, and an exposure-based CCC (ECCC) that dynamically biases a two-histogram CCC using DEF-driven interpolation of learnable biases. On a newly collected multi-exposure HDR dataset, DEF-enabled models achieve competitive accuracy with far fewer parameters (EMLP ~354 params; ECCC ~6,156 params) and show that ensemble predictions can surpass state-of-the-art methods while remaining computationally efficient. The work demonstrates practical applicability for on-device white balance in camera ISPs and suggests further exploration of cross-camera stability and spatially varying DEF variants for real-world HDR pipelines. $I_l$ and $I_s$ are used to derive the DEF, which in turn informs the illuminant estimate $oldsymbol{\, ext{ell}}$ through the proposed models.$
Abstract
High dynamic range (HDR) imaging involves capturing a series of frames of the same scene, each with different exposure settings, to broaden the dynamic range of light. This can be achieved through burst capturing or using staggered HDR sensors that capture long and short exposures simultaneously in the camera image signal processor (ISP). Within camera ISP pipeline, illuminant estimation is a crucial step aiming to estimate the color of the global illuminant in the scene. This estimation is used in camera ISP white-balance module to remove undesirable color cast in the final image. Despite the multiple frames captured in the HDR pipeline, conventional illuminant estimation methods often rely only on a single frame of the scene. In this paper, we explore leveraging information from frames captured with different exposure times. Specifically, we introduce a simple feature extracted from dual-exposure images to guide illuminant estimators, referred to as the dual-exposure feature (DEF). To validate the efficiency of DEF, we employed two illuminant estimators using the proposed DEF: 1) a multilayer perceptron network (MLP), referred to as exposure-based MLP (EMLP), and 2) a modified version of the convolutional color constancy (CCC) to integrate our DEF, that we call ECCC. Both EMLP and ECCC achieve promising results, in some cases surpassing prior methods that require hundreds of thousands or millions of parameters, with only a few hundred parameters for EMLP and a few thousand parameters for ECCC.
