Event-assisted 12-stop HDR Imaging of Dynamic Scene
Shi Guo, Zixuan Chen, Ziran Zhang, Yutian Chen, Gangwei Xu, Tianfan Xue
TL;DR
This work tackles pushing HDR imaging in dynamic scenes to 12 stops by combining an event camera with an RGB camera. It introduces an explicit event-assisted alignment module to robustly align LDR frames with large exposure gaps, and a diffusion-based fusion module that leverages priors from pretrained diffusion models to suppress alignment artifacts and preserve detail. The authors also present the ESHDR dataset, a paired RGB-Event dataset for 12-stop HDR, and validate the approach on simulated and real-world data, achieving state-of-the-art results on both fidelity and perceptual metrics. The approach has practical implications for high-contrast imaging in dynamic scenes and highlights the value of integrating event signals with diffusion priors to extend HDR capabilities.
Abstract
High dynamic range (HDR) imaging is a crucial task in computational photography, which captures details across diverse lighting conditions. Traditional HDR fusion methods face limitations in dynamic scenes with extreme exposure differences, as aligning low dynamic range (LDR) frames becomes challenging due to motion and brightness variation. In this work, we propose a novel 12-stop HDR imaging approach for dynamic scenes, leveraging a dual-camera system with an event camera and an RGB camera. The event camera provides temporally dense, high dynamic range signals that improve alignment between LDR frames with large exposure differences, reducing ghosting artifacts caused by motion. Also, a real-world finetuning strategy is proposed to increase the generalization of alignment module on real-world events. Additionally, we introduce a diffusion-based fusion module that incorporates image priors from pre-trained diffusion models to address artifacts in high-contrast regions and minimize errors from the alignment process. To support this work, we developed the ESHDR dataset, the first dataset for 12-stop HDR imaging with synchronized event signals, and validated our approach on both simulated and real-world data. Extensive experiments demonstrate that our method achieves state-of-the-art performance, successfully extending HDR imaging to 12 stops in dynamic scenes.
