HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images
Shreyas Singh, Aryan Garg, Kaushik Mitra
TL;DR
HDRSplat extends 3D Gaussian Splatting to operate directly in 14-bit linear HDR space by supervising from denoised raw images, enabling realtime HDR 3D reconstruction in challenging nighttime and low-light scenes. The approach combines Bayer-space denoising (PMRID), a stop-gradient scaled $\mathcal{L}_1$ loss with DSSIM, and rasterization tuning to overcome initialization sparsity, achieving up to $\le 15$ minutes per scene training and $\ge 120$ fps inference. Quantitatively, it outperforms RawNeRF and Raw3DGS on 14-bit HDR renders with significant gains in LPIPS, SSIM, and PSNR, while dramatically reducing memory usage to about $0.35$ million points per scene. The work enables downstream applications like synthetic defocus, dense depth extraction, and post-capture exposure/tonemapping control, highlighting HDR rendering as a practical pathway for real-time HDR 3D scene understanding and editing.
Abstract
The recent advent of 3D Gaussian Splatting (3DGS) has revolutionized the 3D scene reconstruction space enabling high-fidelity novel view synthesis in real-time. However, with the exception of RawNeRF, all prior 3DGS and NeRF-based methods rely on 8-bit tone-mapped Low Dynamic Range (LDR) images for scene reconstruction. Such methods struggle to achieve accurate reconstructions in scenes that require a higher dynamic range. Examples include scenes captured in nighttime or poorly lit indoor spaces having a low signal-to-noise ratio, as well as daylight scenes with shadow regions exhibiting extreme contrast. Our proposed method HDRSplat tailors 3DGS to train directly on 14-bit linear raw images in near darkness which preserves the scenes' full dynamic range and content. Our key contributions are two-fold: Firstly, we propose a linear HDR space-suited loss that effectively extracts scene information from noisy dark regions and nearly saturated bright regions simultaneously, while also handling view-dependent colors without increasing the degree of spherical harmonics. Secondly, through careful rasterization tuning, we implicitly overcome the heavy reliance and sensitivity of 3DGS on point cloud initialization. This is critical for accurate reconstruction in regions of low texture, high depth of field, and low illumination. HDRSplat is the fastest method to date that does 14-bit (HDR) 3D scene reconstruction in $\le$15 minutes/scene ($\sim$30x faster than prior state-of-the-art RawNeRF). It also boasts the fastest inference speed at $\ge$120fps. We further demonstrate the applicability of our HDR scene reconstruction by showcasing various applications like synthetic defocus, dense depth map extraction, and post-capture control of exposure, tone-mapping and view-point.
