Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications
Brayan Monroy, Kebin Contreras, Jorge Bacca
TL;DR
This work addresses HDR reconstruction from modulo imaging by reframing Unlimited Sampling Framework recovery as an autoregressive high-order finite-difference phase-unwrapping problem in 2D. It introduces AHFD, which combines neighborhood vectorization, a $DCT$-domain autoregressive solver, and a stride-artifact removal step to enable robust HDR reconstruction from modulo measurements. The method achieves competitive HDR restoration compared with state-of-the-art optimization and deep-learning approaches and improves object-detection performance in autonomous-driving scenarios without retraining. These contributions advance practical HDR sensing with modulo-ADC hardware, enabling better handling of saturated scenes in real-time vision tasks.
Abstract
High dynamic range (HDR) imaging is vital for capturing the full range of light tones in scenes, essential for computer vision tasks such as autonomous driving. Standard commercial imaging systems face limitations in capacity for well depth, and quantization precision, hindering their HDR capabilities. Modulo imaging, based on unlimited sampling (US) theory, addresses these limitations by using a modulo analog-to-digital approach that resets signals upon saturation, enabling estimation of pixel resets through neighboring pixel intensities. Despite the effectiveness of (US) algorithms in one-dimensional signals, their optimization problem for two-dimensional signals remains unclear. This work formulates the US framework as an autoregressive $\ell_2$ phase unwrapping problem, providing computationally efficient solutions in the discrete cosine domain jointly with a stride removal algorithm also based on spatial differences. By leveraging higher-order finite differences for two-dimensional images, our approach enhances HDR image reconstruction from modulo images, demonstrating its efficacy in improving object detection in autonomous driving scenes without retraining.
