Real-time Noise Source Estimation of a Camera System from an Image and Metadata
Maik Wischow, Patrick Irmisch, Anko Boerner, Guillermo Gallego
TL;DR
This work presents a real-time, memory-efficient noise-source estimator for camera systems that fuses image data with camera metadata to decompose total image noise into photon shot noise, dark current shot noise, readout noise, and an unexpected-noise term. A four-branch DNN (PN, DCSN, RN, and $\xi_{ ext{M/I}}$) extends a DRNE-based baseline, with metadata configuration into w/o-Meta, Min-Meta, and Full-Meta variants; training relies on simulated physics-based noise on a large, mostly noise-free dataset. Across six datasets and two real camera systems, Full-Meta consistently yields the most accurate and robust per-noise-source estimates and total noise, even handling unexpected noise via $oldsymbol{\xi}_{ ext{M/I}}$ detection. The approach demonstrates real-time performance (approximately 1–2 ms per patch) and can improve downstream tasks such as denoising, while providing a foundation for automatic countermeasures in autonomous perception systems. Overall, this work advances noise characterization from symptom suppression to root-cause identification by leveraging physics-informed metadata fusion in a compact neural architecture.
Abstract
Autonomous machines must self-maintain proper functionality to ensure the safety of humans and themselves. This pertains particularly to its cameras as predominant sensors to perceive the environment and support actions. A fundamental camera problem addressed in this study is noise. Solutions often focus on denoising images a posteriori, that is, fighting symptoms rather than root causes. However, tackling root causes requires identifying the noise sources, considering the limitations of mobile platforms. This work investigates a real-time, memory-efficient and reliable noise source estimator that combines data- and physically-based models. To this end, a DNN that examines an image with camera metadata for major camera noise sources is built and trained. In addition, it quantifies unexpected factors that impact image noise or metadata. This study investigates seven different estimators on six datasets that include synthetic noise, real-world noise from two camera systems, and real field campaigns. For these, only the model with most metadata is capable to accurately and robustly quantify all individual noise contributions. This method outperforms total image noise estimators and can be plug-and-play deployed. It also serves as a basis to include more advanced noise sources, or as part of an automatic countermeasure feedback-loop to approach fully reliable machines.
