Hybrid Event Frame Sensors: Modeling, Calibration, and Simulation
Yunfan Lu, Nico Messikommer, Xiaogang Xu, Liming Chen, Yuhan Chen, Nikola Zubic, Davide Scaramuzza, Hui Xiong
TL;DR
This work addresses the challenge of accurately modeling and simulating hybrid APS–EVS sensors by introducing a unified, statistics-based noise model that jointly captures photon shot noise, dark current noise, fixed-pattern noise, and quantization noise for both APS frames and EVS events. A calibration pipeline, leveraging per-pixel Quad-Bayer positioning and an APS–EVS domain mapping via a $Q$-function, yields interpretable noise parameters that enable realistic joint noise generation. Building on this, the authors introduce H-ESIM, a physics-grounded simulator that produces RAW frames and events from high-frame-rate video, with an open ISP and data-driven parameter estimation, achieving strong transfer to real data for tasks like video frame interpolation and deblurring. The framework is validated on GEN2 and Eiger hybrid sensors, showing accurate noise representation and improved downstream performance when models are fine-tuned on synthetic data, thereby enabling reproducible research and robust evaluation of hybrid-event vision systems.
Abstract
Event frame hybrid sensors integrate an Active Pixel Sensor (APS) and an Event Vision Sensor (EVS) within a single chip, combining the high dynamic range and low latency of the EVS with the rich spatial intensity information from the APS. While this tight integration offers compact, temporally precise imaging, the complex circuit architecture introduces non-trivial noise patterns that remain poorly understood and unmodeled. In this work, we present the first unified, statistics-based imaging noise model that jointly describes the noise behavior of APS and EVS pixels. Our formulation explicitly incorporates photon shot noise, dark current noise, fixed-pattern noise, and quantization noise, and links EVS noise to illumination level and dark current. Based on this formulation, we further develop a calibration pipeline to estimate noise parameters from real data and offer a detailed analysis of both APS and EVS noise behaviors. Finally, we propose HESIM, a statistically grounded simulator that generates RAW frames and events under realistic, jointly calibrated noise statistics. Experiments on two hybrid sensors validate our model across multiple imaging tasks (e.g., video frame interpolation and deblurring), demonstrating strong transfer from simulation to real data.
