Table of Contents
Fetching ...

RGB-Event ISP: The Dataset and Benchmark

Yunfan Lu, Yanlin Qian, Ziyang Rao, Junren Xiao, Liming Chen, Hui Xiong

TL;DR

This paper tackles the lack of ISP-focused research for event sensors by introducing the first aligned RAW–EVS dataset collected with a hybrid vision sensor, and by defining a controllable MATLAB-based ISP to produce RGB references. It benchmarks existing learnable ISP methods across full-pipeline, stage-wise, and image-enhancement categories, and additionally proposes a simple event-guided UNet variant to fuse events with RAW data. The results reveal strong outdoor gains when incorporating events (notably with EV-UNet) but highlight indoor flicker and scene-dependent performance as key challenges, underscoring the need for more advanced event fusion and scene-aware ISP strategies. The dataset and benchmark establish a foundation for exploring how event data can reform the ISP process and enable more robust, high-dynamic-range imaging under diverse conditions.

Abstract

Event-guided imaging has received significant attention due to its potential to revolutionize instant imaging systems. However, the prior methods primarily focus on enhancing RGB images in a post-processing manner, neglecting the challenges of image signal processor (ISP) dealing with event sensor and the benefits events provide for reforming the ISP process. To achieve this, we conduct the first research on event-guided ISP. First, we present a new event-RAW paired dataset, collected with a novel but still confidential sensor that records pixel-level aligned events and RAW images. This dataset includes 3373 RAW images with 2248 x 3264 resolution and their corresponding events, spanning 24 scenes with 3 exposure modes and 3 lenses. Second, we propose a conventional ISP pipeline to generate good RGB frames as reference. This conventional ISP pipleline performs basic ISP operations, e.g.demosaicing, white balancing, denoising and color space transforming, with a ColorChecker as reference. Third, we classify the existing learnable ISP methods into 3 classes, and select multiple methods to train and evaluate on our new dataset. Lastly, since there is no prior work for reference, we propose a simple event-guided ISP method and test it on our dataset. We further put forward key technical challenges and future directions in RGB-Event ISP. In summary, to the best of our knowledge, this is the very first research focusing on event-guided ISP, and we hope it will inspire the community. The code and dataset are available at: https://github.com/yunfanLu/RGB-Event-ISP.

RGB-Event ISP: The Dataset and Benchmark

TL;DR

This paper tackles the lack of ISP-focused research for event sensors by introducing the first aligned RAW–EVS dataset collected with a hybrid vision sensor, and by defining a controllable MATLAB-based ISP to produce RGB references. It benchmarks existing learnable ISP methods across full-pipeline, stage-wise, and image-enhancement categories, and additionally proposes a simple event-guided UNet variant to fuse events with RAW data. The results reveal strong outdoor gains when incorporating events (notably with EV-UNet) but highlight indoor flicker and scene-dependent performance as key challenges, underscoring the need for more advanced event fusion and scene-aware ISP strategies. The dataset and benchmark establish a foundation for exploring how event data can reform the ISP process and enable more robust, high-dynamic-range imaging under diverse conditions.

Abstract

Event-guided imaging has received significant attention due to its potential to revolutionize instant imaging systems. However, the prior methods primarily focus on enhancing RGB images in a post-processing manner, neglecting the challenges of image signal processor (ISP) dealing with event sensor and the benefits events provide for reforming the ISP process. To achieve this, we conduct the first research on event-guided ISP. First, we present a new event-RAW paired dataset, collected with a novel but still confidential sensor that records pixel-level aligned events and RAW images. This dataset includes 3373 RAW images with 2248 x 3264 resolution and their corresponding events, spanning 24 scenes with 3 exposure modes and 3 lenses. Second, we propose a conventional ISP pipeline to generate good RGB frames as reference. This conventional ISP pipleline performs basic ISP operations, e.g.demosaicing, white balancing, denoising and color space transforming, with a ColorChecker as reference. Third, we classify the existing learnable ISP methods into 3 classes, and select multiple methods to train and evaluate on our new dataset. Lastly, since there is no prior work for reference, we propose a simple event-guided ISP method and test it on our dataset. We further put forward key technical challenges and future directions in RGB-Event ISP. In summary, to the best of our knowledge, this is the very first research focusing on event-guided ISP, and we hope it will inspire the community. The code and dataset are available at: https://github.com/yunfanLu/RGB-Event-ISP.

Paper Structure

This paper contains 11 sections, 8 figures, 4 tables.

Figures (8)

  • Figure 1: (a), (b), and (c) display a RAW, Events, and RGB frame captured by the hybrid vision sensor (HVS), respectively. The RAW image follows a quad-Bayer pattern yang2022mipi, while the events are positioned at the lower-right corner of each color pixel block, making the RAW resolution twice that of the events. (d) illustrates the traditional ISP process. (e) shows the potential event-guided ISP process, where the higher temporal resolution of events can captures motion information for ISP.
  • Figure 2: Overview of dataset collection. (a) illustrates the variety of scenes in the dataset, including buildings, plants, animals, and calibration boards. (b) presents a schematic of the HVS sensor, composed of a stacked active pixel sensor (APS) and an event vision sensor (EVS). (c) displays dataset samples.
  • Figure 3: Flows in controllable ISP process. (a) Quad-bayer pattern raw image, which serves as the initial input. (b) Black pattern and fixed-pattern noise removal to suppress sensor-induced artifacts. (c) Demosaicing to reconstruct a rgb image from the raw data. (d) White balancing using a ColorChecker for accurate color reproduction. (e) Denoising to filter out spatial noise from the image. (f) Color space transformation and Gamma to convert the image into the desired color space for final output.
  • Figure 4: Fixed pattern noise (FPN) removal. (a) Visualizes the camera's fixed pattern noise. (b) and (c) show the RGB images without and with fixed pattern noise removal, respectively. The image in (c) demonstrates lower noise and more accurate white balance after the removal of fixed pattern noise.
  • Figure 5: Color errors and fluctuations of our ISP method, computed using a ColorChecker. (a) CIEDE 2000 Error Probability Density Distribution: Displays CIEDE 2000 error values distribution with annotations for average (5.84), median (5.07), and maximum error means (14.62). (b) CIEDE Lab Error Probability Density Distribution: Shows CIEDE Lab error values distribution, indicating average (8.99), median (7.0), and maximum error means (24.37). (c) Time-Domain Fluctuations of RGB Color Values: Illustrates RGB color values fluctuations over time, representing temporal stability and variations in color accuracy.
  • ...and 3 more figures