Leveraging Multispectral Sensors for Color Correction in Mobile Cameras
Luca Cogo, Marco Buzzelli, Simone Bianco, Javier Vazquez-Corral, Raimondo Schettini
TL;DR
This work tackles color correction for mobile cameras by fusing high-resolution RGB data with a low-resolution multispectral signal in an end-to-end framework. The authors adapt two lightweight image-to-image backbones (LPIENet and cmKAN) with a spectral encoder to jointly estimate illumination, discount it, and perform color space transformation, producing color-accurate CIE XYZ outputs. They introduce a physically grounded dataset of 116,688 RGB–MS–XYZ triplets under 102 illuminants, plus a misaligned variant, and demonstrate substantial improvements (up to ~50% in $\Delta E_{00}$) over RGB-only and MS baselines, including robustness to exposure changes. The results validate the value of incorporating multispectral cues throughout the color-correction pipeline and highlight the framework’s flexibility to accommodate future backbones and hardware configurations.
Abstract
Recent advances in snapshot multispectral (MS) imaging have enabled compact, low-cost spectral sensors for consumer and mobile devices. By capturing richer spectral information than conventional RGB sensors, these systems can enhance key imaging tasks, including color correction. However, most existing methods treat the color correction pipeline in separate stages, often discarding MS data early in the process. We propose a unified, learning-based framework that (i) performs end-to-end color correction and (ii) jointly leverages data from a high-resolution RGB sensor and an auxiliary low-resolution MS sensor. Our approach integrates the full pipeline within a single model, producing coherent and color-accurate outputs. We demonstrate the flexibility and generality of our framework by refactoring two different state-of-the-art image-to-image architectures. To support training and evaluation, we construct a dedicated dataset by aggregating and repurposing publicly available spectral datasets, rendering under multiple RGB camera sensitivities. Extensive experiments show that our approach improves color accuracy and stability, reducing error by up to 50% compared to RGB-only and MS-driven baselines. Datasets, code, and models will be made available upon acceptance.
