SEL-CIE: Knowledge-Guided Self-Supervised Learning Framework for CIE-XYZ Reconstruction from Non-Linear sRGB Images
Shir Barzel, Moshe Salhov, Ofir Lindenbaum, Amir Averbuch
TL;DR
The paper tackles reconstructing device-independent CIE-XYZ color images from non-linear sRGB inputs under limited paired data. It introduces SEL-CIE, a three-phase framework that combines supervised learning on paired sRGB2XYZ data, a color-board–guided self-supervised pre-task, and a final supervised refinement, facilitated by a dual-branch global/local network with a ResNet50 backbone. A trainable balancing parameter and a Delta E 76–based color-space loss enable effective SSL integration, yielding superior PSNR and SSIM on the sRGB2XYZ benchmark, especially when using the ResNet50 backbone (SEL-CIE-RB). These results demonstrate robust color-space reconstruction with potential impact on color-critical computer vision and medical imaging tasks requiring standardized color representations.
Abstract
Modern cameras typically offer two types of image states: a minimally processed linear raw RGB image representing the raw sensor data, and a highly-processed non-linear image state, such as the sRGB state. The CIE-XYZ color space is a device-independent linear space used as part of the camera pipeline and can be helpful for computer vision tasks, such as image deblurring, dehazing, and color recognition tasks in medical applications, where color accuracy is important. However, images are usually saved in non-linear states, and achieving CIE-XYZ color images using conventional methods is not always possible. To tackle this issue, classical methodologies have been developed that focus on reversing the acquisition pipeline. More recently, supervised learning has been employed, using paired CIE-XYZ and sRGB representations of identical images. However, obtaining a large-scale dataset of CIE-XYZ and sRGB pairs can be challenging. To overcome this limitation and mitigate the reliance on large amounts of paired data, self-supervised learning (SSL) can be utilized as a substitute for relying solely on paired data. This paper proposes a framework for using SSL methods alongside paired data to reconstruct CIE-XYZ images and re-render sRGB images, outperforming existing approaches. The proposed framework is applied to the sRGB2XYZ dataset.
