Multi-scale HSV Color Feature Embedding for High-fidelity NIR-to-RGB Spectrum Translation
Huiyu Zhai, Mo Chen, Xingxing Yang, Gusheng Kang
TL;DR
This work addresses the challenging NIR-to-RGB translation problem, where spectral gaps cause mapping ambiguities that threaten texture fidelity and color diversity. It introduces Multi-scale HSV Color Feature Embedding Network (MCFNet), which decomposes the task into three sub-tasks—NIR texture maintenance, coarse geometry reconstruction, and RGB color prediction—via Texture Preserving Block, HSV Color Feature Embedding Module, and Geometry Reconstruction Module, respectively, with a multi-scale fusion strategy. The method leverages HSV color space guidance and Laplacian-based texture features, integrated through SPADE fusion, and optimizes with a loss suite including GAN, pair-consistent, cycle-consistent, and edge losses. Experimental results on the VCIP dataset show substantial improvements over state-of-the-art NIR colorization methods in PSNR, AE, and LPIPS, while ablations confirm the critical role of texture fusion, multiscale color guidance, and HSV-CFEM. The approach provides a practical, high-fidelity pipeline for NIR-to-RGB spectrum translation with improved texture preservation and color accuracy, and the authors release code for reproducibility.
Abstract
The NIR-to-RGB spectral domain translation is a formidable task due to the inherent spectral mapping ambiguities within NIR inputs and RGB outputs. Thus, existing methods fail to reconcile the tension between maintaining texture detail fidelity and achieving diverse color variations. In this paper, we propose a Multi-scale HSV Color Feature Embedding Network (MCFNet) that decomposes the mapping process into three sub-tasks, including NIR texture maintenance, coarse geometry reconstruction, and RGB color prediction. Thus, we propose three key modules for each corresponding sub-task: the Texture Preserving Block (TPB), the HSV Color Feature Embedding Module (HSV-CFEM), and the Geometry Reconstruction Module (GRM). These modules contribute to our MCFNet methodically tackling spectral translation through a series of escalating resolutions, progressively enriching images with color and texture fidelity in a scale-coherent fashion. The proposed MCFNet demonstrates substantial performance gains over the NIR image colorization task. Code is released at: https://github.com/AlexYangxx/MCFNet.
