GCC: Generative Color Constancy via Diffusing a Color Checker
Chen-Wei Chang, Cheng-De Fan, Chia-Che Chang, Yi-Chen Lo, Yu-Chee Tseng, Jiun-Long Huang, Yu-Lun Liu
TL;DR
This work addresses color constancy under varying camera sensors by leveraging diffusion priors to inpaint a virtual color checker into scenes, from which illumination is estimated. It introduces a deterministic, single-step inference at fixed timestep $t=T$, a Laplacian decomposition to preserve checker structure while adapting to illumination, and a mask-based augmentation to tolerate imprecise color-checker annotations. The approach demonstrates strong cross-camera generalization without sensor-specific training, offering practical, efficient white balance across diverse scenes and enabling spatially varying illumination handling. While effective, it notes limitations in extreme multi-illuminant conditions and small datasets, suggesting avenues for future refinement and data augmentation strategies.
Abstract
Color constancy methods often struggle to generalize across different camera sensors due to varying spectral sensitivities. We present GCC, which leverages diffusion models to inpaint color checkers into images for illumination estimation. Our key innovations include (1) a single-step deterministic inference approach that inpaints color checkers reflecting scene illumination, (2) a Laplacian decomposition technique that preserves checker structure while allowing illumination-dependent color adaptation, and (3) a mask-based data augmentation strategy for handling imprecise color checker annotations. By harnessing rich priors from pre-trained diffusion models, GCC demonstrates strong robustness in challenging cross-camera scenarios. These results highlight our method's effective generalization capability across different camera characteristics without requiring sensor-specific training, making it a versatile and practical solution for real-world applications.
