Color-Quality Invariance for Robust Medical Image Segmentation
Ravi Shah, Atsushi Fukuda, Quan Huu Cap
TL;DR
Single-source domain generalization for medical image segmentation suffers when color distribution and image quality shift across domains. The authors propose Dynamic Color Image Normalization (DCIN) with Global and Local Reference Image Selection (GRIS/LRIS) and a Color-Quality Generalization (CQG) loss to enforce consistent segmentation across color/quality variants. Across throat-pharyngeal segmentation tasks, these components yield substantial cross-domain gains (up to 32.3 Dice points) and are model-agnostic. This approach improves robustness to unseen domains and holds promise for broader color-sensitive medical imaging applications.
Abstract
Single-source domain generalization (SDG) in medical image segmentation remains a significant challenge, particularly for images with varying color distributions and qualities. Previous approaches often struggle when models trained on high-quality images fail to generalize to low-quality test images due to these color and quality shifts. In this work, we propose two novel techniques to enhance generalization: dynamic color image normalization (DCIN) module and color-quality generalization (CQG) loss. The DCIN dynamically normalizes the color of test images using two reference image selection strategies. Specifically, the DCIN utilizes a global reference image selection (GRIS), which finds a universal reference image, and a local reference image selection (LRIS), which selects a semantically similar reference image per test sample. Additionally, CQG loss enforces invariance to color and quality variations by ensuring consistent segmentation predictions across transformed image pairs. Experimental results show that our proposals significantly improve segmentation performance over the baseline on two target domain datasets, despite being trained solely on a single source domain. Notably, our model achieved up to a 32.3-point increase in Dice score compared to the baseline, consistently producing robust and usable results even under substantial domain shifts. Our work contributes to the development of more robust medical image segmentation models that generalize across unseen domains. The implementation code is available at https://github.com/RaviShah1/DCIN-CQG.
