Cross-modal tumor segmentation using generative blending augmentation and self training
Guillaume Sallé, Pierre-Henri Conze, Julien Bert, Nicolas Boussion, Dimitris Visvikis, Vincent Jaouen
TL;DR
Domain shift and data scarcity limit cross-modal medical image segmentation. The authors propose Generative Blending Augmentation (GBA), which uses a SinGAN cascade to diversify tumor appearances and harmonize altered ROIs within CycleGAN-generated pseudo-targets, paired with iterative self-training to improve segmentation on unlabelled target modalities. Integrated into a conventional image-to-image translation plus segmentation pipeline, the approach achieves top performance on vestibular schwannoma segmentation in the CrossMoDA 2022 challenge, driven by improved Dice and ASSD metrics. The study highlights center-specific augmentation and self-training as effective strategies to close appearance gaps between centers and modalities, with applicability to other segmentation tasks under domain shifts.
Abstract
\textit{Objectives}: Data scarcity and domain shifts lead to biased training sets that do not accurately represent deployment conditions. A related practical problem is cross-modal image segmentation, where the objective is to segment unlabelled images using previously labelled datasets from other imaging modalities. \textit{Methods}: We propose a cross-modal segmentation method based on conventional image synthesis boosted by a new data augmentation technique called Generative Blending Augmentation (GBA). GBA leverages a SinGAN model to learn representative generative features from a single training image to diversify realistically tumor appearances. This way, we compensate for image synthesis errors, subsequently improving the generalization power of a downstream segmentation model. The proposed augmentation is further combined to an iterative self-training procedure leveraging pseudo labels at each pass. \textit{Results}: The proposed solution ranked first for vestibular schwannoma (VS) segmentation during the validation and test phases of the MICCAI CrossMoDA 2022 challenge, with best mean Dice similarity and average symmetric surface distance measures. \textit{Conclusion and significance}: Local contrast alteration of tumor appearances and iterative self-training with pseudo labels are likely to lead to performance improvements in a variety of segmentation contexts.
