The 1st Solution for CARE Liver Task Challenge 2025: Contrast-Aware Semi-Supervised Segmentation with Domain Generalization and Test-Time Adaptation
Jincan Lou, Jingkun Chen, Haoquan Li, Hang Li, Wenjian Huang, Weihua Chen, Fan Wang, Jianguo Zhang
TL;DR
This work tackles accurate liver segmentation in contrast-enhanced MRI under annotation scarcity and cross-site domain shifts. It introduces CoSSeg-TTA, a semi-supervised framework built on nnU-Netv2 that integrates a mean-teacher paradigm, histogram-based domain adaptation, a contrast-aware module mapping $X_{T1}$ to $\hat{X}_{GED4}$, and continual test-time adaptation with CoTTA, plus post-processing. By leveraging both labeled and unlabeled data and generating pseudo-labels for further fine-tuning, the approach achieves superior Dice and Hausdorff distances compared to the baseline, with strong generalization to unseen domains. The contributions offer a practical, robust pipeline for cross-domain liver segmentation in clinical MRI, though limitations include the need for paired modalities and substantial computation for real-time deployment.
Abstract
Accurate liver segmentation from contrast-enhanced MRI is essential for diagnosis, treatment planning, and disease monitoring. However, it remains challenging due to limited annotated data, heterogeneous enhancement protocols, and significant domain shifts across scanners and institutions. Traditional image-to-image translation frameworks have made great progress in domain generalization, but their application is not straightforward. For example, Pix2Pix requires image registration, and cycle-GAN cannot be integrated seamlessly into segmentation pipelines. Meanwhile, these methods are originally used to deal with cross-modality scenarios, and often introduce structural distortions and suffer from unstable training, which may pose drawbacks in our single-modality scenario. To address these challenges, we propose CoSSeg-TTA, a compact segmentation framework for the GED4 (Gd-EOB-DTPA enhanced hepatobiliary phase MRI) modality built upon nnU-Netv2 and enhanced with a semi-supervised mean teacher scheme to exploit large amounts of unlabeled volumes. A domain adaptation module, incorporating a randomized histogram-based style appearance transfer function and a trainable contrast-aware network, enriches domain diversity and mitigates cross-center variability. Furthermore, a continual test-time adaptation strategy is employed to improve robustness during inference. Extensive experiments demonstrate that our framework consistently outperforms the nnU-Netv2 baseline, achieving superior Dice score and Hausdorff Distance while exhibiting strong generalization to unseen domains under low-annotation conditions.
