Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation
Qing Xu, Yuxiang Luo, Wenting Duan, Zhen Chen
TL;DR
The paper addresses the need for versatile medical segmentation by coupling semantic and instance tasks rather than treating them in isolation. It introduces Co-Seg++ with a spatio-sequential prompt encoder (SSP-Encoder) and a multi-task collaborative decoder (MTC-Decoder) to enable mutual, bidirectional guidance between tasks. Across histopathology and dental CBCT datasets, Co-Seg++ achieves state-of-the-art performance in semantic, instance, and panoptic segmentation, while offering better efficiency and robustness to domain shifts and limited annotations. Ablation and interpretability analyses demonstrate the tangible benefits of the co-segmentation paradigm and cross-task prompts, guiding future work toward 3D and multi-modal medical image analysis.
Abstract
Medical image analysis is critical yet challenged by the need of jointly segmenting organs or tissues, and numerous instances for anatomical structures and tumor microenvironment analysis. Existing studies typically formulated different segmentation tasks in isolation, which overlooks the fundamental interdependencies between these tasks, leading to suboptimal segmentation performance and insufficient medical image understanding. To address this issue, we propose a Co-Seg++ framework for versatile medical segmentation. Specifically, we introduce a novel co-segmentation paradigm, allowing semantic and instance segmentation tasks to mutually enhance each other. We first devise a spatio-sequential prompt encoder (SSP-Encoder) to capture long-range spatial and sequential relationships between segmentation regions and image embeddings as prior spatial constraints. Moreover, we devise a multi-task collaborative decoder (MTC-Decoder) that leverages cross-guidance to strengthen the contextual consistency of both tasks, jointly computing semantic and instance segmentation masks. Extensive experiments on diverse CT and histopathology datasets demonstrate that the proposed Co-Seg++ outperforms state-of-the-arts in the semantic, instance, and panoptic segmentation of dental anatomical structures, histopathology tissues, and nuclei instances. The source code is available at https://github.com/xq141839/Co-Seg-Plus.
