CS3: Cascade SAM for Sperm Segmentation
Yi Shi, Xu-Peng Tian, Yun-Kai Wang, Tie-Yi Zhang, Bin Yao, Hui Wang, Yong Shao, Cen-Cen Wang, Rong Zeng, De-Chuan Zhan
TL;DR
CS3 tackles the critical problem of segmenting individual sperm in overlapping images for automated morphology analysis. It introduces Cascade SAM (CS3), an unsupervised pipeline that applies multiple SAM passes in a cascade to segment heads, simple tails, and complex tails, followed by mask matching to assemble complete sperm masks. The method leverages color-based head filtering, skeletonization-based tail filtration, and end-point geometry with thresholds ($\lambda_{dis}=20$ px, $\lambda_{angle}=60^\circ$) to pair head and tail masks without labeled data. Experimental results on clinical data (≈2,000 unlabeled images and 240 expert-annotated images) show CS3 achieves higher mean IoU ($mIOU$) and mean Dice ($mDice$) than supervised and unsupervised baselines, particularly for overlapping sperms. The work enables scalable automated sperm morphology analysis and suggests applicability to segmentation of other elongated biological structures.
Abstract
Automated sperm morphology analysis plays a crucial role in the assessment of male fertility, yet its efficacy is often compromised by the challenges in accurately segmenting sperm images. Existing segmentation techniques, including the Segment Anything Model(SAM), are notably inadequate in addressing the complex issue of sperm overlap-a frequent occurrence in clinical samples. Our exploratory studies reveal that modifying image characteristics by removing sperm heads and easily segmentable areas, alongside enhancing the visibility of overlapping regions, markedly enhances SAM's efficiency in segmenting intricate sperm structures. Motivated by these findings, we present the Cascade SAM for Sperm Segmentation (CS3), an unsupervised approach specifically designed to tackle the issue of sperm overlap. This method employs a cascade application of SAM to segment sperm heads, simple tails, and complex tails in stages. Subsequently, these segmented masks are meticulously matched and joined to construct complete sperm masks. In collaboration with leading medical institutions, we have compiled a dataset comprising approximately 2,000 unlabeled sperm images to fine-tune our method, and secured expert annotations for an additional 240 images to facilitate comprehensive model assessment. Experimental results demonstrate superior performance of CS3 compared to existing methods.
