HCMA-UNet: A Hybrid CNN-Mamba UNet with Axial Self-Attention for Efficient Breast Cancer Segmentation
Haoxuan Li, Wei song, Peiwu Qin, Xi Yuan, Zhenglin Chen
TL;DR
HCMA-UNet tackles the challenge of breast cancer lesion segmentation in DCE-MRI by marrying a lightweight CNN backbone with the Multi-view Axial Self-Attention Mamba (MISM) to capture both local and global context efficiently. The MISM module combines VSSB for intra-slice feature extraction and ASA for inter-slice dependencies, with an ASC channel allocation to reduce redundancy. A novel FRLoss, consisting of Foreground Center, Positive Compactness, Boundary Aware, and Hard Negative Pixels Mining components, guides feature learning to improve discrimination and boundary accuracy. Across a private and two public DCE-MRI datasets, HCMA-UNet delivers state-of-the-art performance with significantly fewer parameters (2.87M) and lower FLOPs (126.44 GFLOPs), with FRLoss showing strong cross-architecture generalization and robustness for clinical deployment.
Abstract
Breast cancer lesion segmentation in DCE-MRI remains challenging due to heterogeneous tumor morphology and indistinct boundaries. To address these challenges, this study proposes a novel hybrid segmentation network, HCMA-UNet, for lesion segmentation of breast cancer. Our network consists of a lightweight CNN backbone and a Multi-view Axial Self-Attention Mamba (MISM) module. The MISM module integrates Visual State Space Block (VSSB) and Axial Self-Attention (ASA) mechanism, effectively reducing parameters through Asymmetric Split Channel (ASC) strategy to achieve efficient tri-directional feature extraction. Our lightweight model achieves superior performance with 2.87M parameters and 126.44 GFLOPs. A Feature-guided Region-aware loss function (FRLoss) is proposed to enhance segmentation accuracy. Extensive experiments on one private and two public DCE-MRI breast cancer datasets demonstrate that our approach achieves state-of-the-art performance while maintaining computational efficiency. FRLoss also exhibits good cross-architecture generalization capabilities. The source code is available at https://github.com/Haoxuanli-Thu/HCMA-UNet.
