Prototype Learning Guided Hybrid Network for Breast Tumor Segmentation in DCE-MRI
Lei Zhou, Yuzhong Zhang, Jiadong Zhang, Xuejun Qian, Chen Gong, Kun Sun, Zhongxiang Ding, Xing Wang, Zhenhui Li, Zaiyi Liu, Dinggang Shen
TL;DR
The paper tackles accurate breast tumor segmentation in DCE-MRI under computational constraints by introducing a prototype learning guided hybrid network (PLHN) that fuses a lightweight 3D CNN encoder with a parallel transformer bottleneck and a prototype-based prediction module. Key innovations include two parallel encoders for efficient global and local feature modeling, online clustering to learn category prototypes, and an attention-based fusion that combines prototype similarity with decoder features to produce refined tumor masks. A two-stage optimization strategy, targeted patch sampling, and a robust loss design underpin effective training, while a radiomics-based CAD pipeline assesses HER2 status using automatically generated masks. Across internal and external datasets, PLHN achieves state-of-the-art segmentation performance at a favorable computational cost and demonstrates potential for aiding clinical decision-making in breast cancer diagnosis and HER2 status prediction.
Abstract
Automated breast tumor segmentation based on dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) has shown great promise in clinical practice, particularly for identifying the presence of breast disease. However, accurate segmentation of breast tumors is a challenging task, often necessitating the development of complex networks. To strike an optimal trade-off between computational cost and segmentation performance, we propose a hybrid network that combines convolutional neural network (CNN) and transformer layers. Specifically, the hybrid network consists of an encoder-decoder architecture built by stacking convolution and deconvolution layers. Effective 3D transformer layers are then implemented after the encoder subnetworks to capture global dependencies between the bottleneck features. To improve the efficiency of the hybrid network, two parallel encoder subnetworks are designed for the decoder and the transformer layers, respectively. To further enhance the discriminative capability of the hybrid network, a prototype learning guided prediction module is proposed, in which category-specific prototypical features are calculated through online clustering. All learned prototypical features are finally combined with the features from the decoder for tumor mask prediction. Experimental results on private and public DCE-MRI datasets demonstrate that the proposed hybrid network outperforms state-of-the-art (SOTA) methods while maintaining a balance between segmentation accuracy and computational cost. Moreover, we demonstrate that automatically generated tumor masks can be effectively applied to distinguish the HER2-positive subtype from the HER2-negative subtype with accuracy similar to that of analysis based on manual tumor segmentation. The source code is available at https://github.com/ZhouL-lab/PLHN.
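
To make the idea of prototype-guided prediction concrete, the following is a minimal sketch, not the implementation in the repository above. It assumes that each class keeps a small set of unit-norm prototype vectors, that a voxel feature is scored by its highest cosine similarity to any prototype of a class, and that prototypes are refreshed online with a momentum update. The abstract does not specify the clustering algorithm or the fusion with decoder features, so the nearest-prototype assignment, the momentum value, and all function names here are illustrative assumptions.

```python
import math


def normalize(v):
    """Scale a vector to unit L2 norm (returned unchanged if it is all zeros)."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n > 0 else list(v)


def cosine(a, b):
    """Cosine similarity between two vectors of equal length."""
    a, b = normalize(a), normalize(b)
    return sum(x * y for x, y in zip(a, b))


def class_scores(feature, prototypes):
    """Per-class score: highest cosine similarity to any prototype of that class."""
    return {c: max(cosine(feature, p) for p in protos)
            for c, protos in prototypes.items()}


def predict(feature, prototypes):
    """Assign the feature to the class with the best-matching prototype."""
    scores = class_scores(feature, prototypes)
    return max(scores, key=scores.get)


def update_prototypes(prototypes, feature, label, momentum=0.9):
    """Online step (assumed): pull the closest prototype of `label` toward the feature."""
    protos = prototypes[label]
    k = max(range(len(protos)), key=lambda i: cosine(feature, protos[i]))
    f = normalize(feature)
    mixed = [momentum * p + (1 - momentum) * x for p, x in zip(protos[k], f)]
    protos[k] = normalize(mixed)
    return k


# Toy 3-dimensional features; real features would come from the decoder.
prototypes = {
    "background": [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],
    "tumor": [[0.0, 0.0, 1.0]],
}
voxel = [0.1, 0.2, 0.9]
before = cosine(voxel, prototypes["tumor"][0])
label = predict(voxel, prototypes)
updated_index = update_prototypes(prototypes, voxel, "tumor")
after = cosine(voxel, prototypes["tumor"][0])
```

In this toy setup the voxel is assigned to the tumor class, and the momentum update moves the tumor prototype slightly toward the voxel feature while keeping it unit-norm. A full model would compute such similarities for every voxel and combine the resulting similarity maps with decoder features before producing the final mask.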
