DSCENet: Dynamic Screening and Clinical-Enhanced Multimodal Fusion for MPNs Subtype Classification
Yuan Zhang, Yaolei Qi, Xiaoming Qi, Yongyue Wei, Guanyu Yang
TL;DR
This work tackles the challenging problem of classifying myeloproliferative neoplasm subtypes by leveraging multimodal data from whole slide images and clinical information. The authors introduce DSCENet, which combines a Dynamic Screening Module for adaptive patch selection with a Clinical-enhanced Fusion Module that uses cross-modal attention to fuse image and clinical features. On a multicenter dataset of 383 cases, DSCENet achieves an ACC of 83.12% and an AUC around 96%, significantly outperforming state-of-the-art single-modality and fusion methods and demonstrating the value of dynamic patch screening and clinical guidance. Ablation studies corroborate the contributions of both modules, with clinical data providing substantial gains in AUC and patch screening improving feature representativeness, indicating strong potential for improving diagnostic accuracy and informing treatment planning in MPNs.
Abstract
The precise subtype classification of myeloproliferative neoplasms (MPNs) based on multimodal information, which assists clinicians in diagnosis and long-term treatment plans, is of great clinical significance. However, it remains a great challenging task due to the lack of diagnostic representativeness for local patches and the absence of diagnostic-relevant features from a single modality. In this paper, we propose a Dynamic Screening and Clinical-Enhanced Network (DSCENet) for the subtype classification of MPNs on the multimodal fusion of whole slide images (WSIs) and clinical information. (1) A dynamic screening module is proposed to flexibly adapt the feature learning of local patches, reducing the interference of irrelevant features and enhancing their diagnostic representativeness. (2) A clinical-enhanced fusion module is proposed to integrate clinical indicators to explore complementary features across modalities, providing comprehensive diagnostic information. Our approach has been validated on the real clinical data, achieving an increase of 7.91% AUC and 16.89% accuracy compared with the previous state-of-the-art (SOTA) methods. The code is available at https://github.com/yuanzhang7/DSCENet.
