Joint Modelling Histology and Molecular Markers for Cancer Classification
Xiaofei Wang, Hanyu Liu, Yupei Zhang, Boyang Zhao, Hao Duan, Wanming Hu, Yonggao Mou, Stephen Price, Chao Li
TL;DR
This work tackles glioma classification under the WHO 2021 molecular pathology framework by jointly predicting histology features and molecular markers from whole-slide images. It introduces the M3C2 framework, combining a multi-scale disentangling module, an attention-based hierarchical multi-task MIL, a co-occurrence label graph for molecular markers, and a cross-modal interaction pipeline with dynamic gradient modulation and confidence-based losses. The approach demonstrates superior performance and generalizability on internal and external glioma datasets, with ablations confirming the value of each component and multi-scale fusion. By explicitly modeling histology–molecular marker interactions and leveraging multi-scale information, M3C2 holds promise for more accurate, interpretable, and cost-effective cancer diagnosis and precision oncology workflows.
Abstract
Cancers are characterized by remarkable heterogeneity and diverse prognosis. Accurate cancer classification is essential for patient stratification and clinical decision-making. Although digital pathology has been advancing cancer diagnosis and prognosis, the paradigm in cancer pathology has shifted from purely relying on histology features to incorporating molecular markers. There is an urgent need for digital pathology methods to meet the needs of the new paradigm. We introduce a novel digital pathology approach to jointly predict molecular markers and histology features and model their interactions for cancer classification. Firstly, to mitigate the challenge of cross-magnification information propagation, we propose a multi-scale disentangling module, enabling the extraction of multi-scale features from high-magnification (cellular-level) to low-magnification (tissue-level) whole slide images. Further, based on the multi-scale features, we propose an attention-based hierarchical multi-task multi-instance learning framework to simultaneously predict histology and molecular markers. Moreover, we propose a co-occurrence probability-based label correlation graph network to model the co-occurrence of molecular markers. Lastly, we design a cross-modal interaction module with the dynamic confidence constrain loss and a cross-modal gradient modulation strategy, to model the interactions of histology and molecular markers. Our experiments demonstrate that our method outperforms other state-of-the-art methods in classifying glioma, histology features and molecular markers. Our method promises to promote precise oncology with the potential to advance biomedical research and clinical applications. The code is available at https://github.com/LHY1007/M3C2
