HistoMet: A Pan-Cancer Deep Learning Framework for Prognostic Prediction of Metastatic Progression and Site Tropism from Primary Tumor Histopathology
Yixin Chen, Ziyu Su, Lingbin Meng, Elshad Hasanov, Wei Chen, Anil Parwani, M. Khalid Khan Niazi
TL;DR
HistoMet tackles the challenge of predicting metastatic progression and site tropism from primary tumor histology by proposing a two-stage, decision-aware MIL framework that gates downstream site prediction on high-risk cases. It integrates vision-language based semantic concepts with data-driven visual prototypes, using multi-scale (10x and 20x) features and a cross-attention prototype condensation mechanism to produce interpretable slide-level predictions. The approach achieves state-of-the-art performance on a large pan-cancer dataset, reduces downstream workload, and provides qualitative interpretability through prototype-level visual-semantic mappings, while highlighting practical considerations such as potential LLM prompt hallucinations and frozen encoders. These advances suggest a path toward deployment-ready, decision-aware prognostic tools in clinical oncology that can generalize across cancer types and metastatic sites.
Abstract
Metastatic Progression remains the leading cause of cancer-related mortality, yet predicting whether a primary tumor will metastasize and where it will disseminate directly from histopathology remains a fundamental challenge. Although whole-slide images (WSIs) provide rich morphological information, prior computational pathology approaches typically address metastatic status or site prediction as isolated tasks, and do not explicitly model the clinically sequential decision process of metastatic risk assessment followed by downstream site-specific evaluation. To address this research gap, we present a decision-aware, concept-aligned MIL framework, HistoMet, for prognostic metastatic outcome prediction from primary tumor WSIs. Our proposed framework adopts a two-module prediction pipeline in which the likelihood of metastatic progression from the primary tumor is first estimated, followed by conditional prediction of metastatic site for high-risk cases. To guide representation learning and improve clinical interpretability, our framework integrates linguistically defined and data-adaptive metastatic concepts through a pretrained pathology vision-language model. We evaluate HistoMet on a multi-institutional pan-cancer cohort of 6504 patients with metastasis follow-up and site annotations. Under clinically relevant high-sensitivity screening settings (95 percent sensitivity), HistoMet significantly reduces downstream workload while maintaining high metastatic risk recall. Conditional on metastatic cases, HistoMet achieves a macro F1 of 74.6 with a standard deviation of 1.3 and a macro one-vs-rest AUC of 92.1. These results demonstrate that explicitly modeling clinical decision structure enables robust and deployable prognostic prediction of metastatic progression and site tropism directly from primary tumor histopathology.
