Cross-Domain Validation of a Resection-Trained Self-Supervised Model on Multicentre Mesothelioma Biopsies
Farzaneh Seyedshahi, Francesca Damiola, Sylvie Lantuejoul, Ke Yuan, John Le Quesne
TL;DR
This study tests whether a self-supervised encoder trained on resection mesothelioma tissue can be transferred to small biopsy slides collected across multiple French centers. By deriving biopsy-specific histomorphological phenotype clusters via Leiden clustering and representing slides through compositional CLR features, the approach enables downstream tasks of epithelioid vs non-epithelioid classification and patient survival prediction using Cox modeling. Despite domain shifts in tissue type and staining, the method identifies robust biopsy-relevant phenotypes and achieves strong subtype classification (AUC ≈ 0.92) with reasonable survival prognostication (c-index ≈ 0.60–0.64), demonstrating practical clinical potential for AI-assisted mesothelioma diagnosis and prognosis on routine biopsies. The work provides a reproducible pipeline and supports cross-institutional validation, while acknowledging limitations like subtype rarity and the absence of multimodal data, suggesting future multimodal integration and broader prospective validation.
Abstract
Accurate subtype classification and outcome prediction in mesothelioma are essential for guiding therapy and patient care. Most computational pathology models are trained on large tissue images from resection specimens, limiting their use in real-world settings where small biopsies are common. We show that a self-supervised encoder trained on resection tissue can be applied to biopsy material, capturing meaningful morphological patterns. Using these patterns, the model can predict patient survival and classify tumor subtypes. This approach demonstrates the potential of AI-driven tools to support diagnosis and treatment planning in mesothelioma.
