Multi-modal AI for comprehensive breast cancer prognostication

Jan Witowski; Ken G. Zeng; Joseph Cappadona; Jailan Elayoubi; Khalil Choucair; Elena Diana Chiru; Nancy Chan; Young-Joon Kang; Frederick Howard; Irina Ostrovnaya; Carlos Fernandez-Granda; Freya Schnabel; Zoe Steinsnyder; Ugur Ozerdem; Kangning Liu; Waleed Abdulsattar; Yu Zong; Lina Daoud; Rafic Beydoun; Anas Saad; Nitya Thakore; Mohammad Sadic; Frank Yeung; Elisa Liu; Theodore Hill; Benjamin Swett; Danielle Rigau; Andrew Clayburn; Valerie Speirs; Marcus Vetter; Lina Sojak; Simone Soysal; Daniel Baumhoer; Jia-Wern Pan; Haslina Makmur; Soo-Hwang Teo; Linda Ma Pak; Victor Angel; Dovile Zilenaite-Petrulaitiene; Arvydas Laurinavicius; Natalie Klar; Brian D. Piening; Carlo Bifulco; Sun-Young Jun; Jae Pak Yi; Su Hyun Lim; Adam Brufsky; Francisco J. Esteva; Lajos Pusztai; Yann LeCun; Krzysztof J. Geras

Multi-modal AI for comprehensive breast cancer prognostication

Jan Witowski, Ken G. Zeng, Joseph Cappadona, Jailan Elayoubi, Khalil Choucair, Elena Diana Chiru, Nancy Chan, Young-Joon Kang, Frederick Howard, Irina Ostrovnaya, Carlos Fernandez-Granda, Freya Schnabel, Zoe Steinsnyder, Ugur Ozerdem, Kangning Liu, Waleed Abdulsattar, Yu Zong, Lina Daoud, Rafic Beydoun, Anas Saad, Nitya Thakore, Mohammad Sadic, Frank Yeung, Elisa Liu, Theodore Hill, Benjamin Swett, Danielle Rigau, Andrew Clayburn, Valerie Speirs, Marcus Vetter, Lina Sojak, Simone Soysal, Daniel Baumhoer, Jia-Wern Pan, Haslina Makmur, Soo-Hwang Teo, Linda Ma Pak, Victor Angel, Dovile Zilenaite-Petrulaitiene, Arvydas Laurinavicius, Natalie Klar, Brian D. Piening, Carlo Bifulco, Sun-Young Jun, Jae Pak Yi, Su Hyun Lim, Adam Brufsky, Francisco J. Esteva, Lajos Pusztai, Yann LeCun, Krzysztof J. Geras

TL;DR

This study presents a multi-modal AI test for breast cancer prognosis that fuses pathology-derived features from a self-supervised vision transformer (Kestrel) with standard clinical variables to predict recurrence and survival. The model ensembles pathology and clinical predictions and is evaluated across 8,161 patients from 15 cohorts, achieving a pooled $C\text{-}index$ of $0.71$ for DFI and a hazard ratio of $3.63$ for high-risk patients, outperforming Oncotype DX in HR+ cohorts. Importantly, the AI test maintains accuracy across major subtypes, including TNBC and HER2+, and provides independent prognostic information beyond existing clinical and genomic markers, suggesting broad applicability and potential to refine treatment decisions. The approach offers faster, cheaper risk stratification using routine H&E slides, with strong translational potential pending prospective validation and integration into clinical workflows.

Abstract

Treatment selection in breast cancer is guided by molecular subtypes and clinical characteristics. However, current tools including genomic assays lack the accuracy required for optimal clinical decision-making. We developed a novel artificial intelligence (AI)-based approach that integrates digital pathology images with clinical data, providing a more robust and effective method for predicting the risk of cancer recurrence in breast cancer patients. Specifically, we utilized a vision transformer pan-cancer foundation model trained with self-supervised learning to extract features from digitized H&E-stained slides. These features were integrated with clinical data to form a multi-modal AI test predicting cancer recurrence and death. The test was developed and evaluated using data from a total of 8,161 female breast cancer patients across 15 cohorts originating from seven countries. Of these, 3,502 patients from five cohorts were used exclusively for evaluation, while the remaining patients were used for training. Our test accurately predicted our primary endpoint, disease-free interval, in the five evaluation cohorts (C-index: 0.71 [0.68-0.75], HR: 3.63 [3.02-4.37, p<0.001]). In a direct comparison (n=858), the AI test was more accurate than Oncotype DX, the standard-of-care 21-gene assay, achieving a C-index of 0.67 [0.61-0.74] versus 0.61 [0.49-0.73], respectively. Additionally, the AI test added independent prognostic information to Oncotype DX in a multivariate analysis (HR: 3.11 [1.91-5.09, p<0.001)]). The test demonstrated robust accuracy across major molecular breast cancer subtypes, including TNBC (C-index: 0.71 [0.62-0.81], HR: 3.81 [2.35-6.17, p=0.02]), where no diagnostic tools are currently recommended by clinical guidelines. These results suggest that our AI test improves upon the accuracy of existing prognostic tests, while being applicable to a wider range of patients.

Multi-modal AI for comprehensive breast cancer prognostication

TL;DR

Abstract

Multi-modal AI for comprehensive breast cancer prognostication

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)