Karyotype AI for Precision Oncology
Zahra Shamsi, Isaac Reid, Drew Bryant, Jacob Wilson, Xiaoyu Qu, Avinava Dubey, Konik Kothari, Mostafa Dehghani, Mariya Chavarha, Valerii Likhosherstov, Brian Williams, Michael Frumkin, Fred Appelbaum, Krzysztof Choromanski, Ali Bashir, Min Fang
TL;DR
The paper addresses automated karyotyping directly from metaphase images to diagnose hematologic malignancies, confronting data scarcity and preprocessing challenges with a novel pretraining on chromosome identity followed by finetuning for aberration detection. It introduces an end-to-end Vision Transformer based pipeline that uses OWL-ViT for chromosome detection, SAM for segmentation, and dedicated ViT modules for chromosome identity and specific anomalies such as del(5q) and t(9;22), achieving high diagnostic performance and enabling zero-shot aberration detection. Key findings include a PR-AUC of $0.94$ for the targeted anomalies, rapid inference of about $15$ seconds per metaphase image, and strong performance even on rare aberrations with limited data. The work suggests substantial clinical impact by enabling faster, scalable, and more accessible karyotyping, with potential to reveal subclonal architecture and reduce turnaround times in cancer diagnostics.
Abstract
We present a machine learning method capable of accurately detecting chromosome abnormalities that cause blood cancers directly from microscope images of the metaphase stage of cell division. The pipeline is built on a series of fine-tuned Vision Transformers. Current state of the art (and standard clinical practice) requires expensive, manual expert analysis, whereas our pipeline takes only 15 seconds per metaphase image. Using a novel pretraining-finetuning strategy to mitigate the challenge of data scarcity, we achieve a high precision-recall score of 94% AUC for the clinically significant del(5q) and t(9;22) anomalies. Our method also unlocks zero-shot detection of rare aberrations based on model latent embeddings. The ability to quickly, accurately, and scalably diagnose genetic abnormalities directly from metaphase images could transform karyotyping practice and improve patient outcomes. We will make code publicly available.
