Multimodal Analysis of White Blood Cell Differentiation in Acute Myeloid Leukemia Patients using a β-Variational Autoencoder
Gizem Mert, Ario Sadafi, Raheleh Salehi, Nassir Navab, Carsten Marr
TL;DR
The paper addresses how to integrate morphology from single-cell images with transcriptomic profiles to understand white blood cell differentiation in AML. It introduces a beta-VAE–based multimodal framework with a Mask R-CNN prior to isolate single cells, mapping image features and scRNA-seq into a shared latent space without supervision. The approach enables joint reconstruction and smooth trajectory interpolation between progenitor and mature cell types via Gaussian Process regression, revealing coupling between cellular morphology and gene expression. The results rediscover known driver genes and propose additional candidates, demonstrating potential for improved diagnostics and mechanistic insight, while noting data limitations for trajectories and the need for further validation.
Abstract
Biomedical imaging and RNA sequencing with single-cell resolution improves our understanding of white blood cell diseases like leukemia. By combining morphological and transcriptomic data, we can gain insights into cellular functions and trajectoriess involved in blood cell differentiation. However, existing methodologies struggle with integrating morphological and transcriptomic data, leaving a significant research gap in comprehensively understanding the dynamics of cell differentiation. Here, we introduce an unsupervised method that explores and reconstructs these two modalities and uncovers the relationship between different subtypes of white blood cells from human peripheral blood smears in terms of morphology and their corresponding transcriptome. Our method is based on a beta-variational autoencoder (ß-VAE) with a customized loss function, incorporating a R-CNN architecture to distinguish single-cell from background and to minimize any interference from artifacts. This implementation of ß-VAE shows good reconstruction capability along with continuous latent embeddings, while maintaining clear differentiation between single-cell classes. Our novel approach is especially helpful to uncover the correlation of two latent features in complex biological processes such as formation of granules in the cell (granulopoiesis) with gene expression patterns. It thus provides a unique tool to improve the understanding of white blood cell maturation for biomedicine and diagnostics.
