Uncovering the Genetic Basis of Glioblastoma Heterogeneity through Multimodal Analysis of Whole Slide Images and RNA Sequencing Data
Ahmad Berjaoui, Louis Roussel, Eduardo Hugo Sanchez, Elizabeth Cohen-Jonathan Moyal
TL;DR
This study tackles glioblastoma heterogeneity by integrating whole-slide images and RNA-seq data through a novel multimodal deep learning framework. RNA-seq encodings are constructed via directed PPI-based gene clustering and masked autoencoding with a contrastive objective, while WSI representations are learned from 256×256 patches using ViT features and triplet loss. A cross-attention based fusion and multimodal contrastive objective align RNA and imaging modalities, enabling RNA retrieval from WSIs and revealing genetic profiles linked to distinct tumor patterns; Grad-CAM highlights key GBM-related genes and microenvironment regulators. The work identifies both known and novel genetic targets, demonstrates strong cross-modal generalization, and suggests potential avenues for personalized GBM therapies and biomarker discovery, leveraging TCGA and STEMRI datasets.
Abstract
Glioblastoma is a highly aggressive form of brain cancer characterized by rapid progression and poor prognosis. Despite advances in treatment, the underlying genetic mechanisms driving this aggressiveness remain poorly understood. In this study, we employed multimodal deep learning approaches to investigate glioblastoma heterogeneity using joint image/RNA-seq analysis. Our results reveal novel genes associated with glioblastoma. By leveraging a combination of whole-slide images and RNA-seq, as well as introducing novel methods to encode RNA-seq data, we identified specific genetic profiles that may explain different patterns of glioblastoma progression. These findings provide new insights into the genetic mechanisms underlying glioblastoma heterogeneity and highlight potential targets for therapeutic intervention. Code and data downloading instructions are available at: https://github.com/ma3oun/gbheterogeneity.
