Accurate Spatial Gene Expression Prediction by integrating Multi-resolution features
Youngmin Chung, Ji Hun Ha, Kyeong Chan Im, Joo Sang Lee
TL;DR
TRIPLEX addresses the challenge of predicting spatial gene expression from WSIs by leveraging multi-resolution information from target spots, surrounding neighborhoods, and global tissue context. It uses three dedicated encoders and a fusion layer with a fusion-loss objective to integrate information efficiently, aided by the APEG positional encoding for irregular WSIs. Across three ST datasets and external Visium data, TRIPLEX achieves superior $MSE$, $MAE$, and $PCC$ metrics, with notable gains in highly predictive genes and robust generalization to unseen tissue types. The approach holds potential to improve cancer diagnostics by providing accurate, interpretable spatial gene expression predictions aligned with tumor annotations, while maintaining practical computational costs. The combination of cross-attention-based fusion, multi-resolution tokens, and thorough methodological rigor strengthens its applicability to clinical spatial omics analyses.
Abstract
Recent advancements in Spatial Transcriptomics (ST) technology have facilitated detailed gene expression analysis within tissue contexts. However, the high costs and methodological limitations of ST necessitate a more robust predictive model. In response, this paper introduces TRIPLEX, a novel deep learning framework designed to predict spatial gene expression from Whole Slide Images (WSIs). TRIPLEX uniquely harnesses multi-resolution features, capturing cellular morphology at individual spots, the local context around these spots, and the global tissue organization. By integrating these features through an effective fusion strategy, TRIPLEX achieves accurate gene expression prediction. Our comprehensive benchmark study, conducted on three public ST datasets and supplemented with Visium data from 10X Genomics, demonstrates that TRIPLEX outperforms current state-of-the-art models in Mean Squared Error (MSE), Mean Absolute Error (MAE), and Pearson Correlation Coefficient (PCC). The model's predictions align closely with ground truth gene expression profiles and tumor annotations, underscoring TRIPLEX's potential in advancing cancer diagnosis and treatment.
