High-Resolution Spatial Transcriptomics from Histology Images using HisToSGE
Zhiceng Shi, Shuailin Xue, Fangfang Zhu, Wenwen Min
TL;DR
This work tackles the challenge of generating high-resolution spatial gene expression from histology by introducing HisToSGE, a model that fuses rich histology features from a Pathology Image Large Model with a learnable spatial feature encoder. The architecture comprises a two-module design: a feature extraction block that creates multimodal patch features and a feature learning block based on multi-head attention to integrate spot coordinates, followed by gene projection heads to produce expression profiles. Empirically, HisToSGE achieves state-of-the-art performance across four ST datasets, improving $PCC$ by up to ~32% and reducing $MSE$ and $MAE$ relative to strong baselines, while better preserving spatial domains and enhancing marker-gene patterns. The results highlight the practical impact of leveraging large-scale histology representations and attention-based fusion for accurate, high-resolution spatial transcriptomics analyses.
Abstract
Spatial transcriptomics (ST) is a groundbreaking genomic technology that enables spatial localization analysis of gene expression within tissue sections. However, it is significantly limited by high costs and sparse spatial resolution. An alternative, more cost-effective strategy is to use deep learning methods to predict high-density gene expression profiles from histological images. However, existing methods struggle to capture rich image features effectively or rely on low-dimensional positional coordinates, making it difficult to accurately predict high-resolution gene expression profiles. To address these limitations, we developed HisToSGE, a method that employs a Pathology Image Large Model (PILM) to extract rich image features from histological images and utilizes a feature learning module to robustly generate high-resolution gene expression profiles. We evaluated HisToSGE on four ST datasets, comparing its performance with five state-of-the-art baseline methods. The results demonstrate that HisToSGE excels in generating high-resolution gene expression profiles and performing downstream tasks such as spatial domain identification. All code and public datasets used in this paper are available at https://github.com/wenwenmin/HisToSGE and https://zenodo.org/records/12792163.
