TransST: Transfer Learning Embedded Spatial Factor Modeling of Spatial Transcriptomics Data

Shuo Shuo Liu; Shikun Wang; Yuxuan Chen; Anil K. Rustgi; Ming Yuan; Jianhua Hu

TL;DR

TransST introduces a transfer-learning embedded spatial factor modeling framework to enhance spatial transcriptomics analysis by leveraging external labeled data. It combines supervised probabilistic dimension reduction on a source dataset, adaptive transfer of the learned loading matrix to a target dataset, and a spatial Gaussian mixture model with Markov random field smoothing to cluster cells with spatial coherence. Across simulations and real datasets (breast cancer, DLPFC brain, mouse embryo, cSCC), TransST improves clustering accuracy and identifies biologically meaningful cell types and driving genes, outperforming existing methods. The approach offers robust, scalable cross-study integration to better characterize cellular heterogeneity in spatial contexts and to detect biomarkers within spatially resolved data.
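The spatial smoothing idea in the last step can be illustrated with a toy sketch. The code below is not the TransST implementation: it assumes a one-dimensional factor score per spot, a regular grid with 4-neighbor adjacency, fixed cluster means and a shared standard deviation, and a Potts-type Markov random field prior optimized by iterated conditional modes (ICM). The function names, the parameter `beta`, and the toy grid are all hypothetical and chosen only for illustration.

```python
import math


def gaussian_loglik(value, mean, sigma):
    """Log-density of a 1-D Gaussian, up to an additive constant."""
    return -0.5 * ((value - mean) / sigma) ** 2 - math.log(sigma)


def icm_potts_labels(values, means, sigma, beta, n_rows, n_cols, n_iter=10):
    """Assign a cluster label to each grid spot (assumed setup, not TransST itself).

    values : factor scores in row-major grid order (one score per spot)
    means  : fixed cluster means of the Gaussian mixture
    beta   : Potts smoothing strength; 0 disables spatial smoothing
    """
    k = len(means)
    # Start from the label that maximizes the Gaussian likelihood alone.
    labels = [
        max(range(k), key=lambda z: gaussian_loglik(v, means[z], sigma))
        for v in values
    ]
    for _ in range(n_iter):
        changed = False
        for r in range(n_rows):
            for c in range(n_cols):
                i = r * n_cols + c
                # Collect current labels of the up/down/left/right neighbors.
                neighbors = []
                for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < n_rows and 0 <= cc < n_cols:
                        neighbors.append(labels[rr * n_cols + cc])

                # Score = data log-likelihood + beta * number of agreeing neighbors.
                def score(z):
                    agree = sum(1 for n in neighbors if n == z)
                    return gaussian_loglik(values[i], means[z], sigma) + beta * agree

                best = max(range(k), key=score)
                if best != labels[i]:
                    labels[i] = best
                    changed = True
        if not changed:
            break  # labels are stable, so ICM has converged
    return labels


# Toy 3x3 grid: the center spot is ambiguous, its neighbors are clearly cluster 0.
grid = [0.0, 0.0, 0.0,
        0.0, 0.6, 0.0,
        0.0, 0.0, 0.0]
no_smoothing = icm_potts_labels(grid, [0.0, 1.0], 1.0, 0.0, 3, 3)
smoothed = icm_potts_labels(grid, [0.0, 1.0], 1.0, 1.0, 3, 3)
print(no_smoothing)
print(smoothed)
```

With `beta = 0` the center spot is labeled by its own score alone; with a positive `beta` its four neighbors can outvote a weak likelihood preference, which is the sense in which the prior encourages spatially coherent clusters. A full method would also re-estimate the mixture parameters, which this sketch keeps fixed.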

Abstract

Background: Spatial transcriptomics has emerged as a powerful tool in biomedical research because of its ability to capture both the spatial context and the abundance of the complete RNA transcript profile in organs of interest. However, limitations of the technology, such as relatively low resolution and comparatively insufficient sequencing depth, make it difficult to reliably extract real biological signals from these data. To alleviate this challenge, we propose a novel transfer learning framework, referred to as TransST, that adaptively leverages cell-labeled information from external sources when inferring cell-level heterogeneity in a target spatial transcriptomics dataset. Results: Applications to several real studies, as well as a number of simulation settings, show that our approach significantly improves on existing techniques. For example, in the breast cancer study, TransST successfully identifies five biologically meaningful cell clusters, including the two subgroups of cancer in situ and invasive cancer; in addition, among all the studied methods, only TransST is able to separate the adipose tissues from the connective tissues. Conclusions: In summary, the proposed method TransST is both effective and robust in identifying cell subclusters and detecting the corresponding driving biomarkers in spatial transcriptomics data.
