Boundary-Guided Learning for Gene Expression Prediction in Spatial Transcriptomics

Mingcheng Qu; Yuncong Wu; Donglin Di; Anyang Su; Tonghua Su; Yang Song; Lei Fan

Boundary-Guided Learning for Gene Expression Prediction in Spatial Transcriptomics

Mingcheng Qu, Yuncong Wu, Donglin Di, Anyang Su, Tonghua Su, Yang Song, Lei Fan

TL;DR

The paper tackles the problem of predicting spatial gene expression from whole-slide images in spatial transcriptomics, addressing the limitation that prior methods often overlook boundary-based cellular morphology and microenvironment cues. It introduces BG-TRIPLEX, a three-branch architecture (spot, in-context, global) that integrates boundary information via Multi-Head Cross-Attention, using PiDiNet for edges and HoverNet for nuclei, with a global positional encoding (APEG) to capture tissue layout. The model is trained with a fused-output $MSE$ loss plus branch-guided losses, and shows notable improvements in $PCC$ across HER2ST, STNet, and Skin datasets, with demonstrated generalization to Visium data. These findings highlight boundary features as a key driver of accurate gene-expression prediction and offer a geometry-aware approach for pathology-informed transcriptomics analyses.

Abstract

Spatial transcriptomics (ST) has emerged as an advanced technology that provides spatial context to gene expression. Recently, deep learning-based methods have shown the capability to predict gene expression from WSI data using ST data. Existing approaches typically extract features from images and the neighboring regions using pretrained models, and then develop methods to fuse this information to generate the final output. However, these methods often fail to account for the cellular structure similarity, cellular density and the interactions within the microenvironment. In this paper, we propose a framework named BG-TRIPLEX, which leverages boundary information extracted from pathological images as guiding features to enhance gene expression prediction from WSIs. Specifically, our model consists of three branches: the spot, in-context and global branches. In the spot and in-context branches, boundary information, including edge and nuclei characteristics, is extracted using pretrained models. These boundary features guide the learning of cellular morphology and the characteristics of microenvironment through Multi-Head Cross-Attention. Finally, these features are integrated with global features to predict the final output. Extensive experiments were conducted on three public ST datasets. The results demonstrate that our BG-TRIPLEX consistently outperforms existing methods in terms of Pearson Correlation Coefficient (PCC). This method highlights the crucial role of boundary features in understanding the complex interactions between WSI and gene expression, offering a promising direction for future research.

Boundary-Guided Learning for Gene Expression Prediction in Spatial Transcriptomics

TL;DR

Abstract

Boundary-Guided Learning for Gene Expression Prediction in Spatial Transcriptomics

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)