SEPAL: Spatial Gene Expression Prediction from Local Graphs
Gabriel Mejia, Paula Cárdenas, Daniela Ruiz, Angela Castillo, Pablo Arbeláez
TL;DR
SEPAL addresses the challenge of predicting spatial gene expression from histology by introducing a delta-based supervision scheme and a two-stage local-graph framework. A local image encoder first predicts gene expression deltas relative to the training mean, which is then refined by a graph neural network operating on a neighborhood graph to produce spatial corrections; the final expression is the delta plus the training mean. The method leverages $\ar{y}_{train}$, $\Delta y = y - \\bar{y}_{train}$, and a graph-based aggregation to capture local spatial dependencies, with genes selected via Moran's I and a robust denoising/benchmarking pipeline. Across two breast cancer datasets (Visium and STNet), SEPAL achieves state-of-the-art performance on multiple metrics, demonstrating the value of integrating local spatial context for spatial transcriptomics and enabling more accurate interpretation of tissue morphology and molecular profiles.
Abstract
Spatial transcriptomics is an emerging technology that aligns histopathology images with spatially resolved gene expression profiling. It holds the potential for understanding many diseases but faces significant bottlenecks such as specialized equipment and domain expertise. In this work, we present SEPAL, a new model for predicting genetic profiles from visual tissue appearance. Our method exploits the biological biases of the problem by directly supervising relative differences with respect to mean expression, and leverages local visual context at every coordinate to make predictions using a graph neural network. This approach closes the gap between complete locality and complete globality in current methods. In addition, we propose a novel benchmark that aims to better define the task by following current best practices in transcriptomics and restricting the prediction variables to only those with clear spatial patterns. Our extensive evaluation in two different human breast cancer datasets indicates that SEPAL outperforms previous state-of-the-art methods and other mechanisms of including spatial context.
