SPADE: Spatial Transcriptomics and Pathology Alignment Using a Mixture of Data Experts for an Expressive Latent Space
Ekaterina Redekop, Mara Pleasure, Zichen Wang, Kimberly Flores, Anthony Sisk, William Speier, Corey W. Arnold
TL;DR
SPADE addresses the need to jointly model histology and spatial transcriptomics to capture molecular heterogeneity in tissue. It learns an ST-informed latent space through a mixture of data experts trained with a contrastive objective on paired H&E and Visium data, guided by a two-step clustering strategy that enables hard-negative mining across many organs. Evaluated on 20 downstream tasks, SPADE consistently outperforms baselines, including methods relying on bulk RNA-seq, and shows strong improvements in cancer subtyping, survival, and biomarker prediction, with interpretable attention heatmaps highlighting tumor-focused regions. The work demonstrates the value of multimodal supervision for robust pathology representations and provides a scalable framework that can extend to additional ST modalities and organ types.
Abstract
The rapid growth of digital pathology and advances in self-supervised deep learning have enabled the development of foundational models for various pathology tasks across diverse diseases. While multimodal approaches integrating diverse data sources have emerged, a critical gap remains in the comprehensive integration of whole-slide images (WSIs) with spatial transcriptomics (ST), which is crucial for capturing critical molecular heterogeneity beyond standard hematoxylin & eosin (H&E) staining. We introduce SPADE, a foundation model that integrates histopathology with ST data to guide image representation learning within a unified framework, in effect creating an ST-informed latent space. SPADE leverages a mixture-of-data experts technique, where experts are created via two-stage imaging feature-space clustering using contrastive learning to learn representations of co-registered WSI patches and gene expression profiles. Pre-trained on the comprehensive HEST-1k dataset, SPADE is evaluated on 20 downstream tasks, demonstrating significantly superior few-shot performance compared to baseline models, highlighting the benefits of integrating morphological and molecular information into one latent space. Code and pretrained weights are available at https://github.com/uclabair/SPADE.
