SUICA: Learning Super-high Dimensional Sparse Implicit Neural Representations for Spatial Transcriptomics
Qingtian Zhu, Yumin Zheng, Yuling Sang, Yifan Zhan, Ziyan Zhu, Jun Ding, Yinqiang Zheng
TL;DR
SUICA addresses the challenge of modeling super-high dimensional, sparsely observed spatial transcriptomics data by learning continuous, compact representations via Implicit Neural Representations guided by a Graph Autoencoder. The method decouples coordinate-to-embedding mapping (INR) from embedding-to-expression decoding, employing a decoding head and a regression-by-classification loss (with Dice regularization) to preserve sparsity and numerical fidelity. Across Stereo-seq MOSTA, Slide-seqV2, Visium, and MERFISH, SUICA achieves superior numerical fidelity, stronger bio-conservation, and robust imputation under various degradations, demonstrating a degradation-agnostic, platform-agnostic approach. This work offers a scalable, reference-free pathway to enhance spatial resolution and gene signal recovery in ST analyses, with broad implications for downstream cellular and tissue biology.
Abstract
Spatial Transcriptomics (ST) is a method that captures gene expression profiles aligned with spatial coordinates. The discrete spatial distribution and the super-high dimensional sequencing results make ST data challenging to be modeled effectively. In this paper, we manage to model ST in a continuous and compact manner by the proposed tool, SUICA, empowered by the great approximation capability of Implicit Neural Representations (INRs) that can enhance both the spatial density and the gene expression. Concretely within the proposed SUICA, we incorporate a graph-augmented Autoencoder to effectively model the context information of the unstructured spots and provide informative embeddings that are structure-aware for spatial mapping. We also tackle the extremely skewed distribution in a regression-by-classification fashion and enforce classification-based loss functions for the optimization of SUICA. By extensive experiments of a wide range of common ST platforms under varying degradations, SUICA outperforms both conventional INR variants and SOTA methods regarding numerical fidelity, statistical correlation, and bio-conservation. The prediction by SUICA also showcases amplified gene signatures that enriches the bio-conservation of the raw data and benefits subsequent analysis. The code is available at https://github.com/Szym29/SUICA.
