Detecting Spatial Dependence in Transcriptomics Data using Vectorised Persistence Diagrams
Katharina Limbeck, Bastian Rieck
TL;DR
This work tackles detecting spatial dependence in spatial transcriptomics by leveraging persistent homology (PH) and functional persistence summaries. It introduces a non-parametric one-sample permutation test based on Betti curves and persistence landscapes to identify spatially variable genes, comparing against Moran's I, sepal, and SpatialDE on simulated and real data. PH-based methods show strong robustness to zero-inflation and noise, offering higher specificity and complementary information to existing approaches, often identifying top SVGs with reliable stability. The approach is scalable through vectorised implementations and provides a flexible framework for integrating topology into spatial omics analyses, with potential applicability to other spatial graph data beyond transcriptomics.
Abstract
Evaluating spatial patterns in data is an integral task across various domains, including geostatistics, astronomy, and spatial tissue biology. The analysis of transcriptomics data in particular relies on methods for detecting spatially-dependent features that exhibit significant spatial patterns for both explanatory analysis and feature selection. However, given the complex and high-dimensional nature of these data, there is a need for robust, stable, and reliable descriptors of spatial dependence. We leverage the stability and multiscale properties of persistent homology to address this task. To this end, we introduce a novel framework using functional topological summaries, such as Betti curves and persistence landscapes, for identifying and describing non-random patterns in spatial data. In particular, we propose a non-parametric one-sample permutation test for spatial dependence and investigate its utility across both simulated and real spatial omics data. Our vectorised approach outperforms baseline methods at accurately detecting spatial dependence. Further, we find that our method is more robust to outliers than alternative tests using Moran's I.
