SPHENIC: Topology-Aware Multi-View Clustering for Spatial Transcriptomics
Chenkai Guo, Yikai Zhu, Renxiang Guan, Jinli Ma, Siwei Wang, Ke Liang, Guangdun Peng, Dayu Hu
TL;DR
SPHENIC tackles the challenge of robust spatial clustering in spatial transcriptomics by integrating topology-aware representations via Extended Persistent Homology and enforcing spatial coherence through a dual-regularized objective. It builds a topology-enhanced, multi-view graph fusion network that jointly leverages gene-expression, spatial coordinates, and topology-derived features in an end-to-end framework. The approach achieves state-of-the-art clustering performance across 11 tissue slices (DLPFC, HBC, MBA), with substantial ARI gains and validated through ablations showing the critical roles of EPH and DualRO. While delivering strong practical impact for deciphering tissue architecture, the method incurs computational overhead due to EPH, highlighting a trade-off between topology fidelity and scalability.
Abstract
Spatial transcriptomics clustering is pivotal for identifying cell subpopulations by leveraging spatial location information. While recent graph-based methods modeling cell-cell interactions have improved clustering accuracy, they remain limited in two key aspects: (i) reliance on local aggregation in static graphs often fails to capture robust global topological structures (e.g., loops and voids) and is vulnerable to noisy edges; and (ii) dimensionality reduction techniques frequently neglect spatial coherence, causing physically adjacent spots to be erroneously separated in the latent space. To overcome these challenges, we propose SPHENIC, a Spatial Persistent Homology-Enhanced Neighborhood Integrative Clustering method. Specifically, it explicitly incorporates topology-invariant features into the clustering network to ensure robust representation learning against noise. Furthermore, we design a dual-regularized optimization module that imposes spatial constraints alongside distributional optimization, ensuring that the embedding space preserves the physical proximity of cells. Extensive experiments on 11 benchmark datasets demonstrate that SPHENIC outperforms state-of-the-art methods by 4.19%-9.14%, validating its superiority in characterizing complex tissue architectures.
