Table of Contents
Fetching ...

CellStream: Dynamical Optimal Transport Informed Embeddings for Reconstructing Cellular Trajectories from Snapshots Data

Yue Ling, Peiqi Zhang, Zhenyi Zhang, Peijie Zhou

TL;DR

CellStream addresses the challenge of reconstructing continuous cellular trajectories from sparse, noisy time-series scRNA-seq snapshots by jointly learning a dynamics-informed embedding and latent cellular dynamics. It combines an autoencoder with unbalanced dynamical OT, optimizing a loss that includes embedding fidelity and a Wasserstein–Fisher–Rao–based transport term in latent space, with dynamics governed by a velocity field $\mathbf{v}$ and growth term $g$ under the continuity equation $\partial_t q + \nabla_{\mathbf{z}}\cdot(\mathbf{v} q) = g q$. The framework demonstrates superior temporal coherence and noise robustness across simulated bifurcations and real datasets (EMT, iPSC) and extends to spatiotemporal contexts via spatial transcriptomics (MOSTA), outperforming state-of-the-art baselines. This end-to-end approach enables faithful reconstruction of dynamic cellular processes directly from static snapshots, with potential to inform regulatory inference and multi-omics integration.

Abstract

Single-cell RNA sequencing (scRNA-seq), especially temporally resolved datasets, enables genome-wide profiling of gene expression dynamics at single-cell resolution across discrete time points. However, current technologies provide only sparse, static snapshots of cell states and are inherently influenced by technical noise, complicating the inference and representation of continuous transcriptional dynamics. Although embedding methods can reduce dimensionality and mitigate technical noise, the majority of existing approaches typically treat trajectory inference separately from embedding construction, often neglecting temporal structure. To address this challenge, here we introduce CellStream, a novel deep learning framework that jointly learns embedding and cellular dynamics from single-cell snapshot data by integrating an autoencoder with unbalanced dynamical optimal transport. Compared to existing methods, CellStream generates dynamics-informed embeddings that robustly capture temporal developmental processes while maintaining high consistency with the underlying data manifold. We demonstrate CellStream's effectiveness on both simulated datasets and real scRNA-seq data, including spatial transcriptomics. Our experiments indicate significant quantitative improvements over state-of-the-art methods in representing cellular trajectories with enhanced temporal coherence and reduced noise sensitivity. Overall, CellStream provides a new tool for learning and representing continuous streams from the noisy, static snapshots of single-cell gene expression.

CellStream: Dynamical Optimal Transport Informed Embeddings for Reconstructing Cellular Trajectories from Snapshots Data

TL;DR

CellStream addresses the challenge of reconstructing continuous cellular trajectories from sparse, noisy time-series scRNA-seq snapshots by jointly learning a dynamics-informed embedding and latent cellular dynamics. It combines an autoencoder with unbalanced dynamical OT, optimizing a loss that includes embedding fidelity and a Wasserstein–Fisher–Rao–based transport term in latent space, with dynamics governed by a velocity field and growth term under the continuity equation . The framework demonstrates superior temporal coherence and noise robustness across simulated bifurcations and real datasets (EMT, iPSC) and extends to spatiotemporal contexts via spatial transcriptomics (MOSTA), outperforming state-of-the-art baselines. This end-to-end approach enables faithful reconstruction of dynamic cellular processes directly from static snapshots, with potential to inform regulatory inference and multi-omics integration.

Abstract

Single-cell RNA sequencing (scRNA-seq), especially temporally resolved datasets, enables genome-wide profiling of gene expression dynamics at single-cell resolution across discrete time points. However, current technologies provide only sparse, static snapshots of cell states and are inherently influenced by technical noise, complicating the inference and representation of continuous transcriptional dynamics. Although embedding methods can reduce dimensionality and mitigate technical noise, the majority of existing approaches typically treat trajectory inference separately from embedding construction, often neglecting temporal structure. To address this challenge, here we introduce CellStream, a novel deep learning framework that jointly learns embedding and cellular dynamics from single-cell snapshot data by integrating an autoencoder with unbalanced dynamical optimal transport. Compared to existing methods, CellStream generates dynamics-informed embeddings that robustly capture temporal developmental processes while maintaining high consistency with the underlying data manifold. We demonstrate CellStream's effectiveness on both simulated datasets and real scRNA-seq data, including spatial transcriptomics. Our experiments indicate significant quantitative improvements over state-of-the-art methods in representing cellular trajectories with enhanced temporal coherence and reduced noise sensitivity. Overall, CellStream provides a new tool for learning and representing continuous streams from the noisy, static snapshots of single-cell gene expression.

Paper Structure

This paper contains 31 sections, 29 equations, 7 figures, 5 tables, 1 algorithm.

Figures (7)

  • Figure 1: Overview of CellStream. CellStream is a dynamics-informed embedding model that jointly learns embeddings and continuous cellular streams from static, sparse snapshots data. CellStream uses an autoencoder and neural network to learn embeddings and continuous cellular streams from sparse snapshots data, guided by real-time trajectory feedback composed of $\mathcal{L}_{\text{OT}}$, $\mathcal{L}_{\text{WFR}}$ and $\mathcal{L}_{\text{Mass}}$.
  • Figure 2: Application in the Simulated data. (a) The dynamics-informed embedding learned by CellStream. (b) Velocity accuracy and temporal consistency of embeddings from different methods across six runs on simulated data with increasing noise level.
  • Figure 3: Application in the EMT dataset. (a) The dynamics-informed embedding learned by CellStream. (b) The embeddings learned by MIOFlow. (c) The embedding learned by VeloViz. (d-g) The embeddings from TIGON with PCA, t-SNE, UMAP and diffusion maps respectively.
  • Figure 4: Application in the iPSC dataset. (a) The dynamics-informed embedding learned by CellStream. (b) Illustration of the bifuraction event in the iPSC dataset. (c) The embedding learned by MIOFlow. (d) The embedding learned by VeloViz. (e) The embedding learned by TIGON with PCA.
  • Figure 5: Application in the MOSTA dataset. (a) The dynamics-informed embedding learned by CellStream. (b) The embedding learned by MIOFlow. (c) The embedding learned by VeloViz. (d-g) The embeddings from TIGON with PCA, t-SNE, UMAP and diffusion maps respectively.
  • ...and 2 more figures