scPPDM: A Diffusion Model for Single-Cell Drug-Response Prediction
Zhaokang Liang, Shuyang Zhuang, Xiaoran Jiao, Weian Mao, Hao Chen, Chunhua Shen
TL;DR
scPPDM introduces a diffusion-based framework for predicting single-cell drug responses directly from scRNA-seq by operating in a shared latent space with dual-condition channels (pre-perturbation state and drug dose). It employs a non-concatenative GD-Attn mechanism to fuse state and drug information, plus four-state training and factorized guidance that maps dose to guidance strength, enabling transparent what-if analyses. Across Tahoe-100M UC/UD splits, it achieves state-of-the-art performance on logFC recovery, correlation, explained variance, and DE-overlap, with notable improvements in unseen-drug generalization. The work enables interpretable, dose-controlled in-silico perturbation predictions that can reduce experimental burden and support rapid compound scoring for drug discovery and precision medicine.
Abstract
This paper introduces the Single-Cell Perturbation Prediction Diffusion Model (scPPDM), the first diffusion-based framework for single-cell drug-response prediction from scRNA-seq data. scPPDM couples two condition channels, pre-perturbation state and drug with dose, in a unified latent space via non-concatenative GD-Attn. During inference, factorized classifier-free guidance exposes two interpretable controls for state preservation and drug-response strength and maps dose to guidance magnitude for tunable intensity. Evaluated on the Tahoe-100M benchmark under two stringent regimes, unseen covariate combinations (UC) and unseen drugs (UD), scPPDM sets new state-of-the-art results across log fold-change recovery, delta correlations, explained variance, and DE-overlap. Representative gains include +36.11%/+34.21% on DEG logFC-Spearman/Pearson in UD over the second-best model. This control interface enables transparent what-if analyses and dose tuning, reducing experimental burden while preserving biological specificity.
