PRESCRIBE: Predicting Single-Cell Responses with Bayesian Estimation
Jiabei Cheng, Changxi Chi, Jingbo Zhou, Hongyi Xin, Jun Xia
TL;DR
PRESCRIBE addresses the problem of predicting single-cell responses to unseen gene perturbations by jointly modeling data and model uncertainty. It introduces a multivariate Natural Posterior Network that outputs not only post-perturbation expression statistics but also an evidence-driven uncertainty signal, operationalized via a pseudo E-distance that combines predictive entropy and posterior evidence. The approach yields well-calibrated, instance-level uncertainty estimates and improves predictive accuracy when unreliable predictions are filtered, outperforming baselines across multiple datasets. This framework enhances reliability for in silico perturbation studies and can generalize to other scientific domains where out-of-distribution predictions and inherent data stochasticity matter. Overall, PRESCRIBE offers a principled, scalable way to quantify and leverage uncertainty in complex, high-dimensional biological prediction tasks.
Abstract
In single-cell perturbation prediction, a central task is to forecast the effects of perturbing a gene unseen in the training data. The efficacy of such predictions depends on two factors: (1) the similarity of the target gene to those covered in the training data, which informs model (epistemic) uncertainty, and (2) the quality of the corresponding training data, which reflects data (aleatoric) uncertainty. Both factors are critical for determining the reliability of a prediction, particularly as gene perturbation is an inherently stochastic biochemical process. In this paper, we propose PRESCRIBE (PREdicting Single-Cell Response wIth Bayesian Estimation), a multivariate deep evidential regression framework designed to measure both sources of uncertainty jointly. Our analysis demonstrates that PRESCRIBE effectively estimates a confidence score for each prediction, which strongly correlates with its empirical accuracy. This capability enables the filtering of untrustworthy results, and in our experiments, it achieves steady accuracy improvements of over 3% compared to comparable baselines.
