Interpretable Neural ODEs for Gene Regulatory Network Discovery under Perturbations
Zaikang Lin, Sei Chang, Aaron Zweig, Minseo Kang, Elham Azizi, David A. Knowles
TL;DR
PerturbODE introduces a scalable, dynamics-aware approach to GRN discovery from perturbation-rich scRNA-seq data by modeling cell trajectories with a neural ODE whose parameters encode the GRN via a latent gene-module representation. The framework combines shift and perfect interventions, an OT-based loss with diffusion regularization, and a post-hoc mapping of module interactions to the full gene network $\mathbf{G} = A \ \mathrm{diag}(\alpha) \ B$, enabling both trajectory prediction and causal discovery. Across SERGIO, BoolODE, TF Atlas, and ChIP-Atlas benchmarks, PerturbODE achieves competitive or superior performance to scalable baselines and reveals interpretable regulatory modules aligned with known developmental programs. The results demonstrate strong generalization to held-out perturbations and offer a path toward integrating additional omics data to further refine GRN inference, with implications for developmental biology and therapeutic target discovery.
Abstract
Modern high-throughput biological datasets with thousands of perturbations provide the opportunity for large-scale discovery of causal graphs that represent the regulatory interactions between genes. Differentiable causal graphical models have been proposed to infer a gene regulatory network (GRN) from large scale interventional datasets, capturing the causal gene regulatory relationships from genetic perturbations. However, existing models are limited in their expressivity and scalability while failing to address the dynamic nature of biological processes such as cellular differentiation. We propose PerturbODE, a novel framework that incorporates biologically informative neural ordinary differential equations (neural ODEs) to model cell state trajectories under perturbations and derive the causal GRN from the neural ODE's parameters. We demonstrate PerturbODE's efficacy in trajectory prediction and GRN inference across simulated and real over-expression datasets.
