GPO-VAE: Modeling Explainable Gene Perturbation Responses utilizing GRN-Aligned Parameter Optimization
Seungheun Baek, Soyon Park, Yan Ting Chok, Mogan Gim, Jaewoo Kang
TL;DR
GPO-VAE tackles the explainability gap in perturbation-response modeling by aligning the latent perturbation effects with a gene regulatory network via GRN-aligned parameter optimization. The model extends CRADLE-VAE with a square GRN-weight matrix $W$ and a K-hop accumulation $T_K$, guided by optimal-transport-based differential expression and sparsity regularization to yield interpretable GRNs while maintaining strong predictive accuracy. Across three Perturb-Seq datasets, GPO-VAE achieves state-of-the-art perturbation-response predictions and yields sparse, biologically meaningful GRNs that align with known regulatory pathways; it also demonstrates robust generalization to unseen perturbations. These results advance explainable biological AI by providing mechanistic, data-driven networks that support target discovery and interpretation in cellular perturbations, with open-source availability for reproducibility.
Abstract
Motivation: Predicting cellular responses to genetic perturbations is essential for understanding biological systems and developing targeted therapeutic strategies. While variational autoencoders (VAEs) have shown promise in modeling perturbation responses, their limited explainability poses a significant challenge, as the learned features often lack clear biological meaning. Nevertheless, model explainability is one of the most important aspects in the realm of biological AI. One of the most effective ways to achieve explainability is incorporating the concept of gene regulatory networks (GRNs) in designing deep learning models such as VAEs. GRNs elicit the underlying causal relationships between genes and are capable of explaining the transcriptional responses caused by genetic perturbation treatments. Results: We propose GPO-VAE, an explainable VAE enhanced by GRN-aligned Parameter Optimization that explicitly models gene regulatory networks in the latent space. Our key approach is to optimize the learnable parameters related to latent perturbation effects towards GRN-aligned explainability. Experimental results on perturbation prediction show our model achieves state-of-the-art performance in predicting transcriptional responses across multiple benchmark datasets. Furthermore, additional results on evaluating the GRN inference task reveal our model's ability to generate meaningful GRNs compared to other methods. According to qualitative analysis, GPO-VAE posseses the ability to construct biologically explainable GRNs that align with experimentally validated regulatory pathways. GPO-VAE is available at https://github.com/dmis-lab/GPO-VAE
