Representation Learning for Distributional Perturbation Extrapolation
Julius von Kügelgen, Jakob Ketterer, Xinwei Shen, Nicolai Meinshausen, Jonas Peters
TL;DR
The paper tackles distributional perturbation extrapolation: predicting the full distribution of omics-like observations under unseen perturbations. It introduces a latent-space model where perturbations act as additive mean shifts ${\bm{Z}}^{\mathrm{pert}}={\bm{Z}}^{\mathrm{base}}+{\bm{W}}{\bm{l}}$ followed by a nonlinear generator ${\bm{f}}({\bm{Z}}^{\mathrm{pert}}, {\bm{\varepsilon}})$, and proves identifiability up to an affine transformation given diverse training perturbations, enabling extrapolation to perturbations in the span of training ${\bm{l}}_e$. The Perturbation Distribution Autoencoder (PDAE) is then proposed to estimate the identifiable components by matching distributional predictions across domains using the energy score, with a workflow consisting of an encoder, a perturbation module, and a stochastic decoder. Empirical results on synthetic data show that PDAE achieves superior distributional fidelity and mean prediction in the in-distribution regime and provides the best-performing extrapolation among methods not explicitly designed for distributional reconstruction, while acknowledging limitations for out-of-distribution generalization and decoder extrapolation. The work advances theory in identifiable representation learning for extrapolation and links to causal modelling by framing perturbations as latent shifts, with practical impact for predicting effects of unseen perturbation combinations in biology and related domains.
Abstract
We consider the problem of modelling the effects of unseen perturbations such as gene knockdowns or drug combinations on low-level measurements such as RNA sequencing data. Specifically, given data collected under some perturbations, we aim to predict the distribution of measurements for new perturbations. To address this challenging extrapolation task, we posit that perturbations act additively in a suitable, unknown embedding space. More precisely, we formulate the generative process underlying the observed data as a latent variable model, in which perturbations amount to mean shifts in latent space and can be combined additively. Unlike previous work, we prove that, given sufficiently diverse training perturbations, the representation and perturbation effects are identifiable up to affine transformation, and use this to characterize the class of unseen perturbations for which we obtain extrapolation guarantees. To estimate the model from data, we propose a new method, the perturbation distribution autoencoder (PDAE), which is trained by maximising the distributional similarity between true and predicted perturbation distributions. The trained model can then be used to predict previously unseen perturbation distributions. Empirical evidence suggests that PDAE compares favourably to existing methods and baselines at predicting the effects of unseen perturbations.
