Estimating treatment effects from single-arm trials via latent-variable modeling
Manuel Haussmann, Tran Minh Son Le, Viivi Halla-aho, Samu Kurki, Jussi V. Leinonen, Miika Koskinen, Samuel Kaski, Harri Lähdesmäki
TL;DR
This work tackles estimating treatment effects when only a single-arm trial is available, by leveraging external controls from real-world data. It introduces an identifiable latent-variable model that learns group-specific and shared latent representations, enabling both direct treatment effect estimation and patient matching without leaking post-treatment information. The model accounts for structured missingness via MNAR modeling and uses amortized variational inference to learn the predictive latent space, with identifiability guaranteed through a conditional prior and auxiliary variables. Empirical results on semi-synthetic IHDP data and a real-world dataset combining an RCT with EHRs show consistent improvements over a broad set of baselines across CATE, ATT, and survival tasks. The approach provides a practical, theoretically grounded framework for using external controls in settings where randomized trials are infeasible.
Abstract
Randomized controlled trials (RCTs) are the accepted standard for treatment effect estimation but they can be infeasible due to ethical reasons and prohibitive costs. Single-arm trials, where all patients belong to the treatment group, can be a viable alternative but require access to an external control group. We propose an identifiable deep latent-variable model for this scenario that can also account for missing covariate observations by modeling their structured missingness patterns. Our method uses amortized variational inference to learn both group-specific and identifiable shared latent representations, which can subsequently be used for {\em (i)} patient matching if treatment outcomes are not available for the treatment group, or for {\em (ii)} direct treatment effect estimation assuming outcomes are available for both groups. We evaluate the model on a public benchmark as well as on a data set consisting of a published RCT study and real-world electronic health records. Compared to previous methods, our results show improved performance both for direct treatment effect estimation as well as for effect estimation via patient matching.
