Adaptive Experiment Design with Synthetic Controls
Alihan Hüyük, Zhaozhi Qian, Mihaela van der Schaar
TL;DR
The paper tackles heterogeneous treatment effects by proposing Syntax, an adaptive exploratory trial design that uses online synthetic controls to identify subpopulations with positive effects within a fixed sample budget. It introduces a synthetic-control estimator $\hat{r}_i(\bm{\beta}) = \hat{y}_{iT}^{(1)} - \bm{\beta}^\top \hat{\bm{y}}_{\cdot T}^{(0)}$ with constraints $\bm{x}_i = X\bm{\beta}$, $\bm{z}_i \approx Z\bm{\beta}$, and $\mathbf{1}^\top\bm{\beta}=1$, proving unbiasedness and a variance bound that includes a representation-error term $\lambda\|\bm{\beta}-\bm{1}_i\|_{N^{-1}}^2$. A variance-minimizing selection of $\bm{\beta}_i^*$ feeds into an online algorithm, Syntax, which adaptively recruits subpopulations by minimizing a sensitivity index $S_i = |\hat{r}_i(\bm{\beta}_i^*)|/\sqrt{V_i(\bm{\beta}_i^*)}$ and ultimately reports $\hat{\mathcal{I}}^* = \{i: \hat{r}_i(\bm{\beta}_i^*)>0\}$. The experiments show Syntax outperforms benchmarks in environments with diminishing factor effects, achieving better FPR/TPR with substantially fewer samples, and discuss practical implications such as sample savings and allocation efficiency. The work highlights when synthetic controls are most beneficial—namely, when pre-treatment factors strongly inform latent loadings and post-treatment factors are weaker—and frames a path toward faster, more targeted clinical evaluation of heterogeneous treatment effects.
Abstract
Clinical trials are typically run in order to understand the effects of a new treatment on a given population of patients. However, patients in large populations rarely respond the same way to the same treatment. This heterogeneity in patient responses necessitates trials that investigate effects on multiple subpopulations - especially when a treatment has marginal or no benefit for the overall population but might have significant benefit for a particular subpopulation. Motivated by this need, we propose Syntax, an exploratory trial design that identifies subpopulations with positive treatment effect among many subpopulations. Syntax is sample efficient as it (i) recruits and allocates patients adaptively and (ii) estimates treatment effects by forming synthetic controls for each subpopulation that combines control samples from other subpopulations. We validate the performance of Syntax and provide insights into when it might have an advantage over conventional trial designs through experiments.
