Sequential Kernel Embedding for Mediated and Time-Varying Dose Response Curves
Rahul Singh, Liyuan Xu, Arthur Gretton
TL;DR
This work develops sequential kernel embedding, a reproducing kernel Hilbert space (RKHS) method to estimate mediated and time-varying dose–response curves with continuous treatments and treatment–confounder feedback. By encoding conditional distributions and sequential plans into RKHS embeddings, the authors decouple the components of Pearl's mediation formula and Robins' g-formula, yielding simple, closed-form kernel ridge regression estimators with nonasymptotic uniform rates and, in discrete cases, semiparametric efficiency. The approach unifies mediation and dynamic treatment effect estimation under a flexible, nonparametric framework and is validated through nonlinear simulations and an empirical analysis of the US Job Corps data, with a cleaned data set provided as a benchmark. This method offers a practical, theoretically grounded path for policy evaluation where treatments and covariates are continuous and feedback loops are present, and it opens avenues for extending to counterfactual distributions and off-policy planning.
Abstract
We propose simple nonparametric estimators for mediated and time-varying dose response curves based on kernel ridge regression. By embedding Pearl's mediation formula and Robins' g-formula with kernels, we allow treatments, mediators, and covariates to be continuous in general spaces, and also allow for nonlinear treatment-confounder feedback. Our key innovation is a reproducing kernel Hilbert space technique called sequential kernel embedding, which we use to construct simple estimators that account for complex feedback. Our estimators preserve the generality of classic identification while also achieving nonasymptotic uniform rates. In nonlinear simulations with many covariates, we demonstrate strong performance. We estimate mediated and time-varying dose response curves of the US Job Corps, and clean data that may serve as a benchmark in future work. We extend our results to mediated and time-varying treatment effects and counterfactual distributions, verifying semiparametric efficiency and weak convergence.
