Parameterising the effect of a continuous treatment using average derivative effects
Oliver J. Hines, Karla Diaz-Ordaz, Stijn Vansteelandt
TL;DR
This paper develops a unifying framework for causal effects with continuous treatments by focusing on weighted average derivative effects (ADEs) and their Riesz representers. It defines a class of estimands 𝓡 that connects weighted ADEs and weighted ATEs, derives optimally efficient representations, and shows how, under mild conditions, the least-squares estimands ψ and Ψ emerge as interpretable, robust targets. It then provides density-free, efficient estimators for ψ and Ψ using one-step influence-function approaches with cross-fitting, and demonstrates performance in simulations and a Warfarin-dose analysis. The work enables practical, model-agnostic inference for continuous treatments and offers a principled path to compare weighted causal effects across populations, with favorable finite-sample properties and broad applicability in biostatistics and epidemiology.
Abstract
The average treatment effect (ATE) is commonly used to quantify the main effect of a binary treatment on an outcome. Extensions to continuous treatments are usually based on the dose-response curve or shift interventions, but both require strong overlap conditions and the resulting curves may be difficult to summarise. We focus instead on average derivative effects (ADEs) that are scalar estimands related to infinitesimal shift interventions requiring only local overlap assumptions. ADEs, however, are rarely used in practice because their estimation usually requires estimating conditional density functions. By characterising the Riesz representers of weighted ADEs, we propose a new class of estimands that provides a unified view of weighted ADEs/ATEs when the treatment is continuous/binary. We derive the estimand in our class that minimises the nonparametric efficiency bound, thereby extending optimal weighting results from the binary treatment literature to the continuous setting. We develop efficient estimators for two weighted ADEs that avoid density estimation and are amenable to modern machine learning methods, which we evaluate in simulations and an applied analysis of Warfarin dosage effects.
