Efficient adjustment for complex covariates: Gaining efficiency with DOPE
Alexander Mangulad Christgau, Anton Rask Lundborg, Niels Richard Hansen
TL;DR
The paper develops a generalized covariate adjustment framework for efficient ATE estimation with complex covariates, reasoning about descriptions of the information in the covariates rather than a fixed graphical model. It introduces DOPE, the Debiased Outcome-adapted Propensity Estimator, which learns an outcome-focused representation of the covariates and couples the nuisance estimates through that representation to gain robustness and efficiency over standard AIPW, especially when the covariates strongly predict treatment. The authors prove information bounds implying that adjusting for an outcome-sufficient description minimizes asymptotic variance, provide a delta-method analysis of the representation-induced error, and show that DOPE is asymptotically normal with consistent variance estimation. Empirically, DOPE improves finite-sample performance in simulations with single-index structure and gives competitive, stable adjusted effect estimates on NHANES data, including in scenarios with extreme propensity scores. The framework accommodates non-Euclidean covariates (such as texts and images) and high-dimensional settings, offering practical guidance for efficient ATE estimation in observational studies.
Abstract
Covariate adjustment is a ubiquitous method used to estimate the average treatment effect (ATE) from observational data. Assuming a known graphical structure of the data generating model, recent results give graphical criteria for optimal adjustment, which enables efficient estimation of the ATE. However, graphical approaches are challenging for high-dimensional and complex data, and it is not straightforward to specify a meaningful graphical model of non-Euclidean data such as texts. We propose a new framework that accommodates adjustment for any subset of information expressed by the covariates, and we show that the information that is minimally sufficient for prediction of the outcome given the treatment is also most efficient for adjustment. Based on our theoretical results, we propose the Debiased Outcome-adapted Propensity Estimator (DOPE) for efficient estimation of the ATE, and we provide asymptotic results for DOPE under general conditions. Compared to the augmented inverse propensity weighted (AIPW) estimator, DOPE can retain its efficiency even when the covariates are highly predictive of treatment. We illustrate this with a single-index model, and with an implementation of DOPE based on neural networks, we demonstrate its performance on simulated and real data. Our results show that DOPE provides an efficient and robust methodology for ATE estimation in various observational settings.
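To make the contrast between AIPW and an outcome-adapted propensity concrete, the following is a minimal sketch and not the authors' implementation. It assumes a linear outcome model, a logistic propensity model, and a toy simulation in which one covariate drives treatment strongly but does not affect the outcome. It omits the sample splitting and neural network representation that a full implementation would likely use. All function names (`fit_logistic`, `aipw`, `run_demo`) and all simulation parameters are hypothetical choices made for illustration.

```python
import numpy as np


def fit_logistic(X, t, n_iter=25):
    """Logistic regression via Newton's method (X must include an intercept column)."""
    coef = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ coef))
        hess = (X * (p * (1.0 - p))[:, None]).T @ X
        coef = coef + np.linalg.solve(hess, X.T @ (t - p))
    return coef


def predict_proba(X, coef):
    """Fitted treatment probabilities from logistic coefficients."""
    return 1.0 / (1.0 + np.exp(-X @ coef))


def aipw(y, t, m1, m0, prop):
    """AIPW point estimate of the ATE and a plug-in standard error."""
    psi = m1 - m0 + t * (y - m1) / prop - (1 - t) * (y - m0) / (1 - prop)
    return psi.mean(), psi.std(ddof=1) / np.sqrt(len(y))


def run_demo(n=4000, tau=1.0, seed=0):
    rng = np.random.default_rng(seed)
    W = rng.normal(size=(n, 5))
    # Assumed toy design: W[:, 0] strongly predicts treatment but not the
    # outcome; W[:, 1] is the only confounder.
    true_prop = 1.0 / (1.0 + np.exp(-(1.5 * W[:, 0] + 0.5 * W[:, 1])))
    t = (rng.random(n) < true_prop).astype(float)
    y = tau * t + 2.0 * W[:, 1] + rng.normal(size=n)

    ones = np.ones((n, 1))
    # Outcome regression of Y on (1, T, W), shared by both estimators.
    beta = np.linalg.lstsq(np.hstack([ones, t[:, None], W]), y, rcond=None)[0]
    m0 = np.hstack([ones, np.zeros((n, 1)), W]) @ beta
    m1 = np.hstack([ones, np.ones((n, 1)), W]) @ beta

    # Standard AIPW: propensity fitted on all covariates.
    X_full = np.hstack([ones, W])
    prop_full = predict_proba(X_full, fit_logistic(X_full, t))

    # Outcome-adapted variant: propensity fitted only on the one-dimensional
    # index learned by the outcome regression (a stand-in for a learned
    # representation).
    index = W @ beta[2:]
    X_idx = np.hstack([ones, index[:, None]])
    prop_idx = predict_proba(X_idx, fit_logistic(X_idx, t))

    return {
        "aipw": aipw(y, t, m1, m0, prop_full),
        "outcome_adapted": aipw(y, t, m1, m0, prop_idx),
    }


if __name__ == "__main__":
    for name, (estimate, se) in run_demo().items():
        print(f"{name}: estimate={estimate:.3f}, se={se:.3f}")
```

In this toy design the propensity given the outcome index is less extreme than the propensity given all covariates, so the outcome-adapted variant should report the smaller standard error while both estimates stay close to the simulated effect.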
