Bayesian Semiparametric Causal Inference: Targeted Doubly Robust Estimation of Treatment Effects
Gözde Sert, Abhishek Chakrabortty, Anirban Bhattacharya
TL;DR
The paper tackles causal inference for observational data with high-dimensional nuisance parameters by introducing DRDB, a Bayesian semiparametric approach that debiases nuisance estimation through a targeted, summary-statistics-based modeling and a density-ratio retargeting step. By decoupling nuisance learning from target inference via sample splitting and cross-fitting, DRDB yields marginal posteriors for the ATE and one-arm means that satisfy Bernstein-von Mises theorems under mild conditions and exhibit Bayesian double robustness when only one nuisance model is correctly specified. The methodology relies on a hierarchical framework that integrates debiased nuisance biases into a conditional likelihood, with tractable posteriors for the target parameters and efficient use of the full data through CF aggregation. Theoretical guarantees are complemented by extensive simulations and a real-data application (NHEFS), demonstrating accurate point estimates and nominal coverage with high-dimensional nuisance models, and the approach naturally extends to a broad class of causal estimands beyond the ATE.
Abstract
We propose a semiparametric Bayesian methodology for estimating the average treatment effect (ATE) within the potential outcomes framework using observational data with high-dimensional nuisance parameters. Our method introduces a Bayesian debiasing procedure that corrects for bias arising from nuisance estimation and employs a targeted modeling strategy based on summary statistics rather than the full data. These summary statistics are identified in a debiased manner, enabling the estimation of nuisance bias via weighted observables and facilitating hierarchical learning of the ATE. By combining debiasing with sample splitting, our approach separates nuisance estimation from inference on the target parameter, reducing sensitivity to nuisance model specification. We establish that, under mild conditions, the marginal posterior for the ATE satisfies a Bernstein-von Mises theorem when both nuisance models are correctly specified and remains consistent and robust when only one is correct, achieving Bayesian double robustness. This ensures asymptotic efficiency and frequentist validity. Extensive simulations confirm the theoretical results, demonstrating accurate point estimation and credible intervals with nominal coverage, even in high-dimensional settings. The proposed framework can also be extended to other causal estimands, and its key principles offer a general foundation for advancing Bayesian semiparametric inference more broadly.
