Bayesian Nonparametrics for Principal Stratification with Continuous Post-Treatment Variables
Dafne Zorzetto, Antonio Canale, Fabrizia Mealli, Francesca Dominici, Falco J. Bargagli-Stoffi
TL;DR
This paper tackles causal inference under principal stratification when the post-treatment variable is continuous. It introduces CASBAH, a Confounders-Aware SHared-atoms Bayesian mixture model that uses a dependent Dirichlet process with shared atoms to model potential post-treatment variables while incorporating covariate-dependent weights, enabling data-adaptive discovery of principal strata and full uncertainty quantification of stratum membership. Through extensive simulations, CASBAH outperforms existing methods in identifying principal strata and estimating stratum-specific causal effects. The method is applied to the 2005 National Ambient Air Quality Standards revision, revealing three strata with distinct patterns in PM$_{2.5}$ changes and mortality, and demonstrating the practical value of stratum-aware causal analysis for environmental policy evaluation.
Abstract
Principal stratification provides a causal inference framework for investigating treatment effects in the presence of a post-treatment variable. Principal strata play a key role in characterizing the treatment effect by identifying groups of units with the same or similar values for the potential post-treatment variable at all treatment levels. The literature has focused mainly on binary post-treatment variables. Few papers considered continuous post-treatment variables. In the presence of a continuous post-treatment, a challenge is how to identify and characterize meaningful coarsening of the latent principal strata that lead to interpretable principal causal effects. This paper introduces the Confounders-Aware SHared atoms BAyesian mixture (CASBAH), a novel approach for principal stratification with binary treatment and continuous post-treatment variables. CASBAH leverages Bayesian nonparametric priors with an innovative hierarchical structure for the potential post-treatment outcomes that overcomes some of the limitations of previous works. Specifically, the novel features of our method allow for (i) identifying coarsened principal strata through a data-adaptive approach and (ii) providing a comprehensive quantification of the uncertainty surrounding stratum membership. Through Monte Carlo simulations, we show that the proposed methodology performs better than existing methods in characterizing the principal strata and estimating principal effects of the treatment. Finally, CASBAH is applied to a case study in which we estimate the causal effects of US national air quality regulations on pollution levels and health outcomes.
