Dynamic Regularized CBDT: Variance-Calibrated Causal Boosting for Interpretable Heterogeneous Treatment Effects
Yichen Liu
TL;DR
We address the challenge of accurately, interpretably, and calibratedly estimating heterogeneous treatment effects from observational data. The Dynamic Regularized CBDT framework extends gradient-boosted decision trees with a composite loss that jointly minimizes prediction error, intra-group variance, global calibration, and ATE calibration, while dynamically updating regularization through gradient statistics. Theoretical guarantees show a PEHE upper-bound reduction of order $O(\sqrt{\lambda+\alpha})$ and convergence properties, complemented by empirical evidence on IHDP and MIMIC-III-based datasets demonstrating improved PEHE, ATE accuracy, calibration coverage, and computational efficiency. The results indicate that dynamic variance regularization yields tighter error bounds and more reliable, interpretable causal rules, with practical impact for high-stakes decision support in healthcare and policy.
Abstract
Heterogeneous treatment effect estimation in high-stakes applications demands models that simultaneously optimize precision, interpretability, and calibration. Many existing tree-based causal inference techniques, however, exhibit high estimation errors when applied to observational data because they struggle to capture complex interactions among factors and rely on static regularization schemes. In this work, we propose Dynamic Regularized Causal Boosted Decision Trees (CBDT), a novel framework that integrates variance regularization and average treatment effect calibration into the loss function of gradient boosted decision trees. Our approach dynamically updates the regularization parameters using gradient statistics to better balance the bias-variance tradeoff. Extensive experiments on standard benchmark datasets and real-world clinical data demonstrate that the proposed method significantly improves estimation accuracy while maintaining reliable coverage of true treatment effects. In an intensive care unit patient triage study, the method successfully identified clinically actionable rules and achieved high accuracy in treatment effect estimation. The results validate that dynamic regularization can effectively tighten error bounds and enhance both predictive performance and model interpretability.
