Shrinkage Methods for Treatment Choice
Takuya Ishihara, Daisuke Kurisu
TL;DR
The paper addresses treatment choice with covariates by proposing a shrinkage rule that shrinks conditional average treatment effect (CATE) estimates toward their mean and selects the shrinkage factors by minimizing an upper bound on the maximum regret over a bounded CATE space $\Theta(\kappa)$. The approach nests the conditional empirical success (CES) rule and the pooling rule as special cases, and it yields computationally tractable shrinkage factors $w_k^{*}(\kappa)$ through a per-coordinate optimization of $\psi_k(w_k;\kappa)$ involving the function $\eta(\cdot)$. Theoretical results show that the shrinkage rule can outperform CES and pooling when $\kappa$ is moderately large or small (depending on variance heterogeneity), with bounds close to optimal and robustness to misspecification of $\Theta(\kappa)$. Numerical experiments and an empirical application to JTPA data illustrate when and how shrinkage changes treatment decisions relative to CES. Overall, the shrinkage framework provides a flexible, data-driven means to improve worst-case welfare guarantees in heterogeneous populations and remains practical for larger problem sizes.
Abstract
This study examines the problem of determining whether to treat individuals based on observed covariates. The most common decision rule is the conditional empirical success (CES) rule proposed by Manski (2004), which assigns individuals to the treatment that yields the best experimental outcome conditional on the observed covariates. In statistical estimation problems, by contrast, a common approach is to use shrinkage estimators, which shrink unbiased but noisy preliminary estimates toward the average of these estimates, because shrinkage estimators are well known to have potentially smaller mean squared errors than unshrunk estimators. Inspired by this idea, we propose a computationally tractable shrinkage rule that selects the shrinkage factor by minimizing an upper bound of the maximum regret. We then compare the maximum regret of the proposed shrinkage rule with those of the CES and pooling rules when the space of conditional average treatment effects (CATEs) is correctly specified or misspecified. Our theoretical results demonstrate that the shrinkage rule performs well in many cases, and these findings are further supported by numerical experiments. Specifically, we show that the maximum regret of the shrinkage rule can be strictly smaller than those of the CES and pooling rules in certain cases when the space of CATEs is correctly specified. In addition, we find that the shrinkage rule is robust against misspecification of the space of CATEs. Finally, we apply our method to experimental data from the National Job Training Partnership Act Study.
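
To make the relationship among the three rules concrete, here is a minimal Python sketch. It is not the paper's procedure. It assumes an unweighted mean of the group-level CATE estimates as the shrinkage target, a hand-picked shrinkage factor `w` in place of the regret-minimizing $w_k^{*}(\kappa)$ (whose objective $\psi_k$ and function $\eta$ are not given in this summary), and a tie-breaking convention of treating when the shrunk estimate is exactly zero. The function name `shrinkage_rule` and the numbers are hypothetical.

```python
import numpy as np


def shrinkage_rule(tau_hat, w):
    """Treat group k when its shrunk CATE estimate is non-negative.

    tau_hat : per-group CATE estimates (hypothetical inputs).
    w       : shrinkage factor(s) in [0, 1], scalar or one per group.
              w = 1 keeps each estimate as is (CES-style decision);
              w = 0 replaces every estimate by the mean (pooling-style).
    """
    tau_hat = np.asarray(tau_hat, dtype=float)
    w = np.broadcast_to(np.asarray(w, dtype=float), tau_hat.shape)
    # Assumption: the shrinkage target is the simple unweighted mean.
    shrunk = w * tau_hat + (1.0 - w) * tau_hat.mean()
    # Assumption: ties at zero are resolved in favor of treatment.
    return shrunk >= 0.0


# Made-up estimates for four covariate groups (mean is 0.125).
tau_hat = np.array([1.0, -0.25, 0.5, -0.75])

ces = shrinkage_rule(tau_hat, 1.0)       # decide group by group
pooling = shrinkage_rule(tau_hat, 0.0)   # one decision for everyone
partial = shrinkage_rule(tau_hat, 0.25)  # in between

print(ces.tolist())      # [True, False, True, False]
print(pooling.tolist())  # [True, True, True, True]
print(partial.tolist())  # [True, True, True, False]
```

With these numbers, partial shrinkage flips the decision for the second group, whose small negative estimate is outweighed by the positive mean, while the strongly negative fourth group stays untreated. In the paper the factors are chosen by minimizing an upper bound on the maximum regret rather than fixed by hand.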
