Policy-Oriented Binary Classification: Improving (KD-)CART Final Splits for Subpopulation Targeting
Lei Bill Wang, Zhenbang Jiao, Fangyi Wang
TL;DR
This paper defines Latent Probability Classification (LPC) and shows that standard CART and KD-CART splits are suboptimal for policy targeting tasks. It introduces Maximizing Distance Final Split (MDFS), along with Penalized Final Split (PFS) and weighted Empirical risk Final Split (wEFS), to produce splits that strictly dominate CART under appropriate assumptions, with MDFS providing a consistent estimator of the unique optimal split $s^*$ defined by $\eta(s^*)=c$. The methods are extended to knowledge-distillation settings and evaluated on synthetic and real-world datasets, where MDFS/PFS/wEFS consistently outperform CART and KD-CART, with RF-MDFS often delivering the strongest gains. The results have practical implications for targeting vulnerable subpopulations under fixed resource constraints, and the framework accommodates general thresholds $c$ as well as potential extensions to more complex, multi-node scenarios.
Abstract
Policymakers often use recursive binary split rules to partition populations based on binary outcomes and target subpopulations whose probability of the binary event exceeds a threshold. We call such problems Latent Probability Classification (LPC). Practitioners typically employ Classification and Regression Trees (CART) for LPC. We prove that in the context of LPC, classic CART and the knowledge distillation method, whose student model is a CART (referred to as KD-CART), are suboptimal. We propose Maximizing Distance Final Split (MDFS), which generates split rules that strictly dominate CART/KD-CART under the unique intersect assumption. MDFS identifies the unique best split rule, is consistent, and targets more vulnerable subpopulations than CART/KD-CART. To relax the unique intersect assumption, we additionally propose Penalized Final Split (PFS) and weighted Empirical risk Final Split (wEFS). Through extensive simulation studies, we demonstrate that the proposed methods predominantly outperform CART/KD-CART. When applied to real-world datasets, MDFS generates policies that target more vulnerable subpopulations than the CART/KD-CART.
