Causal Ordering Without Effect Estimation: A Framework for Using Proxies in Treatment Prioritization
Carlos Fernández-Loría, Jorge Loría
TL;DR
The paper addresses how to prioritize treatment when causal-effect estimation is impractical. It formalizes causal ordering via a noncausal proxy signal with CAS $\theta(x)$ and compares it to the true CATE $\beta(x)$. Two conditions, Dominant Moderation and Signal Monotonicity, ensure that CAS-based rankings match the true effect ordering; for proxies that are only imperfectly aligned, the authors develop a practical alignment-SNR framework. A diagnostic toolkit quantifies proxy usefulness through SNR estimation, dilution factors, and an alignment threshold. An empirical application in online advertising shows that simple baseline proxies can outperform CATE-based ordering for targeting decisions. By reframing the problem around moderators and signal quality, the work reconciles practical targeting with causal reasoning and supports credible causal decisions even when direct effect estimation is infeasible. It also offers actionable diagnostics for deciding when a predictive proxy should be preferred in real-world prioritization tasks.
Abstract
Who should we prioritize for treatment when causal effects cannot be estimated? In practice, organizations often rely on predictive proxies: ads are targeted using purchase probabilities, and retention incentives are allocated using churn-risk scores. These models are not causal, but they are often used with the aim of ranking individuals by treatment effects, a task we call causal ordering. We develop a decision-focused framework to reason about this practice. We identify conditions under which proxies recover the correct effect ordering, which hold when a proxy reflects a dominant moderator of treatment effects. We show how these conditions emerge as a useful approximation in discrete choice settings, where the propensity to act without an intervention moderates persuasion. Moreover, we extend beyond this case, demonstrating that proxies capturing a non-dominant moderator can still outperform CATE estimates when they target signals that are easier to estimate precisely. Building on these insights, we introduce diagnostic tools to assess proxy usefulness in practice. Finally, we illustrate the framework in advertising, where a simple predictive proxy outperforms heterogeneous-effect estimation methods.
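The discrete choice argument can be illustrated with a minimal sketch. It is not the paper's method or data. It assumes a logit model in which treatment shifts utility by a constant `tau`, so the effect on purchase probability is `sigmoid(a + tau) - sigmoid(a)`, which is increasing in baseline utility `a` whenever `a < -tau / 2`. All names and values (`tau`, the utility range, the noise level, `top_k_overlap`) are hypothetical choices made for illustration.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def true_effect(a, tau=0.5):
    # Assumed logit model: treatment adds tau to the latent utility a.
    return sigmoid(a + tau) - sigmoid(a)


def top_k_overlap(scores, effects, k):
    # Share of the k highest-scored units that are also among
    # the k units with the largest true effects.
    n = len(scores)
    by_score = sorted(range(n), key=lambda i: scores[i], reverse=True)[:k]
    by_effect = sorted(range(n), key=lambda i: effects[i], reverse=True)[:k]
    return len(set(by_score) & set(by_effect)) / k


rng = random.Random(0)

# Rare-purchase regime (assumption): every utility is below -tau / 2,
# so the true effect is monotone in the baseline propensity.
utilities = [rng.uniform(-6.0, -2.0) for _ in range(1000)]

baseline = [sigmoid(a) for a in utilities]     # proxy: P(purchase | no treatment)
effects = [true_effect(a) for a in utilities]  # true CATE, unobserved in practice

# Hypothetical CATE estimates: the truth plus Gaussian estimation error.
noisy_cate = [b + rng.gauss(0.0, 0.02) for b in effects]

proxy_overlap = top_k_overlap(baseline, effects, k=100)
cate_overlap = top_k_overlap(noisy_cate, effects, k=100)
print(f"proxy top-100 overlap: {proxy_overlap:.2f}")
print(f"noisy CATE top-100 overlap: {cate_overlap:.2f}")
```

In this stylized setting the baseline proxy is not an effect estimate, yet it orders units exactly as the true effects do, while an unbiased but noisy CATE estimate will typically misrank units near the top. Outside the low-propensity regime assumed here, the monotone relationship need not hold.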
