Empirically Understanding the Value of Prediction in Allocation
Unai Fischer-Abaigar, Emily Aiken, Christoph Kern, Juan Carlos Perdomo
TL;DR
The paper tackles the question of when investing in prediction pays off in allocating scarce social resources. It introduces an empirical framework and the rvP toolkit that allow practitioners to compare prediction against capacity expansion and treatment improvements using domain-specific data, rather than relying on stylized theory. Through two case studies—German employment services and Ethiopian cash-transfer targeting—the authors show how the relative value of prediction depends on factors like capacity constraints, harm-versus-benefit considerations, and baseline data coverage, and they demonstrate how targeted data collection can be more efficient when improvements are focused near decision thresholds. The work provides actionable insights and replication resources, enabling policymakers to quantify welfare gains from prediction and to tailor investments to their specific context and budget. Overall, the contribution is a practical, data-driven approach to meta-design choices in allocation problems, expanding the toolkit for evaluating where predictive investments yield the strongest welfare returns.
Abstract
Institutions increasingly use prediction to allocate scarce resources. From a design perspective, better predictions compete with other investments, such as expanding capacity or improving treatment quality. Here, the big question is not how to solve a specific allocation problem, but rather which problem to solve. In this work, we develop an empirical toolkit to help planners form principled answers to this question and quantify the bottom-line welfare impact of investments in prediction versus other policy levers such as expanding capacity and improving treatment quality. Applying our framework in two real-world case studies on German employment services and poverty targeting in Ethiopia, we illustrate how decision-makers can reliably derive context-specific conclusions about the relative value of prediction in their allocation problem. We make our software toolkit, rvp, and parts of our data available in order to enable future empirical work in this area.
