Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment
Eli Ben-Michael, D. James Greiner, Kosuke Imai, Zhichao Jiang
TL;DR
The paper tackles learning improvements to deterministic pre-trial risk assessments by addressing identifiability gaps arising from lack of overlap with a fixed baseline policy. It introduces a maximin robust optimization framework that partially identifies policy value and guarantees safety relative to the status quo under plausible outcome models. Applying this to the PSA-DMF system with FTA, NCA, and NVCA scores, the authors find safe improvements are possible for NVCA threshold adjustments but largely cannot justify changes to FTA/NCA scores or broader DMF matrices given the data constraints. The approach provides a principled extrapolation-based policy-learning tool for high-stakes, rule-based systems, and highlights when and where data can support policy modifications. The work also underscores the importance of model class selection, confidence band construction, and the trade-off between safety and potential gains in deterministic-policy settings.
Abstract
Algorithmic recommendations and decisions have become ubiquitous in today's society. Many of these data-driven policies, especially in the realm of public policy, are based on known, deterministic rules to ensure their transparency and interpretability. We examine a particular case of algorithmic pre-trial risk assessments in the US criminal justice system, which provide deterministic classification scores and recommendations to help judges make release decisions. Our goal is to analyze data from a unique field experiment on an algorithmic pre-trial risk assessment to investigate whether the scores and recommendations can be improved. Unfortunately, prior methods for policy learning are not applicable because they require existing policies to be stochastic. We develop a maximin robust optimization approach that partially identifies the expected utility of a policy, and then finds a policy that maximizes the worst-case expected utility. The resulting policy has a statistical safety property, limiting the probability of producing a worse policy than the existing one, under structural assumptions about the outcomes. Our analysis of data from the field experiment shows that we can safely improve certain components of the risk assessment instrument by classifying arrestees as lower risk under a wide range of utility specifications, though the analysis is not informative about several components of the instrument.
