Feature Responsiveness Scores: Model-Agnostic Explanations for Recourse
Harry Cheon, Anneke Wernerfelt, Sorelle A. Friedler, Berk Ustun
TL;DR
This work tackles whether explanations given to decision subjects truly enable recourse in high-stakes settings. It introduces feature responsiveness scores, a model-agnostic measure of the probability that an intervention on a feature leads to a target prediction under defined actionability constraints, captured via reachable sets. The authors develop sampling and enumeration methods to compute these scores, demonstrate that standard SHAP/LIME explanations often highlight unresponsive or immutable features, and show that responsiveness-based explanations improve recourse while flagging fixed predictions. The results, validated on multiple lending datasets, highlight the practical need to incorporate actionability, degree, and direction information into explanations, and provide an open-source library to compute responsiveness scores for real-world deployment.
Abstract
Consumer protection rules require companies that deploy models to automate decisions in high-stakes settings to explain predictions to decision subjects. These rules are motivated, in part, by the belief that explanations can promote recourse by revealing information that decision subjects can use to contest or overturn their predictions. In practice, companies provide individuals with a list of principal reasons based on feature importance derived from methods like SHAP and LIME. In this work, we show how common practices can fail to provide recourse and propose to highlight features based on their responsiveness -- the probability that a decision subject can attain a target prediction through an arbitrary intervention on the feature. We develop efficient methods to compute responsiveness scores for any model and actionability constraints. We show that standard practices in lending can undermine decision subjects by highlighting unresponsive features and explaining predictions that are fixed.
