Learning Recourse Costs from Pairwise Feature Comparisons
Kaivalya Rawal, Himabindu Lakkaraju
TL;DR
This work addresses learning numeric per-feature modification costs for actionable recourse in black-box models by applying the Bradley-Terry model to map pairwise comparisons into feature strengths $β_f$, with costs given by Cost(f) = -β_f. It uses MAP estimation (via Gibbs sampling) to infer these costs from human pairwise comparisons, initially assuming exhaustive feature pairs and then extending to non-exhaustive data by using comparisons between entire recourses. The key contribution is showing that non-exhaustive data, including recourse-vs-recourse comparisons, can recover the full feature-cost vector and thus guide efficient recourse generation without explicit numeric elicitation. This approach enables practical, interpretable recourse in high-stakes settings and reduces the burden on users, with publicly available code supporting implementation.
Abstract
This paper presents a novel technique for incorporating user input when learning and inferring user preferences. When trying to provide users of black-box machine learning models with actionable recourse, we often wish to incorporate their personal preferences about the ease of modifying each individual feature. These recourse finding algorithms usually require an exhaustive set of tuples associating each feature to its cost of modification. Since it is hard to obtain such costs by directly surveying humans, in this paper, we propose the use of the Bradley-Terry model to automatically infer feature-wise costs using non-exhaustive human comparison surveys. We propose that users only provide inputs comparing entire recourses, with all candidate feature modifications, determining which recourses are easier to implement relative to others, without explicit quantification of their costs. We demonstrate the efficient learning of individual feature costs using MAP estimates, and show that these non-exhaustive human surveys, which do not necessarily contain data for each feature pair comparison, are sufficient to learn an exhaustive set of feature costs, where each feature is associated with a modification cost.
