Cost and Reward Infused Metric Elicitation
Chethan Bhateja, Joseph O'Brien, Afnaan Hashmi, Eva Prakash
TL;DR
This paper addresses the limitation of metric elicitation methods that rely solely on confusion matrices by incorporating bounded costs and rewards into the elicitation framework. It extends the multiclass metric elicitation approach DLPME, introducing a cost-reward augmented metric $\psi(\mathbf{d}, \mathbf{r}, \mathbf{c}) = \langle \mathbf{a^d}, \mathbf{d} \rangle + \langle \mathbf{a^c}, \mathbf{c} \rangle + \langle \mathbf{a^r}, \mathbf{r} \rangle$, and an algorithm that learns weight ratios by comparing classifiers and querying an oracle, using RBO for accuracies and Pareto-frontier-based decisions for costs/rewards. Experiments on synthetic data show rapid, logarithmic convergence toward the true metric with scalable queries, and the method provides a practical path to deploying metrics that reflect multi-objective trade-offs such as monetary cost and latency. The work also outlines future directions for real-data validation, non-linear utilities, and group-aware considerations, while addressing ethical and governance concerns in deployment.
Abstract
In machine learning, metric elicitation refers to the selection of performance metrics that best reflect an individual's implicit preferences for a given application. Currently, metric elicitation methods only consider metrics that depend on the accuracy values encoded within a given model's confusion matrix. However, focusing solely on confusion matrices does not account for other model feasibility considerations such as varied monetary costs or latencies. In our work, we build upon the multiclass metric elicitation framework of Hiranandani et al., extrapolating their proposed Diagonal Linear Performance Metric Elicitation (DLPME) algorithm to account for additional bounded costs and rewards. Our experimental results with synthetic data demonstrate our approach's ability to quickly converge to the true metric.
