Higher Stakes, Healthier Trust? An Application-Grounded Approach to Assessing Healthy Trust in High-Stakes Human-AI Collaboration
David S. Johnson
TL;DR
The paper tackles the challenge of evaluating healthy trust in high-stakes human-AI collaboration by introducing Blockies, a parametric, bias-controlled dataset generator, and an application-grounded evaluation framework for scalable online studies. It combines simulated diagnostic tasks with a real black-box model and storytelling-based stake manipulation to study how perceived stakes affect decision time and trust. An empirical study shows that high-stakes conditions increase cognitive effort but can lead to greater reliance on incorrect AI recommendations, reducing healthy distrust in some contexts. Overall, the framework enables reproducible, large-scale comparative evaluations of explainability and H-AI collaboration methods in realistic decision-making scenarios.
Abstract
Human-AI collaboration is increasingly promoted to improve high-stakes decision-making, yet its benefits have not been fully realized. Application-grounded evaluations are needed to better evaluate methods for improving collaboration but often require domain experts, making studies costly and limiting their generalizability. Current evaluation methods are constrained by limited public datasets and reliance on proxy tasks. To address these challenges, we propose an application-grounded framework for large-scale, online evaluations of vision-based decision-making tasks. The framework introduces Blockies, a parametric approach for generating datasets of simulated diagnostic tasks, offering control over the traits and biases in the data used to train real-world models. These tasks are designed to be easy to learn but difficult to master, enabling participation by non-experts. The framework also incorporates storytelling and monetary incentives to manipulate perceived task stakes. An initial empirical study demonstrated that the high-stakes condition significantly reduced healthy distrust of AI, despite longer decision-making times. These findings underscore the importance of perceived stakes in fostering healthy distrust and demonstrate the framework's potential for scalable evaluation of high-stakes Human-AI collaboration.
