Does More Advice Help? The Effects of Second Opinions in AI-Assisted Decision Making
Zhuoran Lu, Dakuo Wang, Ming Yin
TL;DR
The paper investigates how second opinions influence human-AI decision-making, addressing whether advisors from peers or AI can curb misaligned reliance on AI. Through three pre-registered MTurk experiments using a RoBERTa-based sentiment classifier and 20 sentiment tasks, it shows that always-presented second opinions reduce AI over-reliance but increase under-reliance, with effects modulated by agreement level rather than source. Allowing decision-makers to solicit second opinions on demand partially mitigates over-reliance without consistently elevating under-reliance, particularly when opinions strongly agree with the AI. The findings highlight nuanced design considerations for integrating second opinions into AI-assisted workflows and emphasize the trade-offs between reliability, cognitive load, and decision speed. Overall, second opinions hold promise for improving human-AI collaboration, but their impact hinges on presentation format, agreement dynamics, and the potential costs of obtaining second opinions.
Abstract
AI assistance in decision-making has become popular, yet people's inappropriate reliance on AI often leads to unsatisfactory human-AI collaboration performance. In this paper, through three pre-registered, randomized human subject experiments, we explore whether and how the provision of {second opinions} may affect decision-makers' behavior and performance in AI-assisted decision-making. We find that if both the AI model's decision recommendation and a second opinion are always presented together, decision-makers reduce their over-reliance on AI while increase their under-reliance on AI, regardless whether the second opinion is generated by a peer or another AI model. However, if decision-makers have the control to decide when to solicit a peer's second opinion, we find that their active solicitations of second opinions have the potential to mitigate over-reliance on AI without inducing increased under-reliance in some cases. We conclude by discussing the implications of our findings for promoting effective human-AI collaborations in decision-making.
