Quantifying Divergence for Human-AI Collaboration and Cognitive Trust

Müge Kural; Ali Gebeşçe; Tilek Chubakov; Gözde Gül Şahin

Quantifying Divergence for Human-AI Collaboration and Cognitive Trust

Müge Kural, Ali Gebeşçe, Tilek Chubakov, Gözde Gül Şahin

TL;DR

This study investigates how human-AI collaboration and cognitive trust relate to decision-making similarity between humans and AI models. By measuring divergence-based distances between human soft-label distributions and model outputs on an SNLI entailment task, the authors show that users tend to collaborate with the most similar model (as captured by Jensen-Shannon Distance), but cognitive trust does not consistently track this similarity. The work introduces a four-stage user study and analyzes forward and inverse KL divergences as well as JSD to characterize different alignment regimes, uncovering that low inverse KL drives collaboration while trust may require avoiding overconfidence (captured by alpha KL and JSD). These findings provide a framework for pre-deployment evaluation of AI partners and guide future optimization of models for collaboration and trust.

Abstract

Predicting the collaboration likelihood and measuring cognitive trust to AI systems is more important than ever. To do that, previous research mostly focus solely on the model features (e.g., accuracy, confidence) and ignore the human factor. To address that, we propose several decision-making similarity measures based on divergence metrics (e.g., KL, JSD) calculated over the labels acquired from humans and a wide range of models. We conduct a user study on a textual entailment task, where the users are provided with soft labels from various models and asked to pick the closest option to them. The users are then shown the similarities/differences to their most similar model and are surveyed for their likelihood of collaboration and cognitive trust to the selected system. Finally, we qualitatively and quantitatively analyze the relation between the proposed decision-making similarity measures and the survey results. We find that people tend to collaborate with their most similar models -- measured via JSD -- yet this collaboration does not necessarily imply a similar level of cognitive trust. We release all resources related to the user study (e.g., design, outputs), models, and metrics at our repo.

Quantifying Divergence for Human-AI Collaboration and Cognitive Trust

TL;DR

Abstract

Paper Structure (29 sections, 4 equations, 5 figures, 7 tables)

This paper contains 29 sections, 4 equations, 5 figures, 7 tables.

Introduction
Task Setup
Dataset
Models
Random baseline
TF-IDF
RoBERTa Liu2019RoBERTaAR
Enhanced LSTM chen2016enhanced
da-vinci-003 brown2020language
Similarity Calculation
Forward and Inverse KL Divergence
Jensen-Shannon Divergence:
User Study
Subset selection
Design of the User Study
...and 14 more sections

Figures (5)

Figure 1: Decision-making similarities between the user and various models are calculated using various divergence metrics, then linked to collaboration preferences and cognitive trust.
Figure 2: The annotation task is framed as a multiple-choice question answering problem, where the available options correspond to label predictions generated by the selected models.
Figure 3: Collaboration ratings among users. Over half of the participants give 4/5 ratings for collaboration with their aligned model.
Figure 4: Cognitive Trust ratings among users. Participants are distributed across 3 and 4/5 ratings for their aligned model.
Figure 5: Agreements/disagreements between user and the aligned model.

Quantifying Divergence for Human-AI Collaboration and Cognitive Trust

TL;DR

Abstract

Quantifying Divergence for Human-AI Collaboration and Cognitive Trust

Authors

TL;DR

Abstract

Table of Contents

Figures (5)