Aligning Judgment Using Task Context and Explanations to Improve Human-Recommender System Performance

Divya Srivastava; Karen M. Feigh

TL;DR

This paper investigates the relative impact of two ways of improving joint human-recommender performance: using context (properties of the decision-making task and environment) to align the human's and the AI algorithm's understanding of the state of the world, i.e., their judgment, versus providing post-hoc algorithmic explanations.

Abstract

Recommender systems, while a powerful decision-making tool, are often operationalized as black-box models whose AI algorithms are not accessible to or interpretable by human operators. This in turn can cause confusion and frustration for the operator and result in unsatisfactory outcomes. While the field of explainable AI has made remarkable strides in addressing this challenge by focusing on interpreting and explaining the algorithms to human operators, gaps remain in the human's understanding of the recommender system. This paper investigates the relative impact of using context (properties of the decision-making task and environment) to align the human's and the AI algorithm's understanding of the state of the world, i.e., their judgment, in order to improve joint human-recommender performance, as compared to utilizing post-hoc algorithmic explanations. We conducted an empirical, between-subjects experiment in which participants were asked to work with an automated recommender system to complete a decision-making task. We manipulated the method of transparency (shared contextual information to support shared judgment vs. algorithmic explanations) and recorded participants' understanding of the task and of the recommender system, as well as their overall performance. We found that both techniques yielded equivalent agreement on final decisions. However, participants who saw task context were less prone to over-rely on the recommender system and were better able to pinpoint the conditions under which the AI erred. Both methods improved participants' confidence in their own decision making, and both increased mental demand equally and frustration only negligibly. These results present an alternative to post-hoc explanations for improving team performance and illustrate the impact of judgment on human cognition when working with recommender systems.

Paper Structure (21 sections, 12 figures, 3 tables)

Introduction
Background & Prior Similar Work
Naturalistic Decision Making (NDM) & Role of Judgement
Model-Specific Contributions to Mitigate Challenges
Contextual Information Improving Team Performance with Recommender Systems
Methodology
Task Domain & Decision Environment
Experiment Design and Task Procedure
Measures and Dependent Variables
Experiment Procedure
Results
Objective Metrics
Final Decision Agreement
Task Performance
Sway, the AI's Influence on the Participant
...and 6 more sections

Figures (12)

Figure 1: Task Outline and Metrics for EDL Trajectory Planning
Figure 2: Example of AI's Local Explanation with the generated suggestion
Figure 3: World State Information from Left to Right: GPS, Atmosphere/Weather, Anticipated Entry Angle
Figure 4: AI-generated trajectory with interactive questions
Figure 5: Final Agreement [%] Between Human and 60% Accurate AI
...and 7 more figures
