Evaluating graph-based explanations for AI-based recommender systems

Simon Delarue; Astrid Bertrand; Tiphaine Viard

Evaluating graph-based explanations for AI-based recommender systems

Simon Delarue, Astrid Bertrand, Tiphaine Viard

TL;DR

This study investigates graph based explanations for AI driven recommender systems and compares them to textual and SHAP explanations through qualitative and quantitative user studies. Qualitative insights reveal users find graph explanations interpretable and desire item level metadata, while quantitative results show text explanations yield higher objective understanding and both graph and text designs improve usability relative to SHAP. A key finding is the mismatch between users expressed preference for graph based explanations and their actual performance, highlighting the challenge of balancing social expectations with downstream effectiveness. The work advocates hybrid explanations that combine the strengths of graphs and text to support transparency, understanding, and practical accuracy.

Abstract

Recent years have witnessed a rapid growth of recommender systems, providing suggestions in numerous applications with potentially high social impact, such as health or justice. Meanwhile, in Europe, the upcoming AI Act mentions \emph{transparency} as a requirement for critical AI systems in order to ``mitigate the risks to fundamental rights''. Post-hoc explanations seamlessly align with this goal and extensive literature on the subject produced several forms of such objects, graphs being one of them. Early studies in visualization demonstrated the graphs' ability to improve user understanding, positioning them as potentially ideal explanations. However, it remains unclear how graph-based explanations compare to other explanation designs. In this work, we aim to determine the effectiveness of graph-based explanations in improving users' perception of AI-based recommendations using a mixed-methods approach. We first conduct a qualitative study to collect users' requirements for graph explanations. We then run a larger quantitative study in which we evaluate the influence of various explanation designs, including enhanced graph-based ones, on aspects such as understanding, usability and curiosity toward the AI system. We find that users perceive graph-based explanations as more usable than designs involving feature importance. However, we also reveal that textual explanations lead to higher objective understanding than graph-based designs. Most importantly, we highlight the strong contrast between participants' expressed preferences for graph design and their actual ratings using it, which are lower compared to textual design. These findings imply that meeting stakeholders' expressed preferences might not alone guarantee ``good'' explanations. Therefore, crafting hybrid designs successfully balancing social expectations with downstream performance emerges as a significant challenge.

Evaluating graph-based explanations for AI-based recommender systems

TL;DR

Abstract

Paper Structure (30 sections, 5 figures, 2 tables)

This paper contains 30 sections, 5 figures, 2 tables.

Introduction
Related work
Explainable AI for recommender systems
Explainable AI in Computer Science
Explainable AI in Human Computer Interaction
Study 1: Qualitative exploration of stakeholders' expectations regarding graph-based explanations
Study design
Results
A high level of perceived understanding, based on three main criteria
Strong link between context knowledge and recommendation understanding
Graphs as objects modeling both similarities and popularity
Graph-based explanations are considered interpretable but should contain item information
Study 2: Influence of explanation design
Study design
Recommender system
...and 15 more sections

Figures (5)

Figure 1: Bipartite graph-based explanation containing two sets of distinct nodes. Blue nodes represent users (along with their identifier) and red nodes represent books (along with their title). A user is linked to a book if the former has read and rated the latter. Thin links denote low ratings, while thick links denote high ratings. The blue link corresponds to the system recommendation.
Figure 2: Explanation designs used in the second user study.
Figure 3: Results for quantitative study. Vertical bars are confidence intervals at $95\%$. Significance levels are reported as follows: $***=p \leq .001, **=p \leq .01, *=p \leq .05$, $\cdot=p \leq .07$ and n.s. non significant. Reading key: Objective understanding (Figure \ref{['fig:sub1']}) was statistically significantly higher ($p \leq .05$) when participants used textual explanation rather than graph explanation.
Figure 4: Number of occurrences of each design at each ranking position. Reading key: Graph-based design has been ranked 29 times in #1 position, 24 times in #2 position and 13 times in #3 position.
Figure 5: Example of a participant's progression through the questionnaire.

Evaluating graph-based explanations for AI-based recommender systems

TL;DR

Abstract

Evaluating graph-based explanations for AI-based recommender systems

Authors

TL;DR

Abstract

Table of Contents

Figures (5)