Navigating User Experience of ChatGPT-based Conversational Recommender Systems: The Effects of Prompt Guidance and Recommendation Domain
Yizhe Zhang, Yucheng Jin, Li Chen, Ting Yang
TL;DR
The paper addresses how prompt guidance and recommendation domain influence the user experience of ChatGPT-based conversational recommender systems. It employs a mixed-method online study (N=100) with a 2x2 design (PG: with/without; RD: book/job) to evaluate UX across multiple CRS-Que dimensions and interaction metrics. Key findings show that PG enhances explainability, ease of use, transparency, and CUI adaptability, while RD modulates novelty and willingness to use/try, with domain stakes shaping PG's effectiveness. The results offer design guidance for domain-aware, user-tailored prompt strategies in ChatGPT-powered CRS and highlight moderation by prior RS experience, informing practical UX design and future research.
Abstract
Conversational recommender systems (CRS) enable users to articulate their preferences and provide feedback through natural language. With the advent of large language models (LLMs), the potential to enhance user engagement with CRS and augment the recommendation process with LLM-generated content has received increasing attention. However, the efficacy of LLM-powered CRS is contingent upon the use of prompts, and the subjective perception of recommendation quality can differ across various recommendation domains. Therefore, we have developed a ChatGPT-based CRS to investigate the impact of these two factors, prompt guidance (PG) and recommendation domain (RD), on the overall user experience of the system. We conducted an online empirical study (N = 100) by employing a mixed-method approach that utilized a between-subjects design for the variable of PG (with vs. without) and a within-subjects design for RD (book recommendations vs. job recommendations). The findings reveal that PG can substantially enhance the system's explainability, adaptability, perceived ease of use, and transparency. Moreover, users are inclined to perceive a greater sense of novelty and demonstrate a higher propensity to engage with and try recommended items in the context of book recommendations as opposed to job recommendations. Furthermore, the influence of PG on certain user experience metrics and interactive behaviors appears to be modulated by the recommendation domain, as evidenced by the interaction effects between the two examined factors. This work contributes to the user-centered evaluation of ChatGPT-based CRS by investigating two prominent factors and offers practical design guidance.
