Know Your Users! Estimating User Domain Knowledge in Conversational Recommenders
Ivica Kostric, Ujwal Gadiraju, Krisztian Balog
TL;DR
This work addresses the challenge of varying user domain knowledge in conversational recommender systems by introducing RecQuest, a gamified data-collection protocol that elicits domain-knowledge signals through target-item dialogue. It crafts a target-aware CRS architecture and a controlled experimental flow to generate dialogues across five consumer domains, accompanied by ground-truth knowledge assessments. Analyses reveal distinct interaction and linguistic patterns for novices versus experts, including differences in recommendation rate, question-asking, and attribute-focused language, suggesting the feasibility of estimating user knowledge from dialogue. The resulting RecQuest dataset offers a valuable resource for developing truly adaptive CRS that tailor elicitation and explanations to user expertise.
Abstract
The ideal conversational recommender system (CRS) acts like a savvy salesperson, adapting its language and suggestions to each user's level of expertise. However, most current systems treat all users as experts, leading to frustrating and inefficient interactions when users are unfamiliar with a domain. Systems that can adapt their conversational strategies to a user's knowledge level stand to offer a much more natural and effective experience. To make a step toward such adaptive systems, we introduce a new task: estimating user domain knowledge from conversations, enabling a CRS to better understand user needs and personalize interactions. A key obstacle to developing such adaptive systems is the lack of suitable data; to our knowledge, no existing dataset captures the conversational behaviors of users with varying levels of domain knowledge. Furthermore, in most dialogue collection protocols, users are free to express their own preferences, which tends to concentrate on popular items and well-known features, offering little insight into how novices explore or learn about unfamiliar features. To address this, we design a game-based data collection protocol that elicits varied expressions of knowledge, release the resulting dataset, and provide an initial analysis to highlight its potential for future work on user-knowledge-aware CRS.
