The Influence of Presentation and Performance on User Satisfaction
Kanaad Pathak, Leif Azzopardi, Martin Halvey
TL;DR
This work addresses how query performance and result presentation jointly influence user satisfaction in information retrieval. Using a between-subjects design with five result-card layouts on the WaPo 2018 corpus, the study measures query-level satisfaction, interface-level satisfaction, and both system-side ($nDCG@10$) and user-side performance metrics, including Total Gain on the first page. It finds a strong linear relationship between $nDCG@10$ and query satisfaction, while presentation alone does not linearly elevate satisfaction; however, presentation interacts with total page gain to affect satisfaction and reveals a complex, multi-factorial pattern through LDA analysis. The results underscore the need to harmonize retrieval quality and result presentation in IR design, suggesting adaptive interfaces that account for diverse user preferences and task contexts.
Abstract
The effectiveness of an IR system is gauged not just by its ability to retrieve relevant results but also by how it presents these results to users; an engaging presentation often correlates with increased user satisfaction. While existing research has delved into the link between user satisfaction, IR performance metrics, and presentation, these aspects have typically been investigated in isolation. Our research aims to bridge this gap by examining the relationship between query performance, presentation and user satisfaction. For our analysis, we conducted a between-subjects experiment comparing the effectiveness of various result card layouts for an ad-hoc news search interface. Drawing data from the TREC WaPo 2018 collection, we centered our study on four specific topics. Within each of these topics, we assessed six distinct queries with varying nDCG values. Our study involved 164 participants who were exposed to one of five distinct layouts containing result cards, such as "title'', "title+image'', or "title+image+summary''. Our findings indicate that while nDCG is a strong predictor of user satisfaction at the query level, there exists no linear relationship between the performance of the query, presentation of results and user satisfaction. However, when considering the total gain on the initial result page, we observed that presentation does play a significant role in user satisfaction (at the query level) for certain layouts with result cards such as, title+image or title+image+summary. Our results also suggest that the layout differences have complex and multifaceted impacts on satisfaction. We demonstrate the capacity to equalize user satisfaction levels between queries of varying performance by changing how results are presented. This emphasizes the necessity to harmonize both performance and presentation in IR systems, considering users' diverse preferences.
