QoNext: Towards Next-generation QoE for Foundation Models
Yijin Guo, Zicheng Zhang, Ye Shen, Farong Wen, Junying Wang, Qi Jia, Guangtao Zhai
TL;DR
QoNext introduces a QoE-inspired framework to evaluate foundation models by jointly considering content quality and service quality during interactive use. It builds a large, labeled database by systematically varying five factors (content accuracy, information density, output speed, latency position, and latency duration) and collects human ratings across diverse dialogues, yielding insights on how each factor shapes user experience. Regression models trained on this database predict subjective experience with high rank-order consistency (SRCC around 0.78–0.79), demonstrating the feasibility of objective predictions of user-perceived quality. The work also reveals content accuracy as the primary determinant of overall experience and shows MBTI-based personalization can reveal differential sensitivities to latency and density, offering practical guidance for adaptive optimization and productized service design in human-centric AI systems.
Abstract
Existing evaluations of foundation models, including recent human-centric approaches, fail to capture what truly matters: user's experience during interaction. Current methods treat evaluation as a matter of output correctness alone, overlooking that user satisfaction emerges from the interplay between response quality and interaction, which limits their ability to account for the mechanisms underlying user experience. To address this gap, we introduce QoNext, the first framework that adapts Quality of Experience (QoE) principles from networking and multimedia to the assessment of foundation models. QoNext identifies experiential factors that shape user experience and incorporates them into controlled experiments, where human ratings are collected under varied configurations. From these studies we construct a QoE-oriented database and train predictive models that estimate perceived user experience from measurable system parameters. Our results demonstrate that QoNext not only enables proactive and fine-grained evaluation but also provides actionable guidance for productized services of optimizing foundation models in practice.
