SRLF: An Agent-Driven Set-Wise Reflective Learning Framework for Sequential Recommendation
Jiahao Wang, Bokang Fu, Yu Zhu, Yuli Liu
TL;DR
SRLF tackles the limitation of point-wise modeling in sequential recommendation by introducing a set-wise reflective learning framework that treats candidate item sets holistically. It leverages a closed-loop assess-validate-reflect cycle with a Set-wise Assessment Agent and an overlapping-set partitioning strategy to capture inter-item dynamics and alignment with user preferences, guided by a set-wise mismatch loss that triggers dual-path reflection updating the user profile and item semantics. Empirical results on CDs and MovieLens show state-of-the-art performance, validating the benefit of set-wise reasoning for capturing higher-order relational patterns in user behavior. The framework promises more transparent, adaptable, and effective recommendations in dynamic, multi-item contexts, with future work aimed at diversity and computational efficiency.
Abstract
LLM-based agents are emerging as a promising paradigm for simulating user behavior to enhance recommender systems. However, their effectiveness is often limited by existing studies that focus on modeling user ratings for individual items. This point-wise approach leads to prevalent issues such as inaccurate user preference comprehension and rigid item-semantic representations. To address these limitations, we propose the novel Set-wise Reflective Learning Framework (SRLF). Our framework operationalizes a closed-loop "assess-validate-reflect" cycle that harnesses the powerful in-context learning capabilities of LLMs. SRLF departs from conventional point-wise assessment by formulating a holistic judgment on an entire set of items. It accomplishes this by comprehensively analyzing both the intricate interrelationships among items within the set and their collective alignment with the user's preference profile. This method of set-level contextual understanding allows our model to capture complex relational patterns essential to user behavior, making it significantly more adept for sequential recommendation. Extensive experiments validate our approach, confirming that this set-wise perspective is crucial for achieving state-of-the-art performance in sequential recommendation tasks.
