Beyond Collaborative Filtering: A Relook at Task Formulation in Recommender Systems
Aixin Sun
TL;DR
The paper argues that traditional RecSys task formulations reduce the problem to static matrix completion of $R(u,i)$ over a $|U|\times|I|$ space, neglecting the dynamic decision-making behind user interactions. It advocates for application-specific, timeline-aware task definitions that incorporate contextual inputs $X$, current session items $I_c$, and historical interactions $U\times I$, to better reflect user decision-making. It analyzes mismatches between common offline datasets and real-world contexts, critiques standard evaluation practices, and emphasizes the need for domain-specific datasets and scenario-aligned evaluation to guide baselines. The authors frame recommender systems as inherently dynamic ranking problems that should balance enduring user preferences with momentary context, urging researchers to adopt scenario-driven task formulations, richer data collection, and evaluation methods suitable for emerging settings like conversational and sequential recommendations. Overall, the work seeks to realign RecSys research with practice by clarifying task scopes, datasets, and evaluation to support more meaningful, context-aware recommendations.
Abstract
Recommender Systems (RecSys) have become indispensable in numerous applications, profoundly influencing our everyday experiences. Despite their practical significance, academic research in RecSys often abstracts the formulation of research tasks from real-world contexts, aiming for a clean problem formulation and more generalizable findings. However, it is observed that there is a lack of collective understanding in RecSys academic research. The root of this issue may lie in the simplification of research task definitions, and an overemphasis on modeling the decision outcomes rather than the decision-making process. That is, we often conceptualize RecSys as the task of predicting missing values in a static user-item interaction matrix, rather than predicting a user's decision on the next interaction within a dynamic, changing, and application-specific context. There exists a mismatch between the inputs accessible to a model and the information available to users during their decision-making process, yet the model is tasked to predict users' decisions. While collaborative filtering is effective in learning general preferences from historical records, it is crucial to also consider the dynamic contextual factors in practical settings. Defining research tasks based on application scenarios using domain-specific datasets may lead to more insightful findings. Accordingly, viable solutions and effective evaluations can emerge for different application scenarios.
