Pre-trained Recommender Systems: A Causal Debiasing Perspective
Ziqian Lin, Hao Ding, Nghia Trong Hoang, Branislav Kveton, Anoop Deoras, Hao Wang
TL;DR
The paper tackles data scarcity and transferability in recommender systems by introducing PreRec, a pre-trained, causally debiased recommender built as a hierarchical Bayesian deep model. It explicitly models in-domain (popularity) and cross-domain (domain) confounders with latent variables ${f Z}_j$ and ${f D}_k$, respectively, and leverages universal item/user embeddings derived from textual content and user histories. PreRec supports multi-domain pre-training, zero-shot inference via $do$-calculus to remove cross-domain bias, and target-domain fine-tuning to capture domain-specific signals; across cross-market and cross-platform tests on real data, it consistently outperforms strong baselines in zero-shot and fine-tuning scenarios. This causal debiasing framework enables more reliable, data-efficient transfer of recommender capabilities to new markets or platforms.
Abstract
Recent studies on pre-trained vision/language models have demonstrated the practical benefit of a new, promising solution-building paradigm in AI where models can be pre-trained on broad data describing a generic task space and then adapted successfully to solve a wide range of downstream tasks, even when training data is severely limited (e.g., in zero- or few-shot learning scenarios). Inspired by such progress, we investigate in this paper the possibilities and challenges of adapting such a paradigm to the context of recommender systems, which is less investigated from the perspective of pre-trained model. In particular, we propose to develop a generic recommender that captures universal interaction patterns by training on generic user-item interaction data extracted from different domains, which can then be fast adapted to improve few-shot learning performance in unseen new domains (with limited data). However, unlike vision/language data which share strong conformity in the semantic space, universal patterns underlying recommendation data collected across different domains (e.g., different countries or different E-commerce platforms) are often occluded by both in-domain and cross-domain biases implicitly imposed by the cultural differences in their user and item bases, as well as their uses of different e-commerce platforms. As shown in our experiments, such heterogeneous biases in the data tend to hinder the effectiveness of the pre-trained model. To address this challenge, we further introduce and formalize a causal debiasing perspective, which is substantiated via a hierarchical Bayesian deep learning model, named PreRec. Our empirical studies on real-world data show that the proposed model could significantly improve the recommendation performance in zero- and few-shot learning settings under both cross-market and cross-platform scenarios.
