Separating and Learning Latent Confounders to Enhancing User Preferences Modeling
Hangtong Xu, Yuanbo Xu, Yongjian Yang
TL;DR
This work tackles bias and confounding in recommender systems by treating the former recommender as a proxy for unmeasured confounders and proposing SLFR to disentangle confounders from true user preferences in a latent space. It introduces a two-stage variational approach where stage one learns user- and item-specific confounder representations via two VAEs with a modified KL objective to promote independence, and stage two estimates counterfactual true preferences by removing confounder effects in a Matrix Factorization scoring framework, optimizing the combined loss $\,\mathcal{L}_{SLFR} = \,\mathcal{L}_{normal} + \gamma \\mathcal{L}_{bias}$. The method is demonstrated on five real-world datasets, showing consistent improvements over strong baselines and compatibility with backbones like MF, LightGCN, DIN, and SASRec. By explicitly separating latent confounders from user preferences, SLFR provides a general debiasing strategy for more accurate true-preference modeling in recommender systems.
Abstract
Recommender models aim to capture user preferences from historical feedback and then predict user-specific feedback on candidate items. However, the presence of various unmeasured confounders causes deviations between the user preferences in the historical feedback and the true preferences, resulting in models not meeting their expected performance. Existing debias models either (1) specific to solving one particular bias or (2) directly obtain auxiliary information from user historical feedback, which cannot identify whether the learned preferences are true user preferences or mixed with unmeasured confounders. Moreover, we find that the former recommender system is not only a successor to unmeasured confounders but also acts as an unmeasured confounder affecting user preference modeling, which has always been neglected in previous studies. To this end, we incorporate the effect of the former recommender system and treat it as a proxy for all unmeasured confounders. We propose a novel framework, Separating and Learning Latent Confounders For Recommendation (SLFR), which obtains the representation of unmeasured confounders to identify the counterfactual feedback by disentangling user preferences and unmeasured confounders, then guides the target model to capture the true preferences of users. Extensive experiments in five real-world datasets validate the advantages of our method.
