Multi-Cause Deconfounding for Recommender Systems with Latent Confounders
Zhirong Huang, Shichao Zhang, Debo Cheng, Jiuyong Li, Lin Liu, Guixian Zhang
TL;DR
This work tackles latent confounders in recommender systems by introducing MCDCF, a multi-cause deconfounding framework that separately learns substitutes for user-side and item-side latent confounders from user behavior data. It leverages two independent VAEs to recover $x_u$ and $x_i$ from multi-cause embeddings, then conditions the prediction on $U,I,X_U,X_I$ to debias the implicit feedback signal $C$, with a Bayesian ranking objective and ELBO regularization. Theoretical justification via backdoor adjustment and empirical validation on three real-world datasets demonstrate that MCDCF improves recommendation accuracy while providing stronger debiasing than state-of-the-art baselines and ablations. The results imply that explicitly modeling and removing both user- and item-side latent confounders can significantly enhance the reliability and fairness of recommendations without extra private data. The approach offers a practical, privacy-friendly path toward robust causal recommendations in complex, real-world settings.
Abstract
In recommender systems, various latent confounding factors (e.g., user social environment and item public attractiveness) can affect user behavior, item exposure, and feedback in distinct ways. These factors may directly or indirectly impact user feedback and are often shared across items or users, making them multi-cause latent confounders. However, existing methods typically fail to account for latent confounders between users and their feedback, as well as those between items and user feedback simultaneously. To address the problem of multi-cause latent confounders, we propose a multi-cause deconfounding method for recommender systems with latent confounders (MCDCF). MCDCF leverages multi-cause causal effect estimation to learn substitutes for latent confounders associated with both users and items, using user behaviour data. Specifically, MCDCF treats the multiple items that users interact with and the multiple users that interact with items as treatment variables, enabling it to learn substitutes for the latent confounders that influence the estimation of causality between users and their feedback, as well as between items and user feedback. Additionally, we theoretically demonstrate the soundness of our MCDCF method. Extensive experiments on three real-world datasets demonstrate that our MCDCF method effectively recovers latent confounders related to users and items, reducing bias and thereby improving recommendation accuracy.
