Mitigating Propensity Bias of Large Language Models for Recommender Systems
Guixian Zhang, Guan Yuan, Debo Cheng, Lin Liu, Jiuyong Li, Shichao Zhang
TL;DR
This work tackles the propensity bias and dimensional collapse that arise when incorporating LLM-generated side information into recommender systems. It introduces Counterfactual LLM Recommendation (CLLMR), which combines a spectrum-based Side Information Encoder (SSE) with counterfactual inference to debias the alignment between side information and collaborative signals. SSE uses an identifiable VAE with a spectrum-derived latent prior and controlled noise to capture structural information from historical interactions, preventing collapse across representation dimensions. Through causal modeling and counterfactual reasoning, CLLMR mitigates LLM biases during inference while maintaining the rich knowledge encoded by LLMs, achieving robust improvements across multiple backbone recommenders and real-world datasets. The approach couples contrastive alignment with causal debiasing, delivering practical gains for LLM-enhanced recommender systems with improved fairness and personalization.
Abstract
The rapid development of Large Language Models (LLMs) creates new opportunities for recommender systems, especially by exploiting the side information (e.g., descriptions and analyses of items) generated by these models. However, aligning this side information with collaborative information from historical interactions poses significant challenges. The inherent biases within LLMs can skew recommendations, resulting in distorted and potentially unfair user experiences. On the other hand, propensity bias causes side information to be aligned in such a way that it often tends to represent all inputs in a low-dimensional subspace, leading to a phenomenon known as dimensional collapse, which severely restricts the recommender system's ability to capture user preferences and behaviours. To address these issues, we introduce a novel framework named Counterfactual LLM Recommendation (CLLMR). Specifically, we propose a spectrum-based side information encoder that implicitly embeds structural information from historical interactions into the side information representation, thereby circumventing the risk of dimension collapse. Furthermore, our CLLMR approach explores the causal relationships inherent in LLM-based recommender systems. By leveraging counterfactual inference, we counteract the biases introduced by LLMs. Extensive experiments demonstrate that our CLLMR approach consistently enhances the performance of various recommender models.
