Item-side Fairness of Large Language Model-based Recommendation System
Meng Jiang, Keqin Bao, Jizhi Zhang, Wenjie Wang, Zhengyi Yang, Fuli Feng, Xiangnan He
TL;DR
This work investigates item-side fairness in LLM-based recommender systems (LRS), arguing that an LRS is affected both by popularity bias in historical user interactions and by semantic biases inherited from LLM pretraining. It introduces IFairLRS, a two-stage framework that applies reweighting during instruction-tuning and punishment-based reranking during inference to calibrate item exposure. Empirical results on MovieLens1M and Steam show that IFairLRS improves fairness metrics such as $MGU@K$ and $DGU@K$ with minimal losses in $NDCG@K$ and $HR@K$. The study also finds that grounding and pretraining contribute to unfairness, and suggests future work on broader groupings and deeper analysis of LLM priors in LRS.
Abstract
Recommendation systems for Web content distribution are closely tied to the information access and exposure opportunities of vulnerable populations. The emergence of Large Language Model-based Recommendation Systems (LRS) may introduce additional societal challenges because of the inherent biases in Large Language Models (LLMs). Item-side fairness of LRS has not yet been comprehensively investigated, even though LRS have unique characteristics compared to conventional recommendation systems. To bridge this gap, this study examines the properties of LRS with respect to item-side fairness and reveals two influencing factors, historical users' interactions and the inherent semantic biases of LLMs, shedding light on the need to extend conventional item-side fairness methods to LRS. Towards this goal, we develop a concise and effective framework called IFairLRS to enhance the item-side fairness of an LRS. IFairLRS covers the main stages of building an LRS with specifically adapted strategies to calibrate the recommendations of LRS. We use IFairLRS to fine-tune LLaMA, a representative LLM, on the MovieLens and Steam datasets, and observe significant item-side fairness improvements. The code is available at https://github.com/JiangM-C/IFairLRS.git.
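
To give a rough sense of what a punishment-based reranking at inference time could look like, the following is a minimal sketch and not the authors' implementation (the repository above is the authoritative source). It assumes a greedy, calibration-style penalty: each candidate's score is reduced in proportion to how far its item group would exceed a target share of the top-K list. The function name `rerank_with_penalty`, the parameter `alpha`, and the `target_share` dictionary are all hypothetical names introduced for illustration.

```python
from collections import Counter


def rerank_with_penalty(candidates, item_group, target_share, k, alpha=1.0):
    """Greedily pick k items, penalizing groups that exceed their target share.

    candidates:   list of (item, score) pairs; a higher score is better.
    item_group:   dict mapping item -> group label (e.g. a genre).
    target_share: dict mapping group -> desired fraction of the top-k list
                  (an assumption of this sketch, e.g. the share in user history).
    alpha:        hypothetical penalty strength; 0 disables the penalty.
    """
    remaining = dict(candidates)
    picked = []
    counts = Counter()

    while remaining and len(picked) < k:

        def adjusted(item):
            group = item_group[item]
            # Share this group would hold if the item were appended now.
            share_if_added = (counts[group] + 1) / (len(picked) + 1)
            # Only over-exposure relative to the target is punished.
            excess = max(0.0, share_if_added - target_share.get(group, 0.0))
            return remaining[item] - alpha * excess

        best = max(remaining, key=adjusted)
        picked.append(best)
        counts[item_group[best]] += 1
        del remaining[best]

    return picked


# Toy example: three "popular" items outrank one "niche" item on raw score.
candidates = [("a", 0.9), ("b", 0.8), ("c", 0.7), ("d", 0.6)]
item_group = {"a": "popular", "b": "popular", "c": "popular", "d": "niche"}
target_share = {"popular": 0.5, "niche": 0.5}

print(rerank_with_penalty(candidates, item_group, target_share, k=2, alpha=0.0))
print(rerank_with_penalty(candidates, item_group, target_share, k=2, alpha=1.0))
```

With `alpha=0.0` the list is simply the two highest-scoring items, `['a', 'b']`; with `alpha=1.0` the second slot goes to the niche item, giving `['a', 'd']`. The trade-off between accuracy and exposure is controlled entirely by `alpha` in this sketch; the paper's actual penalty term and grouping may differ.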
