Embedding Cultural Diversity in Prototype-based Recommender Systems
Armin Moradi, Nicola Neophytou, Florian Carichon, Golnoosh Farnadi
TL;DR
The paper tackles popularity bias in culturally rich recommender systems by focusing on demographic biases within prototype-based matrix factorization. It introduces two in-processing mechanisms for ProtoMF: Prototype K-filtering to emphasize locally relevant prototypes and a Prototype-Distributing Regularizer to encourage a uniform distribution of prototypes across the embedding space, using country of origin as a cultural cue. Across four datasets, the approach yields tangible fairness gains—reducing long-tail and underrepresented-item rankings—while maintaining or improving recommendation quality (e.g., HitRate@10 improvements). Additionally, the method enhances explainability by producing more culturally diverse prototype associations. Limitations include reliance on country-of-origin as a proxy for culture, potential generalizability constraints from data filtering, and a need for deeper theoretical and explainability analyses; future work could broaden cultural factors and provide formal fairness guarantees.
Abstract
Popularity bias in recommender systems can increase cultural overrepresentation by favoring norms from dominant cultures and marginalizing underrepresented groups. This issue is critical for platforms offering cultural products, as they influence consumption patterns and human perceptions. In this work, we address popularity bias by identifying demographic biases within prototype-based matrix factorization methods. Using the country of origin as a proxy for cultural identity, we link this demographic attribute to popularity bias by refining the embedding space learning process. First, we propose filtering out irrelevant prototypes to improve representativity. Second, we introduce a regularization technique to enforce a uniform distribution of prototypes within the embedding space. Across four datasets, our results demonstrate a 27\% reduction in the average rank of long-tail items and a 2\% reduction in the average rank of items from underrepresented countries. Additionally, our model achieves a 2\% improvement in HitRatio@10 compared to the state-of-the-art, highlighting that fairness is enhanced without compromising recommendation quality. Moreover, the distribution of prototypes leads to more inclusive explanations by better aligning items with diverse prototypes.
