Sustainable transparency in Recommender Systems: Bayesian Ranking of Images for Explainability
Jorge Paz-Ruza, Amparo Alonso-Betanzos, Berta Guijarro-Berdiñas, Brais Cancela, Carlos Eiras-Franco
TL;DR
BRIE reframes image-based explanations for recommender systems as a Bayesian ranking task, improving over ELVis and MF-ELVis by using Bayesian Pairwise Ranking (BPR) and an extended negative-sampling strategy. The model learns from user-uploaded images with a simple dot-product backbone and dropout, achieving higher ranking performance (notably MAUC) while drastically reducing model size and environmental impact. Evaluated on six real-world restaurant datasets, BRIE consistently outperforms baselines and demonstrates substantially lower training and inference emissions. The work advances explainable AI in RS by delivering more trustworthy, scalable, and green visual explanations that leverage existing user-generated content rather than synthetic or text-based signals.
Abstract
Recommender Systems have become crucial in the modern world, commonly guiding users towards relevant content or products, and having a large influence over the decisions of users and citizens. However, ensuring transparency and user trust in these systems remains a challenge; personalized explanations have emerged as a solution, offering justifications for recommendations. Among the existing approaches for generating personalized explanations, using existing visual content created by users is a promising option to maximize transparency and user trust. State-of-the-art models that follow this approach, despite leveraging highly optimized architectures, employ surrogate learning tasks that do not efficiently model the objective of ranking images as explanations for a given recommendation; this leads to a suboptimal training process with high computational costs that may not be reduced without affecting model performance. This work presents BRIE, a novel model where we leverage Bayesian Pairwise Ranking to enhance the training process, allowing us to consistently outperform state-of-the-art models in six real-world datasets while reducing its model size by up to 64 times and its CO2 emissions by up to 75% in training and inference.
