ExplainRec: Towards Explainable Multi-Modal Zero-Shot Recommendation with Preference Attribution and Large Language Models
Bo Ma, LuYao Liu, ZeHua Hu, Simon Lau
TL;DR
ExplainRec tackles the limitations of prior LLM-based recommender systems by integrating explainable preference attribution, zero-shot transfer, multi-modal fusion, and multi-task optimization within a unified framework. It introduces attribution-enhanced instructions, a multi-objective preference loss, a universal preference knowledge base, and a joint training scheme to share learning signals across tasks. Empirical results on MovieLens-25M, Amazon Movies & TV, and cross-domain data show consistent improvements (0.7%–0.9% AUC gains) over strong baselines, along with interpretable explanations and robust cold-start performance. The approach demonstrates the value of combining explainability, cross-modal content, and cross-task collaboration for practical, scalable LLM-based recommendations with strong transfer capabilities.
Abstract
Recent advances in Large Language Models (LLMs) have opened new possibilities for recommendation systems, though current approaches such as TALLRec face challenges in explainability and cold-start scenarios. We present ExplainRec, a framework that extends LLM-based recommendation capabilities through preference attribution, multi-modal fusion, and zero-shot transfer learning. The framework incorporates four technical contributions: preference attribution tuning for explainable recommendations, zero-shot preference transfer for cold-start users and items, multi-modal enhancement leveraging visual and textual content, and multi-task collaborative optimization. Experimental evaluation on MovieLens-25M and Amazon datasets shows that ExplainRec outperforms existing methods, achieving AUC improvements of 0.7\% on movie recommendation and 0.9\% on cross-domain tasks, while generating interpretable explanations and handling cold-start scenarios effectively.
