Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models
Jiongran Wu, Jiahao Liu, Dongsheng Li, Guangping Zhang, Mingzhe Han, Hansu Gu, Peng Zhang, Li Shang, Tun Lu, Ning Gu
TL;DR
LLMD4Rec introduces a bidirectional mutual distillation framework that enables dynamic collaboration between LLM-centric and CRM-based sequential recommender systems. By alternating training and aligning output distributions through temperature-scaled KL divergences and sample-wise weighting, it achieves mutual enhancement without adding trainable parameters or increasing inference costs. The approach injects CRM signals into the LLM (information augmentation) and distills LLM semantics into the CRM (output distribution distillation), forming a virtuous loop that improves both components. Empirical results on real-world datasets show substantial gains over strong baselines, with efficient training and deployment suitable for scalable, real-time applications.
Abstract
Large language models (LLMs) have demonstrated exceptional performance in understanding and generating semantic patterns, making them promising candidates for sequential recommendation tasks. However, when combined with conventional recommendation models (CRMs), LLMs often face challenges related to high inference costs and static knowledge transfer methods. In this paper, we propose a novel mutual distillation framework, LLMD4Rec, that fosters dynamic and bidirectional knowledge exchange between LLM-centric and CRM-based recommendation systems. Unlike traditional unidirectional distillation methods, LLMD4Rec enables iterative optimization by alternately refining both models, enhancing the semantic understanding of CRMs and enriching LLMs with collaborative signals from user-item interactions. By leveraging sample-wise adaptive weighting and aligning output distributions, our approach eliminates the need for additional parameters while ensuring effective knowledge transfer. Extensive experiments on real-world datasets demonstrate that LLMD4Rec significantly improves recommendation accuracy across multiple benchmarks without increasing inference costs. This method provides a scalable and efficient solution for combining the strengths of both LLMs and CRMs in sequential recommendation systems.
