Tag-Enriched Multi-Attention with Large Language Models for Cross-Domain Sequential Recommendation
Wangyu Wu, Xuhang Chen, Zhenhong Chen, Jing-En Jiang, Kim-Fung Tsang, Xiaowei Huang, Fei Ma, Jimin Xiao
TL;DR
Cross-domain sequential recommendation faces data sparsity and cross-domain misalignment, limiting multimodal utilization. The authors present TEMA-LLM, a framework that uses LLMs to generate domain-aware semantic tags from item titles/descriptions and fuses tag embeddings with ID, textual, and visual features through a Tag-Enriched Multi-Attention architecture. The approach jointly models intra- and inter-domain user preferences via a four-stage pipeline, including self-refined tag lists, tag embedding, multimodal item construction, and hierarchical attention, achieving state-of-the-art results on four large-scale CDSR datasets. Ablation studies confirm the contributions of LLM-driven tagging, weighted tag fusion, and the multi-attention design, with offline tag generation and CLIP feature extraction enhancing practicality. Overall, the work demonstrates the practical potential of LLM-based semantic tagging to improve cross-domain alignment and personalized, multimodal recommendations in consumer-facing platforms.
Abstract
Cross-Domain Sequential Recommendation (CDSR) plays a crucial role in modern consumer electronics and e-commerce platforms, where users interact with diverse services such as books, movies, and online retail products. These systems must accurately capture both domain-specific and cross-domain behavioral patterns to provide personalized and seamless consumer experiences. To address this challenge, we propose \textbf{TEMA-LLM} (\textit{Tag-Enriched Multi-Attention with Large Language Models}), a practical and effective framework that integrates \textit{Large Language Models (LLMs)} for semantic tag generation and enrichment. Specifically, TEMA-LLM employs LLMs to assign domain-aware prompts and generate descriptive tags from item titles and descriptions. The resulting tag embeddings are fused with item identifiers as well as textual and visual features to construct enhanced item representations. A \textit{Tag-Enriched Multi-Attention} mechanism is then introduced to jointly model user preferences within and across domains, enabling the system to capture complex and evolving consumer interests. Extensive experiments on four large-scale e-commerce datasets demonstrate that TEMA-LLM consistently outperforms state-of-the-art baselines, underscoring the benefits of LLM-based semantic tagging and multi-attention integration for consumer-facing recommendation systems. The proposed approach highlights the potential of LLMs to advance intelligent, user-centric services in the field of consumer electronics.
