From Token to Item: Enhancing Large Language Models for Recommendation via Item-aware Attention Mechanism

Xiaokun Zhang; Bowei He; Jiamin Chen; Ziqiang Cui; Chen Ma

From Token to Item: Enhancing Large Language Models for Recommendation via Item-aware Attention Mechanism

Xiaokun Zhang, Bowei He, Jiamin Chen, Ziqiang Cui, Chen Ma

Abstract

Large Language Models (LLMs) have recently gained increasing attention in the field of recommendation. Existing LLM-based methods typically represent items as token sequences, and apply attention layers on these tokens to generate recommendations. However, by inheriting the standard attention mechanism, these methods focus on modeling token-level relations. This token-centric focus overlooks the item as the fundamental unit of recommendation, preventing existing methods from effectively capturing collaborative relations at the item level. In this work, we revisit the role of tokens in LLM-driven recommendation and categorize their relations into two types: (1) intra-item token relations, which present the content semantics of an item, e.g., name, color, and size; and (2) inter-item token relations, which encode collaborative relations across items. Building on these insights, we propose a novel framework with an item-aware attention mechanism (IAM) to enhance LLMs for recommendation. Specifically, IAM devises two complementary attention layers: (1) an intra-item attention layer, which restricts attention to tokens within the same item, modeling item content semantics; and (2) an inter-item attention layer, which attends exclusively to token relations across items, capturing item collaborative relations. Through this stacked design, IAM explicitly emphasizes items as the fundamental units in recommendation, enabling LLMs to effectively exploit item-level collaborative relations. Extensive experiments on several public datasets demonstrate the effectiveness of IAM in enhancing LLMs for personalized recommendation.

From Token to Item: Enhancing Large Language Models for Recommendation via Item-aware Attention Mechanism

Abstract

Paper Structure (28 sections, 4 equations, 6 figures, 3 tables)

This paper contains 28 sections, 4 equations, 6 figures, 3 tables.

Introduction
Related Work
Sequential Recommendation
LLM-based Recommendation
Preliminaries
Problem Statement
Recommendation Paradigm of LLMs
Instruction construction
Token layer
Attention layer
Recommendation generation
Fine-tuning with Low-rank Adaption
The Proposed IAM
Intra-item Attention Layer
Inter-item Attention Layer
...and 13 more sections

Figures (6)

Figure 1: (a) Current LLM-based methods focus on modeling token-level relations, while failing to exploit collaborative relations at the item level; (b) intra- and inter-item token relations indicate item content semantics and collaborative relations, respectively.
Figure 2: Recommendation paradigm of LLM-based methods.
Figure 3: Causal self-attention mechanism and its attention matrix.
Figure 4: The architecture of IAM. The intra-item attention layer handles intra-item token relations to model item content semantics. The inter-item attention layer attends to inter-item token relations to explicitly capture item collaborative relations. IAM employs a recommendation adapter to generate top-$k$ items as recommendations.
Figure 5: Ablation study on intra- and inter-item attention layers.
...and 1 more figures

From Token to Item: Enhancing Large Language Models for Recommendation via Item-aware Attention Mechanism

Abstract

From Token to Item: Enhancing Large Language Models for Recommendation via Item-aware Attention Mechanism

Authors

Abstract

Table of Contents

Figures (6)