PersonaX: A Recommendation Agent Oriented User Modeling Framework for Long Behavior Sequence
Yunxiao Shi, Wujiang Xu, Zeqi Zhang, Xing Zi, Qiang Wu, Min Xu
TL;DR
PersonaX tackles the challenge of modeling users from long behavioral histories for LLM-based recommendation agents by performing offline core-set construction of Sub-Behavior Sequences (SBS) through hierarchical clustering, adaptive budget allocation, and a prototypicality-diversity objective to generate multiple textual personas cached for online retrieval. It decouples profile generation from online inference, enabling fast, cached retrieval that improves downstream ranking while using only about $30 ext{-}50 ext{ } ext{%}$ of the historical data. Empirical results with AgentCF and Agent4Rec across CDs50, CDs200, and Books480 show consistent improvements (roughly $3$–$11 ext{%}$ for AgentCF and $10 ext{–}50 ext{%}$ for Agent4Rec) and significant reductions in online latency, especially on long sequences. The work offers a scalable, model-agnostic solution with practical guidance on hyper-parameter tuning and data efficiency for long-horizon user modeling in production recommendation systems.
Abstract
User profile embedded in the prompt template of personalized recommendation agents play a crucial role in shaping their decision-making process. High-quality user profiles are essential for aligning agent behavior with real user interests. Typically, these profiles are constructed by leveraging LLMs for user profile modeling (LLM-UM). However, this process faces several challenges: (1) LLMs struggle with long user behaviors due to context length limitations and performance degradation. (2) Existing methods often extract only partial segments from full historical behavior sequence, inevitably discarding diverse user interests embedded in the omitted content, leading to incomplete modeling and suboptimal profiling. (3) User profiling is often tightly coupled with the inference context, requiring online processing, which introduces significant latency overhead. In this paper, we propose PersonaX, an agent-agnostic LLM-UM framework to address these challenges. It augments downstream recommendation agents to achieve better recommendation performance and inference efficiency. PersonaX (a) segments complete historical behaviors into clustered groups, (b) selects multiple sub behavior sequences (SBS) with a balance of prototypicality and diversity to form a high quality core set, (c) performs offline multi-persona profiling to capture diverse user interests and generate fine grained, cached textual personas, and (d) decouples user profiling from online inference, enabling profile retrieval instead of real time generation. Extensive experiments demonstrate its effectiveness: using only 30 to 50% of behavioral data (sequence length 480), PersonaX enhances AgentCF by 3 to 11% and Agent4Rec by 10 to 50%. As a scalable and model-agnostic LLM-UM solution, PersonaX sets a new benchmark in scalable user modeling.
