Table of Contents
Fetching ...

Toward Personalized LLM-Powered Agents: Foundations, Evaluation, and Future Directions

Yue Xu, Qian Chen, Zizhan Ma, Dongrui Liu, Wenxuan Wang, Xiting Wang, Li Xiong, Wenjie Wang

TL;DR

This survey provides a capability-oriented review of personalized LLM-powered agents, and organizes the literature around four interdependent components: profile modeling, memory, planning, and action execution, highlighting cross-component interactions and recurring design trade-offs.

Abstract

Large language models have enabled agents that reason, plan, and interact with tools and environments to accomplish complex tasks. As these agents operate over extended interaction horizons, their effectiveness increasingly depends on adapting behavior to individual users and maintaining continuity across time, giving rise to personalized LLM-powered agents. In such long-term, user-dependent settings, personalization permeates the entire decision pipeline rather than remaining confined to surface-level generation. This survey provides a capability-oriented review of personalized LLM-powered agents. We organize the literature around four interdependent components: profile modeling, memory, planning, and action execution. Using this taxonomy, we synthesize representative methods and analyze how user signals are represented, propagated, and utilized, highlighting cross-component interactions and recurring design trade-offs. We further examine evaluation metrics and benchmarks tailored to personalized agents, summarize application scenarios spanning general assistance to specialized domains, and outline future directions for research and deployment. By offering a structured framework for understanding and designing personalized LLM-powered agents, this survey charts a roadmap toward more user-aligned, adaptive, robust, and deployable agentic systems, accelerating progress from prototype personalization to scalable real-world assistants.

Toward Personalized LLM-Powered Agents: Foundations, Evaluation, and Future Directions

TL;DR

This survey provides a capability-oriented review of personalized LLM-powered agents, and organizes the literature around four interdependent components: profile modeling, memory, planning, and action execution, highlighting cross-component interactions and recurring design trade-offs.

Abstract

Large language models have enabled agents that reason, plan, and interact with tools and environments to accomplish complex tasks. As these agents operate over extended interaction horizons, their effectiveness increasingly depends on adapting behavior to individual users and maintaining continuity across time, giving rise to personalized LLM-powered agents. In such long-term, user-dependent settings, personalization permeates the entire decision pipeline rather than remaining confined to surface-level generation. This survey provides a capability-oriented review of personalized LLM-powered agents. We organize the literature around four interdependent components: profile modeling, memory, planning, and action execution. Using this taxonomy, we synthesize representative methods and analyze how user signals are represented, propagated, and utilized, highlighting cross-component interactions and recurring design trade-offs. We further examine evaluation metrics and benchmarks tailored to personalized agents, summarize application scenarios spanning general assistance to specialized domains, and outline future directions for research and deployment. By offering a structured framework for understanding and designing personalized LLM-powered agents, this survey charts a roadmap toward more user-aligned, adaptive, robust, and deployable agentic systems, accelerating progress from prototype personalization to scalable real-world assistants.
Paper Structure (102 sections, 7 equations, 5 figures, 1 table)

This paper contains 102 sections, 7 equations, 5 figures, 1 table.

Figures (5)

  • Figure 1: Overview of personalized LLM-powered agents. Upon receiving a user request, the agent coordinates profile modeling, memory, planning, and action execution to generate a tailored response. Interaction outcomes provide feedback that refines user preference representations, enabling iterative and long-term personalization.
  • Figure 2: User-specific data in personalization process.
  • Figure 3: Example of a two-dimensional taxonomy of user preferences. Preferences are categorized by their expression form (explicit vs. implicit) and semantic function (behavioral vs. topical), illustrated through a multi-turn conversational recommendation scenario.
  • Figure 4: Taxonomy of personalized LLM-powered agents.
  • Figure 5: Overview of evaluation for personalized LLM-powered agents. Evaluation is organized along three layers: (1) evaluation goals and metric dimensions, including effectiveness, adaptivity, generalization, robustness, and risk; (2) assessment paradigms, such as automatic scoring, rule-based checking, learned evaluators, and LLM-as-a-judge; and (3) representative benchmark families, including interactive alignment and user-substitution settings.