Rehearse With User: Personalized Opinion Summarization via Role-Playing based on Large Language Models
Yanyue Zhang, Yulan He, Deyu Zhou
TL;DR
The paper tackles personalized opinion summarization for multi-document product reviews by introducing Rehearsal, a multi-agent framework in which an LLM-based summary agent produces a general summary, a user agent engages in role-playing with supervision to capture individual interests, and a retrieval-augmented rewriting step tailors the final personalized summary. It formalizes role-playing with practice and evaluation against four consistency metrics, and employs retrieval-augmented rewriting to prune irrelevant content and incorporate user-relevant information. The authors curate PerSum, a specialized dataset built on Amazon reviews, and demonstrate that Rehearsal improves personalization alignment across multiple base models, though they note limitations in supervision quality and evaluation metrics. The work advances personalized summarization by combining role-playing, supervision, and retrieval augmentation to better reflect reader interests in summaries, with implications for user-centric recommendation and evaluation in large-language-model-driven systems.
Abstract
Personalized opinion summarization is crucial as it considers individual user interests while generating product summaries. Recent studies show that although large language models demonstrate powerful text summarization and evaluation capabilities without the need for training data, they face difficulties in personalized tasks involving long texts. To address this, \textbf{Rehearsal}, a personalized opinion summarization framework via LLMs-based role-playing is proposed. Having the model act as the user, the model can better understand the user's personalized needs. Additionally, a role-playing supervisor and practice process are introduced to improve the role-playing ability of the LLMs, leading to a better expression of user needs. Furthermore, through suggestions from virtual users, the summary generation is intervened, ensuring that the generated summary includes information of interest to the user, thus achieving personalized summary generation. Experiment results demonstrate that our method can effectively improve the level of personalization in large model-generated summaries.
