PersLLM: A Personified Training Approach for Large Language Models
Zheni Zeng, Jiayi Chen, Huimin Chen, Yukun Yan, Yuxuan Chen, Zhenghao Liu, Zhiyuan Liu, Maosong Sun
TL;DR
PersLLM presents a data-centric approach to LLM personification by integrating structured data construction (with anti-induction and CoT prompting, temporal staging, and objective/subjective material separation) and a two-step tuning pipeline (personified conversational tuning plus automatic DPO) to produce dynamic, distinctive personalities. The framework is validated theoretically via personality scale analyses and practically through HP-like and real-world case studies, showing improved personality distinction, consistency, and more natural human-agent interactions. Key contributions include detailed data construction strategies, an automated DPO-based tuning mechanism, and publicly available code and demonstrations. The results suggest PersLLM can enhance human-like interactions, multi-agent collaboration, and domain-specific personification, with implications for education, consultation, and social simulations.
Abstract
Large language models (LLMs) exhibit human-like intelligence, enabling them to simulate human behavior and support various applications that require both humanized communication and extensive knowledge reserves. Efforts are made to personify LLMs with special training data or hand-crafted prompts, while correspondingly faced with challenges such as insufficient data usage or rigid behavior patterns. Consequently, personified LLMs fail to capture personified knowledge or express persistent opinion. To fully unlock the potential of LLM personification, we propose PersLLM, a framework for better data construction and model tuning. For insufficient data usage, we incorporate strategies such as Chain-of-Thought prompting and anti-induction, improving the quality of data construction and capturing the personality experiences, knowledge, and thoughts more comprehensively. For rigid behavior patterns, we design the tuning process and introduce automated DPO to enhance the specificity and dynamism of the models' personalities, which leads to a more natural opinion communication. Both automated metrics and expert human evaluations demonstrate the effectiveness of our approach. Case studies in human-machine interactions and multi-agent systems further suggest potential application scenarios and future directions for LLM personification.
