PROPER: A Progressive Learning Framework for Personalized Large Language Models with Group-Level Adaptation
Linhai Zhang, Jialong Wu, Deyu Zhou, Yulan He
TL;DR
PROPER introduces a progressive three-stage personalization framework for large language models that bridges population-level knowledge to user-level customization through a meso-level group model. By combining LoRA-based adapters with a Mixture-of-Experts routing mechanism and two specialized routers (user-aware and LoRA-aware), PROPER captures group-level residuals and individual user preferences while controlling computational cost. Empirical results on the LaMP benchmark show PROPER consistently outperforms state-of-the-art fine-tuning baselines across seven tasks, with notable gains in data-sparse settings and clear evidence of progressive improvements across stages. The work advances efficient, scalable personalization for LLMs and opens avenues for multi-task, evolving-user scenarios alongside consideration of privacy and fairness implications.
Abstract
Personalized large language models (LLMs) aim to tailor their outputs to user preferences. Recent advances in parameter-efficient fine-tuning (PEFT) methods have highlighted the effectiveness of adapting population-level LLMs to personalized LLMs by fine-tuning user-specific parameters with user history. However, user data is typically sparse, making it challenging to adapt LLMs to specific user patterns. To address this challenge, we propose PROgressive PERsonalization (PROPER), a novel progressive learning framework inspired by meso-level theory in social science. PROPER bridges population-level and user-level models by grouping users based on preferences and adapting LLMs in stages. It combines a Mixture-of-Experts (MoE) structure with Low Ranked Adaptation (LoRA), using a user-aware router to assign users to appropriate groups automatically. Additionally, a LoRA-aware router is proposed to facilitate the integration of individual user LoRAs with group-level LoRAs. Experimental results show that PROPER significantly outperforms SOTA models across multiple tasks, demonstrating the effectiveness of our approach.
