Personalize Your LLM: Fake it then Align it
Yijing Zhang, Dyah Adila, Changho Shin, Frederic Sala
TL;DR
CHAMELEON tackles scalable LLM personalization by coupling self-generated synthetic user preference data with representation editing, enabling per-user alignment without fine-tuning or large labeled datasets. It learns personalized and non-personalized embedding subspaces (via SVD and CCS) and applies inference-time edits to enhance user-specific behavior, while also enabling efficient group-scale personalization. Empirical results on the LaMP benchmark show CHAMELEON outperforms two personalization baselines and a time-constrained DPO setup, with average gains around $40\%$ across architectures, and demonstrated generalization to unseen users. The approach offers data- and compute-efficient personalization with practical implications for deploying personalized LLMs at scale, albeit with limitations tied to the quality of self-generated data and privacy considerations.
Abstract
Personalizing large language models (LLMs) is essential for delivering tailored interactions that improve user experience. Many existing personalization methods require fine-tuning LLMs for each user, rendering them prohibitively expensive for widespread adoption. Although retrieval-based approaches offer a more compute-efficient alternative, they still depend on large, high-quality datasets that are not consistently available for all users. To address this challenge, we propose CHAMELEON, a scalable and efficient personalization approach that uses (1) self-generated personal preference data and (2) representation editing to enable quick and cost-effective personalization. Our experiments on various tasks, including those from the LaMP personalization benchmark, show that CHAMELEON efficiently adapts models to personal preferences, improving instruction-tuned models and outperforms two personalization baselines by an average of 40% across two model architectures.
