Can LLMs Discern the Traits Influencing Your Preferences? Evaluating Personality-Driven Preference Alignment in LLMs
Tianyu Zhao, Siqi Li, Yasser Shoukry, Salma Elmalaki
TL;DR
This work addresses the brittleness of preference-based personalization in LLMs by introducing PACIFIC, a psychometrically grounded dataset that links stated user preferences to Big Five personality traits. The authors demonstrate that aligning preferences with inferred personality signals substantially improves answer accuracy, achieving large gains over baselines. They present a four-method framework, including explicit trait labeling and retrieval-augmented prompting, and show that explicit cues yield the strongest performance while RAG offers a practical alternative when trait annotations are unavailable. The work highlights the potential of using personality as a compact, robust memory abstraction for user modeling and discusses implications for controllable, user-aligned generation, along with observed biases and future research directions.
Abstract
User preferences are increasingly used to personalize Large Language Model (LLM) responses, yet how to reliably leverage preference signals for answer generation remains under-explored. In practice, preferences can be noisy, incomplete, or even misleading, which can degrade answer quality when applied naively. Motivated by the observation that stable personality traits shape everyday preferences, we study personality as a principled ''latent'' signal behind preference statements. Through extensive experiments, we find that conditioning on personality-aligned preferences substantially improves personalized question answering: selecting preferences consistent with a user's inferred personality increases answer-choice accuracy from 29.25% to 76%, compared to using randomly selected preferences. Based on these findings, we introduce PACIFIC (Preference Alignment Choices Inference for Five-factor Identity Characterization), a personality-labeled preference dataset containing 1200 preference statements spanning diverse domains (e.g., travel, movies, education), annotated with Big-Five (OCEAN) trait directions. Finally, we propose a framework that enables an LLM model to automatically retrieve personality-aligned preferences and incorporate them during answer generation.
