ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs
Hua Shen, Tiffany Knearem, Reshmi Ghosh, Yu-Ju Yang, Nicholas Clark, Tanushree Mitra, Yun Huang
TL;DR
ValueCompass addresses how to quantify and improve alignment between humans and LLMs across real-world contexts. It integrates Schwartz's Theory of Basic Values into a three-part framework: a contextual value alignment instrument (Value Form), robust prompting strategies, and quantitative alignment metrics (Alignment Rate, Alignment Distance, Alignment Ranking). The framework is applied to four scenarios and five LLMs with 112 human participants across seven countries, revealing widespread misalignments (e.g., humans favor National Security values that LLMs reject) and clear context effects, with the best F1 reaching $0.529$. The authors argue for context-aware, human-in-the-loop alignment strategies and demonstrate ValueCompass as a practical diagnostic tool to guide responsible AI design and governance.
Abstract
As AI systems become more advanced, ensuring their alignment with a diverse range of individuals and societal values becomes increasingly critical. But how can we capture fundamental human values and assess the degree to which AI systems align with them? We introduce ValueCompass, a framework of fundamental values, grounded in psychological theory and a systematic review, to identify and evaluate human-AI alignment. We apply ValueCompass to measure the value alignment of humans and large language models (LLMs) across four real-world scenarios: collaborative writing, education, public sectors, and healthcare. Our findings reveal concerning misalignments between humans and LLMs, such as humans frequently endorse values like "National Security" which were largely rejected by LLMs. We also observe that values differ across scenarios, highlighting the need for context-aware AI alignment strategies. This work provides valuable insights into the design space of human-AI alignment, laying the foundations for developing AI systems that responsibly reflect societal values and ethics.
