DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life
Yu Ying Chiu, Liwei Jiang, Yejin Choi
TL;DR
DailyDilemmas provides 1,360 synthetic, non-clear-cut everyday moral dilemmas pairing two actions with stakeholders and associated values, enabling a cross-disciplinary assessment of LLM value prioritization. By mapping 301 values into five frameworks (WVSCulturalMap, Moral Foundations, Maslow, Aristotle’s Virtues, Plutchik’s Emotions) and analyzing action choices across GPT-4-turbo, Claude-3-haiku, and other models, the paper reveals consistent self-expression and care tendencies while exposing model-specific biases. It also evaluates alignment with OpenAI ModelSpec and Anthropic CAI principles and demonstrates limited steerability of value preferences via system prompts, underscoring practical challenges for end users in controlling model ethics in closed APIs. The work advances understanding of how real-world value trade-offs arise in AI decision-making and provides a dataset and analytic framework to guide safer, more aligned AI deployments in daily life.
Abstract
As users increasingly seek guidance from LLMs for decision-making in daily life, many of these decisions are not clear-cut and depend significantly on the personal values and ethical standards of people. We present DailyDilemmas, a dataset of 1,360 moral dilemmas encountered in everyday life. Each dilemma presents two possible actions, along with affected parties and relevant human values for each action. Based on these dilemmas, we gather a repository of human values covering diverse everyday topics, such as interpersonal relationships, workplace, and environmental issues. With DailyDilemmas, we evaluate LLMs on these dilemmas to determine what action they will choose and the values represented by these action choices. Then, we analyze values through the lens of five theoretical frameworks inspired by sociology, psychology, and philosophy, including the World Values Survey, Moral Foundations Theory, Maslow's Hierarchy of Needs, Aristotle's Virtues, and Plutchik's Wheel of Emotions. For instance, we find LLMs are most aligned with self-expression over survival in World Values Survey and care over loyalty in Moral Foundations Theory. Interestingly, we find substantial preference differences in models for some core values. For example, for truthfulness, Mixtral-8x7B neglects it by 9.7% while GPT-4-turbo selects it by 9.4%. We also study the recent guidance released by OpenAI (ModelSpec), and Anthropic (Constitutional AI) to understand how their designated principles reflect their models' actual value prioritization when facing nuanced moral reasoning in daily-life settings. Finally, we find that end users cannot effectively steer such prioritization using system prompts.
