Are Rules Meant to be Broken? Understanding Multilingual Moral Reasoning as a Computational Pipeline with UniMoral
Shivani Kumar, David Jurgens
TL;DR
This work introduces UniMoral, a multilingual, holistic dataset and computational pipeline for studying the full moral reasoning process across six languages. By combining psychologically grounded dilemmas with socially sourced Reddit scenarios, and enriching annotations with action choices, ethical principles, contributing factors, consequences, and individual moral-cultural profiles, UniMoral enables four core evaluations on large language models: action prediction, moral typology classification, factor attribution, and consequence generation. The study reveals substantial language-dependent performance gaps, highlights the benefit of contextual cues such as moral values and persona, and shows that real-world Reddit data pose greater challenges than curated psychological scenarios. Overall, UniMoral advances cross-cultural moral reasoning research in NLP and offers a platform for exploring bias, cultural variation, and moral value quantification in AI systems.
Abstract
Moral reasoning is a complex cognitive process shaped by individual experiences and cultural contexts and presents unique challenges for computational analysis. While natural language processing (NLP) offers promising tools for studying this phenomenon, current research lacks cohesion, employing discordant datasets and tasks that examine isolated aspects of moral reasoning. We bridge this gap with UniMoral, a unified dataset integrating psychologically grounded and social-media-derived moral dilemmas annotated with labels for action choices, ethical principles, contributing factors, and consequences, alongside annotators' moral and cultural profiles. Recognizing the cultural relativity of moral reasoning, UniMoral spans six languages, Arabic, Chinese, English, Hindi, Russian, and Spanish, capturing diverse socio-cultural contexts. We demonstrate UniMoral's utility through a benchmark evaluations of three large language models (LLMs) across four tasks: action prediction, moral typology classification, factor attribution analysis, and consequence generation. Key findings reveal that while implicitly embedded moral contexts enhance the moral reasoning capability of LLMs, there remains a critical need for increasingly specialized approaches to further advance moral reasoning in these models.
