Table of Contents
Fetching ...

Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences

Denis Emelin, Ronan Le Bras, Jena D. Hwang, Maxwell Forbes, Yejin Choi

TL;DR

This work introduces Moral Stories, a crowdsourced dataset of grounded, goal-directed moral narratives with explicit normative constraints. It evaluates both classification and generation with varying degrees of grounding and proposes Chain-of-Experts decoding to enforce normative alignment. The results show that rich grounding improves task performance and that abductive reasoning and synthetic consequence signals bolster norm discovery and moral action generation. Overall, the study demonstrates the potential of structured narratives and expert-augmented decoding to steer AI behavior in socially appropriate directions, while outlining avenues for broader norm coverage and ethical safeguards.

Abstract

In social settings, much of human behavior is governed by unspoken rules of conduct. For artificial systems to be fully integrated into social environments, adherence to such norms is a central prerequisite. We investigate whether contemporary NLG models can function as behavioral priors for systems deployed in social settings by generating action hypotheses that achieve predefined goals under moral constraints. Moreover, we examine if models can anticipate likely consequences of (im)moral actions, or explain why certain actions are preferable by generating relevant norms. For this purpose, we introduce 'Moral Stories', a crowd-sourced dataset of structured, branching narratives for the study of grounded, goal-oriented social reasoning. Finally, we propose decoding strategies that effectively combine multiple expert models to significantly improve the quality of generated actions, consequences, and norms compared to strong baselines, e.g. though abductive reasoning.

Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences

TL;DR

This work introduces Moral Stories, a crowdsourced dataset of grounded, goal-directed moral narratives with explicit normative constraints. It evaluates both classification and generation with varying degrees of grounding and proposes Chain-of-Experts decoding to enforce normative alignment. The results show that rich grounding improves task performance and that abductive reasoning and synthetic consequence signals bolster norm discovery and moral action generation. Overall, the study demonstrates the potential of structured narratives and expert-augmented decoding to steer AI behavior in socially appropriate directions, while outlining avenues for broader norm coverage and ethical safeguards.

Abstract

In social settings, much of human behavior is governed by unspoken rules of conduct. For artificial systems to be fully integrated into social environments, adherence to such norms is a central prerequisite. We investigate whether contemporary NLG models can function as behavioral priors for systems deployed in social settings by generating action hypotheses that achieve predefined goals under moral constraints. Moreover, we examine if models can anticipate likely consequences of (im)moral actions, or explain why certain actions are preferable by generating relevant norms. For this purpose, we introduce 'Moral Stories', a crowd-sourced dataset of structured, branching narratives for the study of grounded, goal-oriented social reasoning. Finally, we propose decoding strategies that effectively combine multiple expert models to significantly improve the quality of generated actions, consequences, and norms compared to strong baselines, e.g. though abductive reasoning.

Paper Structure

This paper contains 18 sections, 14 figures, 19 tables.

Figures (14)

  • Figure 1: Example narrative included in Moral Stories.
  • Figure 2: Examples of generated actions. Items followed by ✓ are relevant to both intention and norm, ✗ are not.
  • Figure 3: Additional Moral Stories examples.
  • Figure 4: Additional examples of generated actions. ✓ marks predictions that are relevant to both intention and norm, ✗ those that are not (or are nonsensical.)
  • Figure 5: Examples of generated consequences. ✓ denotes plausible predictions, ✗ marks implausible ones.
  • ...and 9 more figures