A theory of appropriateness with applications to generative artificial intelligence
Joel Z. Leibo, Alexander Sasha Vezhnevets, Manfred Diaz, John P. Agapiou, William A. Cunningham, Peter Sunehag, Julia Haas, Raphael Koster, Edgar A. Duéñez-Guzmán, William S. Isaac, Georgios Piliouras, Stanley M. Bileschi, Iyad Rahwan, Simon Osindero
TL;DR
This paper develops a theory of appropriateness as a socially constructed mechanism that guides action across contexts and scales, arguing it offers a more robust governance lens than AI alignment. It models human decision making through predictive pattern completion within a global workspace, connecting memory, perception, and action to normative behavior. By distinguishing explicit and implicit norms, and conventions and sanctions, it explains how norms emerge, stabilize, and change, with implications for safety, policy, and multi-agent AI ecosystems. The authors advocate a decentralized, polycentric approach to AI governance where norm customization, sanctioning, and context sensitivity are harnessed to achieve collective flourishing in a pluralistic society. This framework aims to guide the design and deployment of norm-sensitive AI that can operate safely and adaptively in long-tail, domain-specific contexts while respecting diverse communities.
Abstract
What is appropriateness? Humans navigate a multi-scale mosaic of interlocking notions of what is appropriate for different situations. We act one way with our friends, another with our family, and yet another in the office. Likewise for AI, appropriate behavior for a comedy-writing assistant is not the same as appropriate behavior for a customer-service representative. What determines which actions are appropriate in which contexts? And what causes these standards to change over time? Since all judgments of AI appropriateness are ultimately made by humans, we need to understand how appropriateness guides human decision making in order to properly evaluate AI decision making and improve it. This paper presents a theory of appropriateness: how it functions in human society, how it may be implemented in the brain, and what it means for responsible deployment of generative AI technology.
