Operationalising Rawlsian Ethics for Fairness in Norm-Learning Agents
Jessica Woodgate, Paul Marshall, Nirav Ajmeri
TL;DR
The paper addresses the emergence of fair norms in multi-agent systems by integrating normative ethics into learning agents. It introduces RAWL-${\cdot}$E, a framework that operationalises Rawlsian maximin through an ethics module that shapes rewards, enabling explicit norm learning via a norms module and learning via a DQN-based interaction module. The key contributions are the design of an ethics-informed norm-learning architecture, the demonstration that emergent norms prioritize the least advantaged, and empirical evidence showing improved social welfare, reduced inequality, and greater robustness in two harvesting scenarios. The results suggest that embedding normative ethics into RL agents can yield fairer, more sustainable, and cooperative MAS with practical implications for ethically-aligned autonomous systems.
Abstract
Social norms are standards of behaviour common in a society. However, when agents make decisions without considering how others are impacted, norms can emerge that lead to the subjugation of certain agents. We present RAWL-E, a method to create ethical norm-learning agents. RAWL-E agents operationalise maximin, a fairness principle from Rawlsian ethics, in their decision-making processes to promote ethical norms by balancing societal well-being with individual goals. We evaluate RAWL-E agents in simulated harvesting scenarios. We find that norms emerging in RAWL-E agent societies enhance social welfare, fairness, and robustness, and yield higher minimum experience compared to those that emerge in agent societies that do not implement Rawlsian ethics.
