Policy-Adaptable Methods For Resolving Normative Conflicts Through Argumentation and Graph Colouring
Johnny Joyce
TL;DR
This work tackles normative conflicts in multi-agent systems by proposing a policy-adaptable framework that merges argumentation with graph colouring to extract admissible, non-conflicting norm sets. It introduces ColourResolve and ColourResolveComplete, along with curtailment-based ColourCurtail and ColourCurtailComplete, and proves admissibility and completeness under argumentation semantics. The authors define flexible heuristics to realize lexicographic policies (e.g., lex posterior, lex superior, lex specialis) and demonstrate how curtailment enables admitting multiple, potentially conflicting norms in a controlled manner, with an overall worst-case complexity of $O(n^3)$. Empirical comparisons against Oren et al.'s work show competitive performance and highlight the practical value of the approach for safe reinforcement learning and real-world autonomous systems, where policy-adaptable, coherent normative reasoning is critical.
Abstract
In a multi-agent system, one may choose to govern the behaviour of an agent by imposing norms, which act as guidelines for how agents should act either all of the time or in given situations. However, imposing multiple norms on one or more agents may result in situations where these norms conflict over how the agent should behave. In any system with normative conflicts (such as safe reinforcement models or systems which monitor safety protocols), one must decide which norms should be followed such that the most important and most relevant norms are maintained. We introduce a new method for resolving normative conflicts through argumentation and graph colouring which is compatible with a variety of normative conflict resolution policies. We prove that this method always creates an admissible set of arguments under argumentation semantics, meaning that it produces coherent outputs. We also introduce more robust variants of this method, each building upon their predecessor to create a superior output, and we include further mathematical proof of their coherence. Our most advanced variant uses the existing concept of curtailment, where one norm may supersede another without fully eliminating it. The methods we introduce are all compatible with various pre-existing policies for resolving normative conflicts. Empirical evaluations are also performed to compare our algorithms to each other and to others in existing literature.
