Turn-based Multi-Agent Reinforcement Learning Model Checking
Dennis Gross
TL;DR
This paper addresses the challenge of verifying turn-based multi-agent reinforcement learning (TMARL) agents in stochastic multiplayer games, where traditional verification approaches struggle with multi-agent scalability. It introduces a tight integration of TMARL with probabilistic model checking by modeling the system as an $MDP$ and deriving a deterministic $DTMC$ via a joint policy wrapper to support $PCTL$ verification. The approach demonstrates improved scalability over naive monolithic model checking across diverse benchmarks and provides actionable insights into agent behavior and strategic moves. This work advances reliable validation of TMARL in complex environments, with practical implications for safer and better-designed turn-based game AI.
Abstract
In this paper, we propose a novel approach for verifying the compliance of turn-based multi-agent reinforcement learning (TMARL) agents with complex requirements in stochastic multiplayer games. Our method overcomes the limitations of existing verification approaches, which are inadequate for dealing with TMARL agents and not scalable to large games with multiple agents. Our approach relies on tight integration of TMARL and a verification technique referred to as model checking. We demonstrate the effectiveness and scalability of our technique through experiments in different types of environments. Our experiments show that our method is suited to verify TMARL agents and scales better than naive monolithic model checking.
