Online Test Synthesis From Requirements: Enhancing Reinforcement Learning with Game Theory
Ocan Sankur, Thierry Jéron, Nicolas Markey, David Mentré, Reiya Noguchi
TL;DR
The study addresses automatic online synthesis of test cases for black-box reactive systems using automata-based requirements. It introduces a Monte Carlo Tree Search framework augmented with game-theoretic greedy strategies and reward shaping to accelerate convergence toward test objectives while preserving completeness guarantees. The key contribution is a Greedy-MCTS algorithm that biases both tree and roll-out policies using controllable predecessor concepts and cooperative states, demonstrated to dramatically improve success rates in a case study. This approach offers a scalable, principled method for efficient online conformance testing with practical implications for industrial testing of complex reactive systems.
Abstract
We consider the automatic online synthesis of black-box test cases from functional requirements specified as automata for reactive implementations. The goal of the tester is to reach some given state, so as to satisfy a coverage criterion, while monitoring the violation of the requirements. We develop an approach based on Monte Carlo Tree Search, which is a classical technique in reinforcement learning for efficiently selecting promising inputs. Seeing the automata requirements as a game between the implementation and the tester, we develop a heuristic by biasing the search towards inputs that are promising in this game. We experimentally show that our heuristic accelerates the convergence of the Monte Carlo Tree Search algorithm, thus improving the performance of testing.
