Playing Stochastically in Weighted Timed Games to Emulate Memory
Benjamin Monmege, Julie Parreaux, Pierre-Alain Reynier
TL;DR
This work investigates how stochastic (randomized) strategies can substitute for finite-memory strategies in weighted timed games (WTGs) and, conversely, how memory can emulate randomness. It defines a rigorous stochastic framework for WTGs, establishing measurability and finiteness conditions for expected payoffs, and demonstrates that in divergent WTGs and in untimed shortest-path games (SPGs) the stochastic value equals the deterministic value. The authors prove that Min can emulate memory with randomization and that Max can play memoryless optimally against Min’s strategies, leading to a determinacy result where stochastic, memoryless, and deterministic values all coincide for these classes. They also introduce switching strategies for Min in divergent WTGs and construct a memoryless stochastic interpolation η^p to approximate switching behavior, providing algorithmic insights via region-based analyses. Overall, the paper clarifies the memory-randomness trade-off, yielding practical implications for strategy design in real-time quantitative games and suggesting avenues for efficient synthesis and implementation.
Abstract
Weighted timed games are two-player zero-sum games played in a timed automaton equipped with integer weights. We consider optimal reachability objectives, in which one of the players, that we call Min, wants to reach a target location while minimising the cumulated weight. While knowing if Min has a strategy to guarantee a value lower than a given threshold is known to be undecidable (with two or more clocks), several conditions, one of them being divergence, have been given to recover decidability. In such weighted timed games (like in untimed weighted games in the presence of negative weights), Min may need finite memory to play (close to) optimally. This is thus tempting to try to emulate this finite memory with other strategic capabilities. In this work, we allow the players to use stochastic decisions, both in the choice of transitions and of timing delays. We give a definition of the expected value in weighted timed games. We then show that, in divergent weighted timed games as well as in (untimed) weighted games (that we call shortest-path games in the following), the stochastic value is indeed equal to the classical (deterministic) value, thus proving that Min can guarantee the same value while only using stochastic choices, and no memory.
