IntervalMDP.jl: Accelerated Value Iteration for Interval Markov Decision Processes
Frederik Baymler Mathiesen, Morteza Lahijanian, Luca Laurenti
TL;DR
IntervalMDP.jl addresses efficient analysis of interval Markov Decision Processes by enabling robust value iteration on IMDPs through CPU multi-threading and CUDA GPU acceleration. The framework interoperates with established formats (PRISM, bmdp-tool) and uses NetCDF/JSON storage, while employing bitonic sort and O-maximization to accelerate per-state linear programs. Empirical results show substantial performance gains: CPU speedups of $2$-$4$ times on average and GPU speedups of orders of magnitude for large models, with lower memory consumption than competing tools. The platform enables scalable verification and policy synthesis for IMDPs and lays groundwork for extensions to temporal logics.
Abstract
In this paper, we present IntervalMDP.jl, a Julia package for probabilistic analysis of interval Markov Decision Processes (IMDPs). IntervalMDP.jl facilitates the synthesis of optimal strategies and verification of IMDPs against reachability specifications and discounted reward properties. The library supports sparse matrices and is compatible with data formats from common tools for the analysis of probabilistic models, such as PRISM. A key feature of IntervalMDP.jl is that it presents both a multi-threaded CPU and a GPU-accelerated implementation of value iteration algorithms for IMDPs. In particular, IntervalMDP.jl takes advantage of the Julia type system and the inherently parallelizable nature of value iteration to improve the efficiency of performing analysis of IMDPs. On a set of examples, we show that IntervalMDP.jl substantially outperforms existing tools for verification and strategy synthesis for IMDPs in both computation time and memory consumption.
