Shackling Uncertainty using Mixed Criticality in Monte-Carlo Tree Search
Franco Cordeiro, Samuel Tardieu, Laurent Pautet
TL;DR
This work tackles planning under uncertain resource costs in mixed-criticality contexts for autonomous missions. It extends Monte Carlo Tree Search by incorporating Mixed Criticality concepts across multiple resources, notably time and energy, via a two-mode (lo/hi) cost model and per-node budgets, yielding the (MC)2TS framework. The approach jointly aims to maximize hi-action completion under worst-case budgets while preserving lo-action execution in normal conditions, demonstrated on an active-perception drone scenario where it outperforms traditional MCTS under both optimistic and pessimistic cost estimates. The results show that (MC)2TS offers a safe, efficient planning paradigm with replanning capabilities, useful for resource-constrained, mission-critical robotics applications.
Abstract
In the world of embedded systems, optimizing actions with the uncertain costs of multiple resources is a complex challenge. Existing methods include plan building based on Monte Carlo Tree Search (MCTS), an approach that thrives in multiple online planning scenarios. However, these methods often overlook uncertainty in worst-case cost estimations. A system can fail to operate before achieving a critical objective when actual costs exceed optimistic worst-case estimates. Conversely, a system based on pessimistic worst-case estimates would lead to resource over-provisioning even for less critical objectives. To solve similar issues, the Mixed Criticality (MC) approach has been developed in the real-time systems community. In this paper, we propose to extend the MCTS heuristic in three directions. Firstly, we reformulate the concept of MC to account for uncertain worst-case costs. High-criticality tasks must be executed regardless of their uncertain costs. Low-criticality tasks are either executed in low-criticality mode utilizing resources up-to their optimistic worst-case estimates, or executed in high-criticality mode by degrading them, or discarded when resources are scarce. Secondly, although the MC approach was originally developed for real-time systems, focusing primarily on worst-case execution time as the only uncertain resource, our approach extends the concept of resources to deal with several resources at once, such as the time and energy required to perform an action. Finally, we propose an extension of MCTS with MC concepts, which we refer to as $(MC)^2TS$, to efficiently adjust resource allocation to uncertain costs according to the criticality of actions. We demonstrate our approach in an active perception scenario. Our evaluation shows $(MC)^2TS$ outperforms the traditional MCTS regardless of whether the worst case estimates are optimistic or pessimistic.
