The Trembling-Hand Problem for LTLf Planning
Pian Yu, Shufang Zhu, Giuseppe De Giacomo, Marta Kwiatkowska, Moshe Vardi
TL;DR
This work addresses trembling-hand errors in planning for temporally extended goals specified in $LTL_f$, covering both deterministic and nondeterministic environments. It develops a two-pronged solution: for deterministic domains, encode action-instruction errors into an $MDP$ and solve the $LTL_f$ objective via DFA-product methods; for nondeterministic domains, model the problem as an $MDPST$ with robust strategies and solve using a product with the DFA and a specialized value-iteration on a reduced sub-MDPST. A formal framework for $MDPSTs$ with $LTL_f$ objectives is introduced, including a robust satisfaction criterion and efficient Bellman equations. The approach is validated on a human-robot co-assembly scenario, demonstrating correctness and promising scalability through state-space pruning and a proof-of-concept implementation.
Abstract
Consider an agent acting to achieve its temporal goal, but with a "trembling hand". In this case, the agent may mistakenly instruct, with a certain (typically small) probability, actions that are not intended due to faults or imprecision in its action selection mechanism, thereby leading to possible goal failure. We study the trembling-hand problem in the context of reasoning about actions and planning for temporally extended goals expressed in Linear Temporal Logic on finite traces (LTLf), where we want to synthesize a strategy (aka plan) that maximizes the probability of satisfying the LTLf goal in spite of the trembling hand. We consider both deterministic and nondeterministic (adversarial) domains. We propose solution techniques for both cases by relying respectively on Markov Decision Processes and on Markov Decision Processes with Set-valued Transitions with LTLf objectives, where the set-valued probabilistic transitions capture both the nondeterminism from the environment and the possible action instruction errors from the agent. We formally show the correctness of our solution techniques and demonstrate their effectiveness experimentally through a proof-of-concept implementation.
