A Model for Optimal Resilient Planning Subject to Fallible Actuators
Kyle Baldes, Diptanil Chaudhuri, Jason M. O'Kane, Dylan A. Shell
TL;DR
This work introduces a fallible actuator MDP (FA-MDP) to incorporate utilization-driven actuator failures into long-horizon planning. By modeling failure with a reliability function $\rho$ and malfunction transitions $F$, it enables anticipatory policies that preserve critical actuators for future opportunities, rather than exhaustively risking failure and re-planning. The authors propose a lattice-based solver (Lattice Planner) that operates on a DAG of actuator-subset states, leveraging a local Bellman operator with a contraction factor $\gamma\cdot\overline{\rho} < 1$ and hot-starting to accelerate convergence. Empirical results on gridworld tasks show that the lattice approach scales better than a monolithic solver, especially as the number of actuators grows, demonstrating practical resilience benefits for complex robotic systems.
Abstract
Robots incurring component failures ought to adapt their behavior to best realize still-attainable goals under reduced capacity. We formulate the problem of planning with actuators known a priori to be susceptible to failure within the Markov Decision Processes (MDP) framework. The model captures utilization-driven malfunction and state-action dependent likelihoods of actuator failure in order to enable reasoning about potential impairment and the long-term implications of impoverished future control. This leads to behavior differing qualitatively from plans which ignore failure. As actuators malfunction, there are combinatorially many configurations which can arise. We identify opportunities to save computation through re-use, exploiting the observation that differing configurations yield closely related problems. Our results show how strategic solutions are obtained so robots can respond when failures do occur -- for instance, in prudently scheduling utilization in order to keep critical actuators in reserve.
