Approximate Information States for Worst-Case Control and Learning in Uncertain Systems
Aditya Dave, Nishanth Venkatesh, Andreas A. Malikopoulos
TL;DR
This work addresses robust, worst-case decision-making under partial observations in non-stochastic uncertain systems by introducing information states and their approximate counterparts. The authors develop a dynamic-programming framework that uses an information-state $oldsymbol{}_t$ to achieve optimal planning, and extend it to approximate information states $oldsymbol{} Pi_t$ that can be learned from outputs with provable bounded loss via an approximate DP. They provide explicit conditions and Lipschitz-based bounds that control approximation error, and illustrate the approach through state-quantization examples and learning-based methods in wall-defense and pursuit-evasion problems. The results offer a principled path to tractable worst-case control and robust reinforcement learning under partial observations, with demonstrated computational and performance benefits in representative scenarios.
Abstract
In this paper, we investigate discrete-time decision-making problems in uncertain systems with partially observed states. We consider a non-stochastic model, where uncontrolled disturbances acting on the system take values in bounded sets with unknown distributions. We present a general framework for decision-making in such problems by using the notion of the information state and approximate information state, and introduce conditions to identify an uncertain variable that can be used to compute an optimal strategy through a dynamic program (DP). Next, we relax these conditions and define approximate information states that can be learned from output data without knowledge of system dynamics. We use approximate information states to formulate a DP that yields a strategy with a bounded performance loss. Finally, we illustrate the application of our results in control and reinforcement learning using numerical examples.
