Continuity of Filters for Discrete-Time Control Problems Defined by Explicit Equations
Eugene A. Feinberg, Sayaka Ishizawa, Pavlo O. Kasyanov, David N. Kraemer
TL;DR
The paper addresses the problem of establishing weak continuity of the filter in discrete-time stochastic control with incomplete information, by relating stochastic-equation models to POMDP kernels through Aumann's lemma. It develops sufficient conditions on the transition $F$ and observation $G$ that guarantee the filter kernel $q$ is weakly continuous (and, under stronger conditions, continuous in total variation), enabling existence of optimal policies and convergence of value iterations for the reduced COMDP. A key technical advance is a criterion for total-variation continuity of kernels defined via stochastic equations, leveraging diffeomorphic image mappings and continuity of transformed neighborhoods. The results are illustrated on linear state-space models and inventory-control problems with noisy observations, and extended to POMDP1 formulations, providing practical criteria for ensuring optimal control under partial information in engineering, economics, and related fields.
Abstract
Discrete time control systems whose dynamics and observations are described by stochastic equations are common in engineering, operations research, health care, and economics. For example, stochastic filtering problems are usually defined via stochastic equations. These problems can be reduced to Markov decision processes (MDPs) whose states are posterior state distributions, and transition probabilities for such MDPs are sometimes called filters. This paper investigates sufficient conditions on transition and observation functions for the original problems to guarantee weak continuity of the filter. Under mild conditions on cost functions, weak continuity implies the existence of optimal policies minimizing the expected total costs, the validity of optimality equations, and convergence of value iterations to optimal values. This paper uses recent results on weak continuity of filters for partially observable MDPs defined by transition and observation probabilities. It develops a criterion of weak continuity of transition probabilities and a sufficient condition for continuity in total variation of transition probabilities. The results are illustrated with applications to filtering problems.
