On the Forgetting of Particle Filters
Joona Karjalainen, Anthony Lee, Sumeetpal S. Singh, Matti Vihola
TL;DR
This work shows that if the underlying Feynman–Kac model satisfies a strong mixing condition, the particle-filter state, viewed as an inhomogeneous Markov chain, forgets its initial condition at an exponential rate in total variation, requiring only $O(\log N)$ algorithm steps for forgetting. The authors extend the forgetting analysis to the conditional particle filter (CPF), obtaining analogous $O(\log N)$ forgetting along with time-uniform $L^p$ bounds and new propagation-of-chaos results. They furthermore develop coupling mechanisms and discuss implications for algorithms that couple particle filters, including applications to out-of-sequence measurements. The results substantially improve on prior conservative bounds, provide sharp rates, and offer practical insights for coupling, smoothing, and OOS scenarios in sequential Monte Carlo methods.
Abstract
We study the forgetting properties of the particle filter when its state - the collection of particles - is regarded as a Markov chain. Under a strong mixing assumption on the particle filter's underlying Feynman-Kac model, we find that the particle filter is exponentially mixing, and forgets its initial state in $O(\log N )$ 'time', where $N$ is the number of particles and time refers to the number of particle filter algorithm steps, each comprising a selection (or resampling) and mutation (or prediction) operation. We present an example which shows that this rate is optimal. In contrast to our result, available results to-date are extremely conservative, suggesting $O(α^N)$ time steps are needed, for some $α>1$, for the particle filter to forget its initialisation. We also study the conditional particle filter (CPF) and extend our forgetting result to this context. We establish a similar conclusion, namely, CPF is exponentially mixing and forgets its initial state in $O(\log N )$ time. To support this analysis, we establish new time-uniform $L^p$ error estimates for CPF, which can be of independent interest. We also establish new propagation of chaos type results using our proof techniques, discuss implications to couplings of particle filters and an application to processing out-of-sequence measurements.
