Conditional Score Learning for Quickest Change Detection in Markov Transition Kernels
Wuxia Chen, Taposh Banerjee, Vahid Tarokh
TL;DR
The paper tackles quickest change detection in Markov processes with unknown transition kernels by learning the conditional score $\nabla_{\mathbf{y}} \log p(\mathbf{y}|\mathbf{x})$ from paired samples, enabling likelihood-free detection. It introduces a conditional Hyvärinen score framework and a score-based CUSUM (SCUSUM) procedure, including a truncated variant for stability and bounded increments. Theoretical results establish exponential lower bounds on mean time to false alarm and asymptotic upper bounds on detection delay under Doeblin-type uniform ergodicity, using Hoeffding-type concentration for dependent data. Empirically, the approach is validated on synthetic Gaussian-transition Markov processes and real CMU Motion Capture data, showing accurate conditional-score learning and timely change detection across multiple scenarios. Overall, the work extends score-based quickest change detection from i.i.d. settings to dependent, high-dimensional processes with practical training and rigorous performance guarantees.
Abstract
We address the problem of quickest change detection in Markov processes with unknown transition kernels. The key idea is to learn the conditional score $\nabla_{\mathbf{y}} \log p(\mathbf{y}|\mathbf{x})$ directly from sample pairs $( \mathbf{x},\mathbf{y})$, where both $\mathbf{x}$ and $\mathbf{y}$ are high-dimensional data generated by the same transition kernel. In this way, we avoid explicit likelihood evaluation and provide a practical way to learn the transition dynamics. Based on this estimation, we develop a score-based CUSUM procedure that uses conditional Hyvarinen score differences to detect changes in the kernel. To ensure bounded increments, we propose a truncated version of the statistic. With Hoeffding's inequality for uniformly ergodic Markov processes, we prove exponential lower bounds on the mean time to false alarm. We also prove asymptotic upper bounds on detection delay. These results give both theoretical guarantees and practical feasibility for score-based detection in high-dimensional Markov models.
