Quickest Change Detection for Multiple Data Streams Using the James-Stein Estimator
Topi Halme, Venugopal V. Veeravalli, Visa Koivunen
TL;DR
This work tackles quickest change detection in multiple Gaussian data streams with an unknown mean shift. It leverages the James-Stein shrinkage estimator to form two tests: JS-WL-CuSum and JS-SRRS, achieving uniform reductions in detection delay across post-change parameters and ARL constraints, especially for large $K$. The authors provide non-asymptotic upper bounds and second-order asymptotic minimax results, showing that shrinkage improves performance without sacrificing optimality, and they validate the theory with simulations showing notable gains over ML-based and GLR methods. The results highlight the practical impact of integrating prior structural information via shrinkage into online detection, offering scalable and robust performance in high-dimensional settings.
Abstract
The problem of quickest change detection is studied in the context of detecting an arbitrary unknown mean-shift in multiple independent Gaussian data streams. The James-Stein estimator is used in constructing detection schemes that exhibit strong detection performance both asymptotically and non-asymptotically. Our results indicate that utilizing the James-Stein estimator in the recently developed window-limited CuSum test constitutes a uniform improvement over its typical maximum likelihood variant. That is, the proposed James-Stein version achieves a smaller detection delay simultaneously for all possible post-change parameter values and every false alarm rate constraint, as long as the number of parallel data streams is greater than three. Additionally, an alternative detection procedure that utilizes the James-Stein estimator is shown to have asymptotic detection delay properties that compare favorably to existing tests. The second-order asymptotic detection delay term is reduced in a predefined low-dimensional subspace of the parameter space, while second-order asymptotic minimaxity is preserved. The results are verified in simulations, where the proposed schemes are shown to achieve smaller detection delays compared to existing alternatives, especially when the number of data streams is large.
