Higher-criticism for sparse multi-stream change-point detection
Tingnan Gong, Alon Kipnis, Yao Xie
TL;DR
This work introduces a higher-criticism (HC) based approach for sparse multi-stream sequential change-point detection, combining per-stream change tests (e.g., LR or GLR) via HC to achieve rapid detection when only a small subset of streams is affected. The authors formulate a sparse heteroscedastic normal model and derive an information-theoretic lower bound on detection delay; they prove that HC-based detection attains this bound, with the delay converging to $oldsymbol{ riangle}^*(r,eta, ext{var})$, and provide a phase-transition view between undetectable and detectable regimes. The analysis accommodates unknown sparsity, mean, and variance changes and demonstrates robustness to heteroscedasticity; extensive simulations show HC often outperforms competing methods in sparse settings and provides practical stream localization. The work bridges offline sparse signal detection techniques with online sequential detection, offering a theoretically grounded, adaptable framework for real-time multi-stream anomaly detection with clear implications for high-dimensional monitoring and fault localization.
Abstract
We study a statistical procedure based on higher criticism (HC) to address the sparse multi-stream quickest change-point detection problem. Namely, we aim to detect a potential change in the distribution of multiple data streams at some unknown time. If a change occurs, only a few streams are affected, whereas the identity of the affected streams is unknown. The HC-based procedure involves testing for a change point in individual streams and combining multiple tests using higher criticism. Relying on HC thresholding, the procedure also indicates a set of streams suspected to be affected by the change. We provide a theoretical analysis under a sparse heteroscedastic normal change-point model. We establish an information-theoretic detection delay lower bound when individual tests are based on the likelihood ratio or the generalized likelihood ratio statistics and show that the delay of the HC-based method converges in distribution to this bound. In the special case of constant variance, our bound coincides with known results in (Chan, 2017). We demonstrate the effectiveness of the HC-based method compared to other methods in detecting sparse changes through extensive numerical evaluations.
