Adaptive Bernstein Change Detector for High-Dimensional Data Streams
Marco Heyden, Edouard Fouché, Vadim Arzamasov, Tanja Fenn, Florian Kalinke, Klemens Böhm
TL;DR
ABCD provides a principled, online detector for changes in high-dimensional data streams by tracking the reconstruction loss $L_t$ of a learned encoder–decoder within an adaptive window and applying a Bernstein-based test to detect changes. It outputs the change point $t^*$, the affected subspace $D^*$, and a severity measure $\Delta$, with the ability to operate with PCA, kernel PCA, or autoencoder architectures and maintain constant-time online statistics. The method integrates change-point detection, subspace identification, and severity quantification into a single framework, and, on synthetic and real HD streams, yields up to 20% higher F1-score and higher precision than competitors while accurately locating subspaces and correlating severity with ground truth. This approach offers practical utility for real-time monitoring and adaptive learning in complex systems with evolving, high-dimensional data.
Abstract
Change detection is of fundamental importance when analyzing data streams. Detecting changes both quickly and accurately enables monitoring and prediction systems to react, e.g., by issuing an alarm or by updating a learning algorithm. However, detecting changes is challenging when observations are high-dimensional. In high-dimensional data, change detectors should not only be able to identify when changes happen, but also in which subspace they occur. Ideally, one should also quantify how severe they are. Our approach, ABCD, has these properties. ABCD learns an encoder-decoder model and monitors its accuracy over a window of adaptive size. ABCD derives a change score based on Bernstein's inequality to detect deviations in terms of accuracy, which indicate changes. Our experiments demonstrate that ABCD outperforms its best competitor by up to 20% in F1-score on average. It can also accurately estimate changes' subspace, together with a severity measure that correlates with the ground truth.
