Automatic Change-Point Detection in Time Series via Deep Learning
Jie Li, Paul Fearnhead, Piotr Fryzlewicz, Tengyao Wang
TL;DR
The paper reframes offline change-point detection as a supervised learning task by training neural networks on labelled change/no-change examples, showing that standard tests like CUSUM can be represented within neural networks and that learned detectors can match or outperform them under model misspecification. It provides theoretical generalisation bounds that decompose error into the baseline test error plus a VC-dimension term, and shows that with practical training sizes, relatively small networks can achieve competitive performance. Empirically, the method performs on par with CUSUM under independent Gaussian noise and substantially better under autocorrelated or heavy-tailed noise, with successful applications to accelerometer-based activity changes. The work also demonstrates a scalable path to detecting and classifying multiple change-types using deep residual CNNs and presents a case study on real data, offering a flexible framework for automatic detector generation and deployment in diverse settings.
Abstract
Detecting change-points in data is challenging because of the range of possible types of change and types of behaviour of data when there is no change. Statistically efficient methods for detecting a change will depend on both of these features, and it can be difficult for a practitioner to develop an appropriate detection method for their application of interest. We show how to automatically generate new offline detection methods based on training a neural network. Our approach is motivated by many existing tests for the presence of a change-point being representable by a simple neural network, and thus a neural network trained with sufficient data should have performance at least as good as these methods. We present theory that quantifies the error rate for such an approach, and how it depends on the amount of training data. Empirical results show that, even with limited training data, its performance is competitive with the standard CUSUM-based classifier for detecting a change in mean when the noise is independent and Gaussian, and can substantially outperform it in the presence of auto-correlated or heavy-tailed noise. Our method also shows strong results in detecting and localising changes in activity based on accelerometer data.
