Bayesian Online Changepoint Detection
Ryan Prescott Adams, David J. C. MacKay
TL;DR
The paper addresses online changepoint detection by developing an exact Bayesian method to infer the current run length $r_t$ as data arrive, assuming independent parameters before and after changepoints. It derives a recursive, modular algorithm that uses a hazard-based change-point prior and conjugate-exponential models to maintain tractable sufficient statistics and exact predictive distributions. The approach yields a run-length posterior and a causal predictive filter, enabling real-time detection across diverse data types. Demonstrations on well-log, stock-return, and coal-mine data show accurate changepoint localization and intuitive run-length dynamics, highlighting the method's applicability to real-time decision making with modular model integration.
Abstract
Changepoints are abrupt variations in the generative parameters of a data sequence. Online detection of changepoints is useful in modelling and prediction of time series in application areas such as finance, biometrics, and robotics. While frequentist methods have yielded online filtering and prediction techniques, most Bayesian papers have focused on the retrospective segmentation problem. Here we examine the case where the model parameters before and after the changepoint are independent and we derive an online algorithm for exact inference of the most recent changepoint. We compute the probability distribution of the length of the current ``run,'' or time since the last changepoint, using a simple message-passing algorithm. Our implementation is highly modular so that the algorithm may be applied to a variety of types of data. We illustrate this modularity by demonstrating the algorithm on three different real-world data sets.
