Mixed-Integer Programming for Change-point Detection
Apoorva Narula, Santanu S. Dey, Yao Xie
TL;DR
This work develops strengthened MIP formulations for offline change-point detection via piecewise-linear fitting, achieving globally optimal segmentations with tighter LP relaxations. The key advance is an extended, nested segment-assignment representation that yields an integral LP projection (polyhedron (C1)) and tighter relaxations than prior Basic/Alternate formulations, improving solver runtimes across univariate, multidimensional, and sparse change-point tasks. The framework naturally accommodates both continuity and discontinuity in univariate fits and extends to shared-breakpoint models across dimensions, plus a sparsity-enabled variant. Extensive experiments on real-world stock data demonstrate substantial runtime gains and robust performance advantages of the Extended formulations over existing benchmarks, validating the practical impact for large-scale change-point detection. Overall, the approach provides a unified, scalable optimization-based foundation for accurate, interpretable change-point detection in diverse settings.
Abstract
We present a new mixed-integer programming (MIP) approach for offline multiple change-point detection by casting the problem as a globally optimal piecewise linear (PWL) fitting problem. Our main contribution is a family of strengthened MIP formulations whose linear programming (LP) relaxations admit integral projections onto the segment assignment variables, which encode the segment membership of each data point. This property yields provably tighter relaxations than existing formulations for offline multiple change-point detection. We further extend the framework to two settings of active research interest: (i) multidimensional PWL models with shared change-points, and (ii) sparse change-point detection, where only a subset of dimensions undergo structural change. Extensive computational experiments on benchmark real-world datasets demonstrate that the proposed formulations achieve reductions in solution times under both $\ell_1$ and $\ell_2$ loss functions in comparison to the state-of-the-art.
