Multiscale Quantile Regression with Local Error Control
Zhi Liu, Housen Li
TL;DR
The paper tackles robust change-point detection in time series by proposing MUSCLE, a multiscale quantile segmentation method with local error control. It combines a multiscale testing framework on segments, a variational estimator, and a wavelet-tree–based dynamic programming implementation to achieve strong finite-sample guarantees, including FDR and OER control via a single tuning parameter. The authors establish consistency and localization rates under mild signal-to-noise conditions, and demonstrate competitive performance against a wide range of methods on simulations and real data from electrophysiology and geophysics. Extensions to distributional changes (M-MUSCLE) and dependent data (D-MUSCLE) broaden applicability, while the splitting-merging variant MUSCLE-S offers scalable computation for large datasets. Overall, MUSCLE provides robust, powerful change-point detection with interpretable results and practical software availability through the muscle R package.
Abstract
For robust and efficient detection of change points, we introduce a novel methodology MUSCLE (Multiscale qUantile Segmentation Controlling Local Error) that partitions serial data into multiple segments, each sharing a common quantile. It leverages multiple tests for quantile changes over different scales and locations, and variational estimation. Unlike the often adopted global error control, MUSCLE focuses on local errors defined on individual segments, significantly improving detection power in finding change points. Meanwhile, due to the built-in model complexity penalty, it enjoys the finite sample guarantee that its false discovery rate (or the expected proportion of falsely detected change points) is upper bounded by its unique tuning parameter. Further, we obtain the consistency and the localization error rates in estimating change points, under mild signal-to-noise-ratio conditions. Both match (up to log factors) the minimax optimality results in the Gaussian setup. All theories hold under the only distributional assumption of serial independence. Incorporating the wavelet tree data structure, we develop an efficient dynamic programming algorithm for computing MUSCLE. Extensive simulations as well as real data applications in electrophysiology and geophysics demonstrate its competitiveness and effectiveness. An implementation via R package muscle is available on GitHub.
