On High-Dimensional Change-Point Detection Based on Pairwise Distances

Spandan Ghoshal; Bilol Banerjee; Anil K. Ghosh

On High-Dimensional Change-Point Detection Based on Pairwise Distances

Spandan Ghoshal, Bilol Banerjee, Anil K. Ghosh

TL;DR

This paper proposes nonparametric, distance-based change-point detection methods that remain effective when the data dimension $d$ greatly exceeds the sample size. By leveraging pairwise distances and an energy-distance-inspired divergence, the authors develop a scalable statistic with a permutation-based significance test, and they extend the framework to generalized distance functionals $\varphi_{h,\psi}$. Theoretical results establish strong consistency and high-dimensional limits under HDLSS and growing-$n$ regimes, with detailed analyses of sparse signals and various distance choices. Empirical studies on simulated HDLSS data and real stock-price returns demonstrate the methods' competitive performance, particularly in detecting scale changes and higher-order distributional differences where Euclidean-distance methods falter. Overall, the work advances robust, nonparametric change-point detection for high-dimensional applications and suggests practical enhancements like block-distance variants for further resilience.

Abstract

In change-point analysis, one aims at finding the locations of abrupt distributional changes (if any) in a sequence of multivariate observations. In this article, we propose some nonparametric methods based on averages of pairwise distances for this purpose. These distance-based methods can be conveniently used for high-dimensional data even when the dimension is much larger than the sample size (i.e., the length of the sequence). We carry out some theoretical investigations on the behaviour of these methods not only when the dimension of the data remains fixed and the sample size grows to infinity, but also in situations where the dimension diverges to infinity while the sample size may or may not grow with the dimension. Several high-dimensional datasets are analyzed to compare the empirical performance of these proposed methods against some state-of-the-art methods.

On High-Dimensional Change-Point Detection Based on Pairwise Distances

TL;DR

This paper proposes nonparametric, distance-based change-point detection methods that remain effective when the data dimension

greatly exceeds the sample size. By leveraging pairwise distances and an energy-distance-inspired divergence, the authors develop a scalable statistic with a permutation-based significance test, and they extend the framework to generalized distance functionals

. Theoretical results establish strong consistency and high-dimensional limits under HDLSS and growing-

regimes, with detailed analyses of sparse signals and various distance choices. Empirical studies on simulated HDLSS data and real stock-price returns demonstrate the methods' competitive performance, particularly in detecting scale changes and higher-order distributional differences where Euclidean-distance methods falter. Overall, the work advances robust, nonparametric change-point detection for high-dimensional applications and suggests practical enhancements like block-distance variants for further resilience.

On High-Dimensional Change-Point Detection Based on Pairwise Distances

TL;DR

Abstract

On High-Dimensional Change-Point Detection Based on Pairwise Distances

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (36)