Persistence-Based Statistics for Detecting Structural Changes in High-Dimensional Point Clouds
Toshiyuki Nakayama
TL;DR
The paper addresses the challenge of detecting structural changes in high-dimensional random point clouds by developing a statistically rigorous, nonparametric framework based on persistence statistics. It establishes moment bounds for total and maximum persistence under general distributions and Gaussian mixtures, and introduces a bounded, normalized statistic—PL+JS—constructed from persistence landscapes and the Jensen–Shannon divergence to enable stable, scale- and shift-invariant change detection via permutation tests. The authors prove Hölder continuity of the PL+JS-based change measure with respect to perturbations of input point clouds, enabling robust finite-sample inference, and demonstrate practical applicability with dynamic attribute data from decentralized governance. Overall, the work provides a theoretically grounded, computationally stable approach to identifying regime shifts in complex, high-dimensional data, with concrete guidance for parameter tuning and applicability to real-world dynamic systems.
Abstract
We study the probabilistic behavior of persistence-based statistics and propose a novel nonparametric framework for detecting structural changes in high-dimensional random point clouds. We establish moment bounds and tightness results for classical persistence statistics-total and maximum persistence-under general distributions, with explicit variance-scaling behavior derived for Gaussian mixture models. Building on these results, we introduce a bounded and normalized statistic based on persistence landscapes combined with the Jensen-Shannon divergence, and we prove its Holder continuity with respect to perturbations of the input point clouds. The resulting measure is stable, scale- and shift-invariant, and well suited for finite-sample nonparametric inference via permutation testing. An illustrative numerical study using dynamic attribute vectors from decentralized governance data demonstrates the practical applicability of the proposed method. Overall, this work provides a statistically rigorous and computationally stable approach to change-point detection in complex, high-dimensional data.
