Robust Functional Principal Component Analysis for Non-Euclidean Random Objects
Jiazhen Xu, Andrew T. A. Wood, Tao Zou
TL;DR
This work addresses robust analysis of time-varying non-Euclidean objects by transforming object-valued curves into Fréchet median distance trajectories and applying a Winsorized $U$-statistic-based FPCA. The key methodological advances include a robust autocovariance operator $C_{WPU}$ whose eigenfunctions align with the standard case, and a data-driven mechanism to maintain robustness against outliers via a cutoff $Q$ and radius function $\xi$. Theoretical guarantees cover uniform convergence of the Fréchet median, asymptotic Gaussian behavior of the estimator, and robustness properties with explicit breakdown points. Empirical evidence from a NYC Citi Bike case study and simulations shows improved robustness to outliers while preserving competitive performance when data are clean, highlighting the method's practical value for dynamic networks and other non-Euclidean time-varying objects.
Abstract
Functional data analysis offers a diverse toolkit of statistical methods tailored for analyzing samples of real-valued random functions. Recently, samples of time-varying random objects, such as time-varying networks, have been increasingly encountered in modern data analysis. These data structures represent elements of general metric spaces that lack local or global linear structure, rendering traditional functional data analysis methods inapplicable. Moreover, the existing methodology for time-varying random objects does not work well in the presence of outlying objects. In this paper, we propose a robust method for analyzing time-varying random objects. Our method employs pointwise Fréchet medians and then constructs pointwise distance trajectories between the individual time courses and the sample Fréchet medians. This representation effectively transforms time-varying objects into functional data. A novel robust approach to functional principal component analysis, based on a Winsorized U-statistic estimator of the covariance structure, is introduced. The proposed robust analysis of these distance trajectories is able to identify key features of time-varying objects and is useful for downstream analysis. To illustrate the efficacy of our approach, numerical studies focusing on dynamic networks are conducted. The results indicate that the proposed method exhibits good all-round performance and is more robust than the existing approach when handling time-varying object data.
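The distance-trajectory step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: objects are square matrices (for example, adjacency matrices) compared with the Frobenius metric, and the pointwise Fréchet median is approximated by searching over the sample points only (a medoid). The function names `frobenius`, `pointwise_medoid`, and `distance_trajectories` are hypothetical, and the Winsorized U-statistic covariance estimator is not reproduced here.

```python
import numpy as np

def frobenius(a, b):
    """Frobenius distance between two matrices (assumed metric)."""
    return float(np.linalg.norm(a - b))

def pointwise_medoid(objects_t, dist=frobenius):
    """Sample point minimizing the sum of distances to all sample points.

    This restricts the Frechet-median search to the observed objects,
    which is a simplifying assumption made for this sketch.
    """
    n = len(objects_t)
    d = np.array([[dist(objects_t[i], objects_t[j]) for j in range(n)]
                  for i in range(n)])
    return objects_t[int(np.argmin(d.sum(axis=1)))]

def distance_trajectories(curves, dist=frobenius):
    """Map object-valued curves to real-valued distance trajectories.

    curves: array of shape (n, T, p, p), n curves observed on T time points.
    Returns an (n, T) array whose (i, t) entry is the distance from
    curve i at time t to the pointwise medoid at time t.
    """
    n, T = curves.shape[:2]
    out = np.empty((n, T))
    for t in range(T):
        med = pointwise_medoid(curves[:, t], dist)
        out[:, t] = [dist(curves[i, t], med) for i in range(n)]
    return out

# Synthetic data: 20 matrix-valued curves on 5 time points, one shifted outlier.
rng = np.random.default_rng(0)
n, T, p = 20, 5, 4
curves = rng.normal(size=(n, T, p, p))
curves[0] += 10.0  # curve 0 is an outlier at every time point
traj = distance_trajectories(curves)
```

In this toy setting the outlying curve stands out as the trajectory with the largest values at every time point, while the medoid itself is not pulled toward it. The resulting `traj` array is ordinary functional data, to which a robust FPCA could then be applied.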
