Functional Random Forest with Adaptive Cost-Sensitive Splitting for Imbalanced Functional Data Classification
Fahad Mostafa, Hafiz Khan
TL;DR
This work tackles the challenge of class-imbalanced functional data classification by introducing Functional Random Forest with Adaptive Cost-Sensitive Splitting (FRF-ACS). The method represents curves via FPCA/basis expansions, uses a locally adaptive impurity measure to emphasize minority classes, and employs a hybrid resampling strategy (Functional SMOTE plus cost-sensitive bootstrapping) with curve-aware leaf similarity. Theoretical support is provided through FPCA truncation error identities and a link between the adaptive impurity and weighted misclassification risk, complemented by a consistency argument. Empirically, FRF-ACS yields substantial gains in minority-class detection (F1, AUPRC, MCC) and balanced accuracy across synthetic and four real functional datasets, demonstrating robustness to noise and imbalance while maintaining interpretability.
Abstract
Classification of functional data where observations are curves or trajectories poses unique challenges, particularly under severe class imbalance. Traditional Random Forest algorithms, while robust for tabular data, often fail to capture the intrinsic structure of functional observations and struggle with minority class detection. This paper introduces Functional Random Forest with Adaptive Cost-Sensitive Splitting (FRF-ACS), a novel ensemble framework designed for imbalanced functional data classification. The proposed method leverages basis expansions and Functional Principal Component Analysis (FPCA) to represent curves efficiently, enabling trees to operate on low dimensional functional features. To address imbalance, we incorporate a dynamic cost sensitive splitting criterion that adjusts class weights locally at each node, combined with a hybrid sampling strategy integrating functional SMOTE and weighted bootstrapping. Additionally, curve specific similarity metrics replace traditional Euclidean measures to preserve functional characteristics during leaf assignment. Extensive experiments on synthetic and real world datasets including biomedical signals and sensor trajectories demonstrate that FRF-ACS significantly improves minority class recall and overall predictive performance compared to existing functional classifiers and imbalance handling techniques. This work provides a scalable, interpretable solution for high dimensional functional data analysis in domains where minority class detection is critical.
