FedST: Secure Federated Shapelet Transformation for Time Series Classification
Zhiyu Liang, Hongzhi Wang
TL;DR
FedST presents a secure federated shapelet transformation for time series classification, enabling cross-organization learning without data sharing. It defines a federated shapelet search kernel, analyzes security and efficiency bottlenecks, and proposes FL-tailored acceleration techniques including a secure dot-product protocol, sorting-based IG acceleration, and an F-stat trade-off, achieving substantial speedups while preserving accuracy and interpretability. Experimental results on 97 UCR datasets and synthetic data demonstrate FedST’s competitive accuracy relative to centralized baselines and significant efficiency gains, including up to thousands of times faster federated search with combined optimizations. The framework can be extended with differential privacy to protect outputs further, making FedST a practical, interpretable, privacy-preserving solution for time series classification in federated settings.
Abstract
This paper explores how to build a shapelet-based time series classification (TSC) model in the federated learning (FL) scenario, that is, using more data from multiple owners without actually sharing the data. We propose FedST, a novel federated TSC framework extended from a centralized shapelet transformation method. We recognize the federated shapelet search step as the kernel of FedST. Thus, we design a basic protocol for the FedST kernel that we prove to be secure and accurate. However, we identify that the basic protocol suffers from efficiency bottlenecks and the centralized acceleration techniques lose their efficacy due to the security issues. To speed up the federated protocol with security guarantee, we propose several optimizations tailored for the FL setting. Our theoretical analysis shows that the proposed methods are secure and more efficient. We conduct extensive experiments using both synthetic and real-world datasets. Empirical results show that our FedST solution is effective in terms of TSC accuracy, and the proposed optimizations can achieve three orders of magnitude of speedup.
