A Federated Data Fusion-Based Prognostic Model for Applications with Multi-Stream Incomplete Signals

Madi Arabi; Xiaolei Fang

A Federated Data Fusion-Based Prognostic Model for Applications with Multi-Stream Incomplete Signals

Madi Arabi, Xiaolei Fang

TL;DR

The paper tackles the challenge of building reliable prognostic models when historical data are scarce and distributed across multiple organizations. It introduces a privacy-preserving federated framework that fuses multi-stream degradation signals via multivariate functional PCA to produce MFPC-scores, which are then used in a (log)-location-scale regression for failure-time prediction. A two-stage federated feature extraction algorithm—federated dominant subspace identification and federated MFPC-scores computation—enables joint learning without sharing raw data, while a federated LLS regression estimates model parameters. Extensive simulations and a NASA turbofan case study show the Federated Model achieves the same predictive accuracy as a centralized approach and outperforms models built by individual users, especially when local data are limited. The work demonstrates that privacy-preserving, distributed prognostics with incomplete multi-sensor signals can attain centralized-level performance and scale to many participants in industrial settings.

Abstract

Most prognostic methods require a decent amount of data for model training. In reality, however, the amount of historical data owned by a single organization might be small or not large enough to train a reliable prognostic model. To address this challenge, this article proposes a federated prognostic model that allows multiple users to jointly construct a failure time prediction model using their multi-stream, high-dimensional, and incomplete data while keeping each user's data local and confidential. The prognostic model first employs multivariate functional principal component analysis to fuse the multi-stream degradation signals. Then, the fused features coupled with the times-to-failure are utilized to build a (log)-location-scale regression model for failure prediction. To estimate parameters using distributed datasets and keep the data privacy of all participants, we propose a new federated algorithm for feature extraction. Numerical studies indicate that the performance of the proposed model is the same as that of classic non-federated prognostic models and is better than that of the models constructed by each user itself.

A Federated Data Fusion-Based Prognostic Model for Applications with Multi-Stream Incomplete Signals

TL;DR

Abstract

Paper Structure (23 sections, 1 theorem, 12 equations, 14 figures, 2 algorithms)

This paper contains 23 sections, 1 theorem, 12 equations, 14 figures, 2 algorithms.

Introduction
The Prognostic Model and Parameter Estimation
The Federated Feature Extraction Algorithm
Feature Extraction Using Centralized Data
Federated Feature Extraction Using Decentralized Data
Federated Dominant Subspace Identification
Federated Algorithm for MFPC-scores Computation
Prognostic Model Training and Time-To-Failure Prediction
Simulation Study I
Data Generation and Benchmarking Models
Results and Analysis
Simulation Study II
Stragglers Effect
Large Scale Applications
Computational Time
...and 8 more sections

Key Result

Proposition 1

Given an uncentered matrix $\boldsymbol{X}\in\mathbb{R}^{N\times J}$ and $\boldsymbol{U}_{new}\in\mathbb{R}^{N\times K}$ whose orthonormal columns span the $K$-dimensional dominant subspace of $\boldsymbol{X}$, the MFPC-scores discussed in Section sec: sub: para_center can be computed from $\boldsym

Figures (14)

Figure 1: Overview of proposed federated feature extraction method
Figure 2: The prediction errors of federated and non-federated models.
Figure 3: The prediction errors of federated model and individual model (User 1)
Figure 4: The prediction errors of federated model and individual model (User 2)
Figure 5: The prediction errors of federated model and individual model (User 3)
...and 9 more figures

Theorems & Definitions (1)

Proposition 1

A Federated Data Fusion-Based Prognostic Model for Applications with Multi-Stream Incomplete Signals

TL;DR

Abstract

A Federated Data Fusion-Based Prognostic Model for Applications with Multi-Stream Incomplete Signals

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (14)

Theorems & Definitions (1)