RULSurv: A probabilistic survival-based method for early censoring-aware prediction of remaining useful life in ball bearings
Christian Marius Lillelund, Fernando Pannullo, Morten Opprud Jakobsen, Manuel Morante, Christian Fischer Pedersen
TL;DR
RULSurv tackles censoring in bearing RUL estimation by coupling a KL-divergence–based event detector with survival-analysis models that naturally handle right-censoring. The method labels the onset of degradation via spectral-distance changes across bearing-frequency bands and then learns individual survival distributions $S(t|oldsymbol{x})$ using five models, including CoxPH, GBSA, RSF, MTLR, and BNNSurv, ensuring monotonic RUL predictions via the survival curve. Bayesian approaches (e.g., BNNSurv with Monte Carlo Dropout) provide credible intervals for uncertainty, while cross-validated experiments on the XJTU-SY dataset show competitive MAEs and state-of-the-art CRA under high load. The work demonstrates that incorporating censored data improves predictive accuracy and enables time-to-failure predictions with probabilistic interpretation, offering actionable insight for predictive maintenance.
Abstract
Predicting the remaining useful life (RUL) of ball bearings is an active area of research, where novel machine learning techniques are continuously being applied to predict degradation trends and anticipate failures before they occur. However, few studies have explicitly addressed the challenge of handling censored data, where information about a specific event (\eg mechanical failure) is incomplete or only partially observed. To address this issue, we introduce a novel and flexible method for early fault detection using Kullback-Leibler (KL) divergence and RUL estimation using survival analysis that naturally supports censored data. We demonstrate our approach in the XJTU-SY dataset using a 5-fold cross-validation strategy across three different operating conditions. When predicting the time to failure for bearings under the highest load (C1, 12.0 kN and 2100 RPM) with 25% random censoring, our approach achieves a mean absolute error (MAE) of 14.7 minutes (95% CI = 13.6-15.8) using a linear CoxPH model, and an MAE of 12.6 minutes (95% CI = 11.8-13.4) using a nonlinear Random Survival Forests model, compared to an MAE of 18.5 minutes (95% CI = 17.4-19.6) using a linear LASSO model that does not support censoring. Moreover, our approach achieves a mean cumulative relative accuracy (CRA) of 0.7586 over 5 bearings under the highest load, which improves over several state-of-the-art baselines. Our work highlights the importance of considering censored data as part of the model design when building predictive models for early fault detection and RUL estimation.
