Table of Contents
Fetching ...

Fault Detection and Monitoring using a Data-Driven Information-Based Strategy: Method, Theory, and Application

Camilo Ramírez, Jorge F. Silva, Ferhat Tamssaouet, Tomás Rojas, Marcos E. Orchard

TL;DR

This work reframes fault detection as a model-drift problem within additive noise models and proposes a distribution-free, mutual-information–based detector that does not require faulty training data. Central to the method is the Residual Information Value (RIV), which measures the dependency between the regression input and the residual through a nonparametric estimate of mutual information $I(X;R)$. The authors prove strong consistency, finite-sample exponential-detection guarantees under healthy conditions, and power/level convergence, providing rigorous performance guarantees across a broad class of systems. Empirically, the approach is validated on synthetic ANMs and a realistic N-CMAPSS turbofan dataset, with extensions to richer residual-information features (RIFs) for diagnosis and prognostics. The framework offers a practical, unsupervised fault-detection tool applicable to nonlinear, non-Gaussian settings and deployed models, with strong relevance for PHM and system monitoring.

Abstract

The ability to detect when a system undergoes an incipient fault is of paramount importance in preventing a critical failure. Classic methods for fault detection (including model-based and data-driven approaches) rely on thresholding error statistics or simple input-residual dependencies but face difficulties with non-linear or non-Gaussian systems. Behavioral methods (e.g., those relying on digital twins) address these difficulties but still face challenges when faulty data is scarce, decision guarantees are required, or working with already-deployed models is required. In this work, we propose an information-driven fault detection method based on a novel concept drift detector, addressing these challenges. The method is tailored to identifying drifts in input-output relationships of additive noise models (i.e., model drifts) and is based on a distribution-free mutual information (MI) estimator. Our scheme does not require prior faulty examples and can be applied distribution-free over a large class of system models. Our core contributions are twofold. First, we demonstrate the connection between fault detection, model drift detection, and testing independence between two random variables. Second, we prove several theoretical properties of the proposed MI-based fault detection scheme: (i) strong consistency, (ii) exponentially fast detection of the non-faulty case, and (iii) control of both significance levels and power of the test. To conclude, we validate our theory with synthetic data and the benchmark dataset N-CMAPSS of aircraft turbofan engines. These empirical results support the usefulness of our methodology in many practical and realistic settings, and the theoretical results show performance guarantees that other methods cannot offer.

Fault Detection and Monitoring using a Data-Driven Information-Based Strategy: Method, Theory, and Application

TL;DR

This work reframes fault detection as a model-drift problem within additive noise models and proposes a distribution-free, mutual-information–based detector that does not require faulty training data. Central to the method is the Residual Information Value (RIV), which measures the dependency between the regression input and the residual through a nonparametric estimate of mutual information . The authors prove strong consistency, finite-sample exponential-detection guarantees under healthy conditions, and power/level convergence, providing rigorous performance guarantees across a broad class of systems. Empirically, the approach is validated on synthetic ANMs and a realistic N-CMAPSS turbofan dataset, with extensions to richer residual-information features (RIFs) for diagnosis and prognostics. The framework offers a practical, unsupervised fault-detection tool applicable to nonlinear, non-Gaussian settings and deployed models, with strong relevance for PHM and system monitoring.

Abstract

The ability to detect when a system undergoes an incipient fault is of paramount importance in preventing a critical failure. Classic methods for fault detection (including model-based and data-driven approaches) rely on thresholding error statistics or simple input-residual dependencies but face difficulties with non-linear or non-Gaussian systems. Behavioral methods (e.g., those relying on digital twins) address these difficulties but still face challenges when faulty data is scarce, decision guarantees are required, or working with already-deployed models is required. In this work, we propose an information-driven fault detection method based on a novel concept drift detector, addressing these challenges. The method is tailored to identifying drifts in input-output relationships of additive noise models (i.e., model drifts) and is based on a distribution-free mutual information (MI) estimator. Our scheme does not require prior faulty examples and can be applied distribution-free over a large class of system models. Our core contributions are twofold. First, we demonstrate the connection between fault detection, model drift detection, and testing independence between two random variables. Second, we prove several theoretical properties of the proposed MI-based fault detection scheme: (i) strong consistency, (ii) exponentially fast detection of the non-faulty case, and (iii) control of both significance levels and power of the test. To conclude, we validate our theory with synthetic data and the benchmark dataset N-CMAPSS of aircraft turbofan engines. These empirical results support the usefulness of our methodology in many practical and realistic settings, and the theoretical results show performance guarantees that other methods cannot offer.
Paper Structure (69 sections, 6 theorems, 56 equations, 15 figures, 7 tables, 1 algorithm)

This paper contains 69 sections, 6 theorems, 56 equations, 15 figures, 7 tables, 1 algorithm.

Key Result

Lemma 1

Let $X$ and $Y$ be r.v.s taking values in $\mathbb{R}^p$ and $\mathbb{R}^q$, respectively. There exists a random variable $W\sim\mathrm{Uniform}([0,1])$, independent of $X$, and a measurable function $f:\mathbb{R}^p\times[0,1]\rightarrow\mathbb{R}^q$, such that $(X,Y)\overset{\textup{a.s.}}{=}(X,f(X

Figures (15)

  • Figure 1: Diagrams of the abstraction stages of our formal framework.
  • Figure 2: Block diagram for the pipeline induced by our family of decision rules: $\psi_{b_n,d_n,a_n}^{\lambda, n}(\cdot)$.
  • Figure 3: Numerical results for our MI-based model drift detection method and baselines on parametrized drifts.
  • Figure 4: Numerical results for the RIV method and baselines on parametrized drifts over AR systems.
  • Figure 5: ROC curves for different systems with a fixed target RIV of $\gamma=0.1\:\text{bits}$ and $n\in\{50,100,250,500\}$.
  • ...and 10 more figures

Theorems & Definitions (14)

  • definition 1: Adapted from gretton2010consistent and gonzalez2021indtest
  • definition 2: Adapted from lehmann1986testing and gonzalez2021indtest
  • definition 3: Adapted from lehmann1986testing
  • Lemma 1: see austin2015exchangeable
  • definition 4
  • definition 5
  • remark 1
  • Theorem 1
  • Theorem 2
  • Theorem 3
  • ...and 4 more