Analytical results for uncertainty propagation through trained machine learning regression models
Andrew Thompson
TL;DR
This work develops analytical expressions for propagating input uncertainty through fixed regression ML models, covering linear, ridge, SVM and four kernel-based methods (kernel ridge, Gaussian Processes, kernel SVM, and Relevance Vector Machines). By deriving mean and variance formulas for predictions under various input distributions (Gaussian, uniform, symmetric triangular) and kernel choices (notably the RBF kernel), the paper demonstrates when analytical propagation outperforms Monte Carlo sampling in both accuracy and efficiency. The authors validate their framework with a lithium-ion battery state-of-health case study using EIS data, showing close agreement with Monte Carlo results and highlighting computational advantages in metrology contexts. Limitations include the focus on fixed models and RBF kernels, with future work aimed at numerical stability, extending to additional model families, and incorporating model uncertainty via variance decomposition.
Abstract
Machine learning (ML) models are increasingly being used in metrology applications. However, for ML models to be credible in a metrology context they should be accompanied by principled uncertainty quantification. This paper addresses the challenge of uncertainty propagation through trained/fixed machine learning (ML) regression models. Analytical expressions for the mean and variance of the model output are obtained/presented for certain input data distributions and for a variety of ML models. Our results cover several popular ML models including linear regression, penalised linear regression, kernel ridge regression, Gaussian Processes (GPs), support vector machines (SVMs) and relevance vector machines (RVMs). We present numerical experiments in which we validate our methods and compare them with a Monte Carlo approach from a computational efficiency point of view. We also illustrate our methods in the context of a metrology application, namely modelling the state-of-health of lithium-ion cells based upon Electrical Impedance Spectroscopy (EIS) data
