Probabilistic Neural Networks (PNNs) for Modeling Aleatoric Uncertainty in Scientific Machine Learning
Farhad Pourkamali-Anaraki, Jamal F. Husseini, Scott E. Stapleton
TL;DR
The work addresses modeling aleatoric uncertainty in scientific machine learning by replacing the deterministic output with a trainable Gaussian distribution characterized by a mean $f_ extmu$ and variance $f_ extσ$, optimized via the negative log-likelihood. It introduces KL-divergence as a principled metric for neural-architecture search, benchmarking PNNs against Gaussian process regression and demonstrating superior handling of heteroscedastic data in both synthetic tests and a real materials-science case. Key findings show that PNNs can achieve mean predictions with $R^2$ around $0.97$ and predictive intervals with high observed correlation (≈ 0.80), while GPR struggles with heteroscedastic uncertainty in these settings. The approach provides robust, distribution-aware surrogate modeling for scientific problems and suggests avenues for integrating epistemic uncertainty and active learning in future work.
Abstract
This paper investigates the use of probabilistic neural networks (PNNs) to model aleatoric uncertainty, which refers to the inherent variability in the input-output relationships of a system, often characterized by unequal variance or heteroscedasticity. Unlike traditional neural networks that produce deterministic outputs, PNNs generate probability distributions for the target variable, allowing the determination of both predicted means and intervals in regression scenarios. Contributions of this paper include the development of a probabilistic distance metric to optimize PNN architecture, and the deployment of PNNs in controlled data sets as well as a practical material science case involving fiber-reinforced composites. The findings confirm that PNNs effectively model aleatoric uncertainty, proving to be more appropriate than the commonly employed Gaussian process regression for this purpose. Specifically, in a real-world scientific machine learning context, PNNs yield remarkably accurate output mean estimates with R-squared scores approaching 0.97, and their predicted intervals exhibit a high correlation coefficient of nearly 0.80, closely matching observed data intervals. Hence, this research contributes to the ongoing exploration of leveraging the sophisticated representational capacity of neural networks to delineate complex input-output relationships in scientific problems.
