Probabilistic Neural Networks (PNNs) with t-Distributed Outputs: Adaptive Prediction Intervals Beyond Gaussian Assumptions
Farhad Pourkamali-Anaraki
TL;DR
This work tackles the problem of uncertainty quantification in regression under non-Gaussian output distributions. It introduces TDistrNN, a Probabilistic Neural Network that outputs a three-parameter Student's $t$-distribution ($f_\mu$, $f_\sigma$, $f_\nu$) to capture location, scale, and tail heaviness, and derives a tailored negative log-likelihood loss with analytical gradients for backpropagation. By enabling adaptive tail behavior through the degrees of freedom $f_\nu$, the approach yields prediction intervals that are narrower than Gaussian-based methods while preserving target coverage, as demonstrated on synthetic data and real-world benchmarks (Concrete Compressive Strength, Energy Efficiency, and Student Performance). The results highlight improved uncertainty quantification in regression tasks with heavy-tailed and outlier-prone data, offering a robust alternative to Gaussian assumptions for practical decision-making.
Abstract
Traditional neural network regression models provide only point estimates, failing to capture predictive uncertainty. Probabilistic neural networks (PNNs) address this limitation by producing output distributions, enabling the construction of prediction intervals. However, the common assumption of Gaussian output distributions often results in overly wide intervals, particularly in the presence of outliers or deviations from normality. To enhance the adaptability of PNNs, we propose t-Distributed Neural Networks (TDistNNs), which generate t-distributed outputs, parameterized by location, scale, and degrees of freedom. The degrees of freedom parameter allows TDistNNs to model heavy-tailed predictive distributions, improving robustness to non-Gaussian data and enabling more adaptive uncertainty quantification. We develop a novel loss function tailored for the t-distribution and derive efficient gradient computations for seamless integration into deep learning frameworks. Empirical evaluations on synthetic and real-world data demonstrate that TDistNNs improve the balance between coverage and interval width. Notably, for identical architectures, TDistNNs consistently produce narrower prediction intervals than Gaussian-based PNNs while maintaining proper coverage. This work contributes a flexible framework for uncertainty estimation in neural networks tasked with regression, particularly suited to settings involving complex output distributions.
