Optimal Uncertainty-guided Neural Network Training
H M Dipu Kabir, Abbas Khosravi, Abdollah Kavousi-Fard, Saeid Nahavandi, Dipti Srinivasan
TL;DR
This paper addresses unreliable convergence and inconsistent PI quality in NN-based direct PI construction. It proposes a smooth, customizable loss called $CWFDC$ that minimizes PINAW, PINAFD, and a squared coverage penalty $(1-α+δ - PICP)^2$, balancing PI width, coverage, and distance from targets. The method is evaluated on wind power and electricity demand data, showing up to 99.8% training convergence and reduced PI variation compared with prior methods like LUBE, Wan, Marin, and Zhang. The approach enables user-driven trade-offs among PI sharpness, coverage, and robustness to data shifts, with practical benefits for power-grid decision-making.
Abstract
The neural network (NN)-based direct uncertainty quantification (UQ) methods have achieved the state of the art performance since the first inauguration, known as the lower-upper-bound estimation (LUBE) method. However, currently-available cost functions for uncertainty guided NN training are not always converging and all converged NNs are not generating optimized prediction intervals (PIs). Moreover, several groups have proposed different quality criteria for PIs. These raise a question about their relative effectiveness. Most of the existing cost functions of uncertainty guided NN training are not customizable and the convergence of training is uncertain. Therefore, in this paper, we propose a highly customizable smooth cost function for developing NNs to construct optimal PIs. The optimized average width of PIs, PI-failure distances and the PI coverage probability (PICP) are computed for the test dataset. The performance of the proposed method is examined for the wind power generation and the electricity demand data. Results show that the proposed method reduces variation in the quality of PIs, accelerates the training, and improves convergence probability from 99.2% to 99.8%.
