Accuracy estimation of neural networks by extreme value theory

Gero Junike; Marco Oesting

Accuracy estimation of neural networks by extreme value theory

Gero Junike, Marco Oesting

TL;DR

The paper addresses quantifying the tail of the neural network approximation error $\mathcal{E}=|f-\varphi|$ on compact domains. It adopts extreme value theory, showing exceedances above a high threshold $u$ follow a generalized Pareto distribution with scale $\sigma(u)$ and shape $\gamma$, and introduces a new estimator $\hat{\gamma}_{k,N}$ that remains negative with probability one, along with an estimator for the upper endpoint $x^{*}$. The authors provide plug-in expressions for tail quantities $P(\mathcal{E}>x)$ and $\mathbb{E}[\mathcal{E}-u\mid \mathcal{E}>u]$ using $\widehat{x^{*}}_{k,N}$ and $\hat{\gamma}_{k,N}$, and validate the approach on a financial pricing task (American put options), showing accurate tail probability and mean-excess estimates. The work offers a rigorous tail-risk assessment for neural network errors, enabling improved risk management for mispricing and other applications where large errors matter.

Abstract

Neural networks are able to approximate any continuous function on a compact set. However, it is not obvious how to quantify the error of the neural network, i.e., the remaining bias between the function and the neural network. Here, we propose the application of extreme value theory to quantify large values of the error, which are typically relevant in applications. The distribution of the error beyond some threshold is approximately generalized Pareto distributed. We provide a new estimator of the shape parameter of the Pareto distribution suitable to describe the error of neural networks. Numerical experiments are provided.

Accuracy estimation of neural networks by extreme value theory

TL;DR

The paper addresses quantifying the tail of the neural network approximation error

on compact domains. It adopts extreme value theory, showing exceedances above a high threshold

follow a generalized Pareto distribution with scale

and shape

, and introduces a new estimator

that remains negative with probability one, along with an estimator for the upper endpoint

. The authors provide plug-in expressions for tail quantities

and

using

and

, and validate the approach on a financial pricing task (American put options), showing accurate tail probability and mean-excess estimates. The work offers a rigorous tail-risk assessment for neural network errors, enabling improved risk management for mispricing and other applications where large errors matter.

Accuracy estimation of neural networks by extreme value theory

TL;DR

Abstract

Accuracy estimation of neural networks by extreme value theory

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (2)