Assessment of the quality of a prediction

Roger Sewell

Assessment of the quality of a prediction

Roger Sewell

TL;DR

The paper argues that the true mutual information $I(x;y)$ is ill-suited for evaluating a prediction algorithm’s output, and promotes the Apparent Shannon Information $J(x;Q_y)$ as the appropriate, uniquely characterized metric. It develops a Bayesian framework using Dirichlet-based mixtures of skew-Student distributions to model the distribution of $j(x,y)=\log\left(\frac{Q_y(x)}{P(x)}\right)$ and to infer the posterior uncertainty in $J(x;Q_y)$, addressing heavy-tailed, asymmetric behavior. The method is illustrated on a Bayesian model predicting the recurrence time of prostate cancer, and is presented as generally applicable to problems where the explicit distribution of $j(x,y)$ is intractable. The work provides a principled approach to uncertainty quantification for prediction quality and offers guidance for comparing Bayesian predictive algorithms under unseen data, with practical implications for model design and evaluation. Overall, it contributes a rigorous, adaptable framework for assessing high-quality predictions beyond simple point metrics.

Abstract

Shannon defined the mutual information between two variables. We illustrate why the true mutual information between a variable and the predictions made by a prediction algorithm is not a suitable measure of prediction quality, but the apparent Shannon mutual information (ASI) is; indeed it is the unique prediction quality measure with either of two very different lists of desirable properties, as previously shown by de Finetti and other authors. However, estimating the uncertainty of the ASI is a difficult problem, because of long and non-symmetric heavy tails to the distribution of the individual values of $j(x,y)=\log\frac{Q_y(x)}{P(x)}$ We propose a Bayesian modelling method for the distribution of $j(x,y)$, from the posterior distribution of which the uncertainty in the ASI can be inferred. This method is based on Dirichlet-based mixtures of skew-Student distributions. We illustrate its use on data from a Bayesian model for prediction of the recurrence time of prostate cancer. We believe that this approach is generally appropriate for most problems, where it is infeasible to derive the explicit distribution of the samples of $j(x,y)$, though the precise modelling parameters may need adjustment to suit particular cases.

Assessment of the quality of a prediction

TL;DR

The paper argues that the true mutual information

is ill-suited for evaluating a prediction algorithm’s output, and promotes the Apparent Shannon Information

as the appropriate, uniquely characterized metric. It develops a Bayesian framework using Dirichlet-based mixtures of skew-Student distributions to model the distribution of

and to infer the posterior uncertainty in

, addressing heavy-tailed, asymmetric behavior. The method is illustrated on a Bayesian model predicting the recurrence time of prostate cancer, and is presented as generally applicable to problems where the explicit distribution of

is intractable. The work provides a principled approach to uncertainty quantification for prediction quality and offers guidance for comparing Bayesian predictive algorithms under unseen data, with practical implications for model design and evaluation. Overall, it contributes a rigorous, adaptable framework for assessing high-quality predictions beyond simple point metrics.

Abstract

We propose a Bayesian modelling method for the distribution of

, from the posterior distribution of which the uncertainty in the ASI can be inferred. This method is based on Dirichlet-based mixtures of skew-Student distributions. We illustrate its use on data from a Bayesian model for prediction of the recurrence time of prostate cancer. We believe that this approach is generally appropriate for most problems, where it is infeasible to derive the explicit distribution of the samples of

, though the precise modelling parameters may need adjustment to suit particular cases.

Paper Structure (23 sections, 36 equations, 3 figures)

This paper contains 23 sections, 36 equations, 3 figures.

Introduction
Definitions and rationale
Notation
Reasoning leading to choice of Apparent Shannon Information, definitions, and basic properties
Conditions for measurement
Properties
List of basic properties
Betting outcome prediction
Relevance to design of algorithms based on Bayesian inference
Stability and estimation
A problem: point estimates based on finite datasets do not suffice
The suggested approach
Discussion
Outline proof of uniqueness
Details of modelling method used
...and 8 more sections

Figures (3)

Figure 1: Histogram of $j(x,y)$ for data points not seen during training, where the prediction algorithm is using a range of biomarkers and clinical data to predict time of recurrence of prostate cancer following radical prostatectomy.
Figure 2: Three samples of the distributions specified by the posterior distribution of the $\theta_k$ given the $j(x,y)$ data. For each (green) distribution a mean value can be calculated, since the distribution’s parameters are known. Given a large number of such samples of the mean, the posterior distribution of the mean can be reconstructed as in Figure \ref{['fig3']}.
Figure 3: The cumulative distribution of $J(x; Q_y)$ based on modelling the $j(x,y)$ data calculated from the data points unseen during training.

Assessment of the quality of a prediction

TL;DR

Abstract

Assessment of the quality of a prediction

Authors

TL;DR

Abstract

Table of Contents

Figures (3)