Efficient pooling of predictions via kernel embeddings

Sam Allen; David Ginsbourger; Johanna Ziegel

Efficient pooling of predictions via kernel embeddings

Sam Allen, David Ginsbourger, Johanna Ziegel

TL;DR

By embedding predictions into a Reproducing Kernel Hilbert Space (RKHS), it is illustrated that estimating the linear pool weights that optimise kernel-based scoring rules is a convex quadratic optimisation problem, which permits an efficient implementation of the linear pool when optimally combining predictions on arbitrary outcome domains.

Abstract

Probabilistic predictions are probability distributions over the set of possible outcomes. Such predictions quantify the uncertainty in the outcome, making them essential for effective decision making. By combining multiple predictions, the information sources used to generate the predictions are pooled, often resulting in a more informative forecast. Probabilistic predictions are typically combined by linearly pooling the individual predictive distributions; this encompasses several ensemble learning techniques, for example. The weights assigned to each prediction can be estimated based on their past performance, allowing more accurate predictions to receive a higher weight. This can be achieved by finding the weights that optimise a proper scoring rule over some training data. By embedding predictions into a Reproducing Kernel Hilbert Space (RKHS), we illustrate that estimating the linear pool weights that optimise kernel-based scoring rules is a convex quadratic optimisation problem. This permits an efficient implementation of the linear pool when optimally combining predictions on arbitrary outcome domains. This result also holds for other combination strategies, and we additionally study a flexible generalisation of the linear pool that overcomes some of its theoretical limitations, whilst allowing an efficient implementation within the RKHS framework. These approaches are compared in an application to operational wind speed forecasts, where this generalisation is found to offer substantial improvements upon the traditional linear pool.

Efficient pooling of predictions via kernel embeddings

TL;DR

Abstract

Efficient pooling of predictions via kernel embeddings

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (11)