Proper Scoring Rules for Multivariate Probabilistic Forecasts based on Aggregation and Transformation
Romain Pic, Clément Dombry, Philippe Naveau, Maxime Taillardat
TL;DR
The paper addresses the challenge of interpreting multivariate forecast verification by introducing a framework built on transformation and aggregation to construct interpretable proper scoring rules. By combining transformations (e.g., projections, variograms, wavelets) with aggregation across margins, patches, or scales, the approach yields scores that target specific forecast features such as dependence structure and spatial anisotropy while preserving propriety. Through theoretical exposition and simulation, the authors show that these scores can discriminate misspecifications more insightfully than conventional multivariate scores and can bridge the gap between probabilistic scoring and spatial verification tools. The framework thus offers a practical path to more informative forecast verification, with relevance to weather, climate, and ML-based forecasting systems.
Abstract
Proper scoring rules are an essential tool to assess the predictive performance of probabilistic forecasts. However, propriety alone does not ensure an informative characterization of predictive performance and it is recommended to compare forecasts using multiple scoring rules. With that in mind, interpretable scoring rules providing complementary information are necessary. We formalize a framework based on aggregation and transformation to build interpretable multivariate proper scoring rules. Aggregation-and-transformation-based scoring rules are able to target specific features of the probabilistic forecasts; which improves the characterization of the predictive performance. This framework is illustrated through examples taken from the literature and studied using numerical experiments showcasing its benefits. In particular, it is shown that it can help bridge the gap between proper scoring rules and spatial verification tools.
