Mixability of Integral Losses: a Key to Efficient Online Aggregation of Functional and Probabilistic Forecasts
Alexander Korotin, Vladimir V'yugin, Evgeny Burnaev
TL;DR
This work extends online prediction with expert advice to function-valued forecasts by introducing X-integral losses and proving that η-mixable (exp-concave) losses induce η-mixable (exp-concave) integral losses. The authors establish a general transfer theorem and apply it to a wide class of probabilistic-forecasting losses, including CRPS, SCRPS, energy-based distance, KL, Beta-2, CFD, MMD, OT, and SW2, deriving explicit aggregation rules and learning-rate guarantees. The main theoretical contribution is the transfer of mixability/exp-concavity to the integral setting, enabling the Aggregating Algorithm to achieve constant regret across function-valued predictions. This provides a principled, scalable framework for online probabilistic forecasting and distributional learning, with practical aggregation via simple mixtures or Wasserstein barycenters.
Abstract
In this paper we extend the setting of the online prediction with expert advice to function-valued forecasts. At each step of the online game several experts predict a function, and the learner has to efficiently aggregate these functional forecasts into a single forecast. We adapt basic mixable (and exponentially concave) loss functions to compare functional predictions and prove that these adaptations are also mixable (exp-concave). We call this phenomenon mixability (exp-concavity) of integral loss functions. As an application of our main result, we prove that various loss functions used for probabilistic forecasting are mixable (exp-concave). The considered losses include Sliced Continuous Ranked Probability Score, Energy-Based Distance, Optimal Transport Costs and Sliced Wasserstein-2 distance, Beta-2 and Kullback-Leibler divergences, Characteristic function and Maximum Mean Discrepancies.
