Bayesian Ensembling: Insights from Online Optimization and Empirical Bayes
Daniel Waxman, Fernando Llorente, Petar M. Djurić
TL;DR
This work investigates online Bayesian ensembling, contrasting Bayesian model averaging (BMA) with Bayesian stacking (BS) and introducing Online Bayesian Stacking (OBS) as an online, empirical Bayes–inspired alternative. By reframing OBS as online portfolio selection (OPS), the authors leverage regret analysis and efficient online convex optimization (OCO) algorithms (e.g., Exponentiated Gradient, Online Newton Step) to derive performance guarantees and practical guidance. They establish that OBS often outperforms online BMA (O-BMA) and dynamic model averaging (DMA), especially in nonstationary or M-open settings, while providing a hybrid approach when M-closed assumptions may hold. Through extensive experiments across subset linear regression, online variational inference, and time-series forecasting, the paper demonstrates OBS’s robustness and practical advantages, offering actionable recommendations on when and how to deploy OBS in online Bayesian learning. The work bridges Bayesian ensemble learning with portfolio theory, enabling principled, scalable online inference for modern Bayesian models.
Abstract
We revisit the classical problem of Bayesian ensembles and address the challenge of learning optimal combinations of Bayesian models in an online, continual learning setting. To this end, we reinterpret existing approaches such as Bayesian model averaging (BMA) and Bayesian stacking through a novel empirical Bayes lens, shedding new light on the limitations and pathologies of BMA. Further motivated by insights from online optimization, we propose Online Bayesian Stacking (OBS), a method that optimizes the log-score over predictive distributions to adaptively combine Bayesian models. A key contribution of our work is establishing a novel connection between OBS and portfolio selection, bridging Bayesian ensemble learning with a rich, well-studied theoretical framework that offers efficient algorithms and extensive regret analysis. We further clarify the relationship between OBS and online BMA, showing that they optimize related but distinct cost functions. Through theoretical analysis and empirical evaluation, we identify scenarios where OBS outperforms online BMA and provide principled methods and guidance on when practitioners should prefer one approach over the other.
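To make the contrast between OBS and online BMA concrete, the following is a minimal sketch and not the authors' implementation. It assumes each of K models supplies a one-step-ahead predictive density for every observation, collected in a hypothetical array `pred_dens` of shape (T, K). For OBS it uses the standard Exponentiated Gradient update from online portfolio selection, with an assumed step size `eta`. For online BMA it uses the usual Bayes-rule reweighting by predictive likelihood. The function names and default values are illustrative only.

```python
import numpy as np


def obs_exponentiated_gradient(pred_dens, eta=0.1):
    """Online stacking weights via Exponentiated Gradient (illustrative sketch).

    pred_dens[t, k] is assumed to hold model k's predictive density
    evaluated at the observation y_t. eta is a hypothetical step size.
    """
    T, K = pred_dens.shape
    w = np.full(K, 1.0 / K)            # start from uniform weights on the simplex
    weights = np.empty((T, K))
    log_scores = np.empty(T)
    for t in range(T):
        weights[t] = w                 # weights used to predict y_t
        mix = w @ pred_dens[t]         # ensemble predictive density at y_t
        log_scores[t] = np.log(mix)    # log-score of the ensemble
        grad = pred_dens[t] / mix      # gradient of log(w . p_t) with respect to w
        # Multiplicative update; subtracting the max only rescales before normalizing.
        w = w * np.exp(eta * (grad - grad.max()))
        w = w / w.sum()
    return weights, log_scores


def online_bma(pred_dens):
    """Online BMA weights: posterior model probabilities under a uniform prior."""
    T, K = pred_dens.shape
    log_w = np.full(K, -np.log(K))     # log prior model probabilities
    weights = np.empty((T, K))
    log_scores = np.empty(T)
    for t in range(T):
        w = np.exp(log_w - log_w.max())
        w = w / w.sum()
        weights[t] = w                 # weights used to predict y_t
        log_scores[t] = np.log(w @ pred_dens[t])
        log_w = log_w + np.log(pred_dens[t])   # Bayes rule in log space
    return weights, log_scores
```

Both functions return the weight trajectory and the per-step ensemble log-score, so the cumulative log-scores of the two schemes can be compared on the same sequence of predictive densities. The only difference between the two loops is the update rule: EG takes a multiplicative gradient step on the log-score of the mixture, whereas online BMA multiplies each weight by that model's own predictive likelihood.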
