Non-Bayesian Learning in Misspecified Models
Sebastian Bervoets, Mathieu Faure, Ludovic Renou
TL;DR
This paper shows that in misspecified learning problems, a conservative, non_Bayesian updating rule can yield better predictive performance than standard Bayesian updating. By formulating the update as a Robbins–Monro algorithm and analyzing the associated ODE, the authors establish that the agent’s beliefs converge to a convex mixture of candidate models that maximizes the cross-entropy with respect to the true data-generating process, effectively learning the closest mixture to $p^*$ in KL terms. The key technical contribution is the Lyapunov structure provided by the cross-entropy $V(q)$ and the decomposition of the zero set $E$ into convex components, with a unique top component $C_{k^*}$ attracting the dynamics under mild conditions on the updating weights. The paper also discusses robustness to different weight schemes, contrasts with generalized Bayes, and comments on potential overreaction, linking the results to broader literature on learning under misspecification and information-theoretic decision rules. Overall, non_Bayesian updating can be advantageous in misspecified environments and offers computational or stability benefits when using restricted model families that approximate the truth well via mixtures.
Abstract
Deviations from Bayesian updating are traditionally categorized as biases, errors, or fallacies, thus implying their inherent ``sub-optimality.'' We offer a more nuanced view. We demonstrate that, in learning problems with misspecified models, non-Bayesian updating can outperform Bayesian updating.
