Bernstein-von Mises for Adaptively Collected Data
Kevin Du, Yash Nair, Lucas Janson
TL;DR
This work extends Bernstein–von Mises theory to adaptively collected data, showing that Bayesian uncertainty quantification becomes asymptotically equivalent to Wald-type frequentist UQ in broad adaptive settings. By proving that the posterior for the parameter vector $\beta$ concentrates around the maximum likelihood estimator with covariance $\sigma^2(\mathbf{X}_n^\top\mathbf{X}_n)^{-1}$, the paper unifies Bayesian and frequentist views under adaptivity without relying on traditional stability conditions. The contributions cover a general adaptive linear Gaussian framework, specialized results for multi-armed and contextual bandits (including exponential-family extensions), and a triangular-array formulation with thorough numerical validation highlighting when Bayesian UQ aligns with or diverges from frequentist guarantees. The findings offer practical insights into the limits of Bayesian credible intervals in adaptive experiments and motivate further work on empirical Bayes approaches and broader parametric models. Overall, the results deepen our understanding of uncertainty quantification in sequential, adaptive data collection and their implications for decision-making in safety-critical and online settings.
Abstract
Uncertainty quantification (UQ) for adaptively collected data, such as that coming from adaptive experiments, bandits, or reinforcement learning, is necessary for critical elements of data collection such as ensuring safety and conducting after-study inference. The data's adaptivity creates significant challenges for frequentist UQ, yet Bayesian UQ remains the same as if the data were independent and identically distributed (i.i.d.), making it an appealing and commonly used approach. Bayesian UQ requires the (correct) specification of a prior distribution while frequentist UQ does not, but for i.i.d. data the celebrated Bernstein-von Mises theorem shows that as the sample size grows, the prior 'washes out' and Bayesian UQ becomes frequentist-valid, implying that the choice of prior need not be a major impediment to Bayesian UQ as it makes no difference asymptotically. This paper for the first time extends the Bernstein-von Mises theorem to adaptively collected data, proving asymptotic equivalence between Bayesian UQ and Wald-type frequentist UQ in this challenging setting. Our result showing this asymptotic agreement does not require the standard stability condition required by works studying validity of Wald-type frequentist UQ; in cases where stability is satisfied, our results combined with these prior studies of frequentist UQ imply frequentist validity of Bayesian UQ. Counterintuitively however, they also provide a negative result that Bayesian UQ is not asymptotically frequentist valid when stability fails, despite the fact that the prior washes out and Bayesian UQ asymptotically matches standard Wald-type frequentist UQ. We empirically validate our theory (positive and negative) via a range of simulations.
