Improved prediction of future user activity in online A/B testing
Lorenzo Masoero, Mario Beraha, Thomas Richardson, Stefano Favaro
TL;DR
This work tackles the problem of predicting future user activity in online A/B tests, including both the number of new users and their re-trigger rates. It introduces a Bayesian nonparametric model built on a negative-binomial likelihood and a negative-binomial scaled-stable process prior (NBP with SBSP), enabling exact posterior and predictive forms without MCMC. The authors derive closed-form expressions for key quantities, provide an urn-based sampling scheme for efficient simulation, and propose two hyperparameter-estimation strategies (maximum marginal likelihood and regression-based). Empirical results on synthetic and real data show the BNP predictor is competitive with, and often superior to, existing approaches for predicting future activity and total retrigger rates, with clear implications for experiment duration planning and cost reduction in online A/B testing. The work also demonstrates the practical viability of BNP methods in large-scale online experiments and outlines promising avenues for extending predictions to more granular re-trigger dynamics and long-term effects.
Abstract
In online randomized experiments or A/B tests, accurate predictions of participant inclusion rates are of paramount importance. These predictions not only guide experimenters in optimizing the experiment's duration but also enhance the precision of treatment effect estimates. In this paper we present a novel, straightforward, and scalable Bayesian nonparametric approach for predicting the rate at which individuals will be exposed to interventions within the realm of online A/B testing. Our approach stands out by offering dual prediction capabilities: it forecasts both the quantity of new customers expected in future time windows and, unlike available alternative methods, the number of times they will be observed. We derive closed-form expressions for the posterior distributions of the quantities needed to form predictions about future user activity, thereby bypassing the need for numerical algorithms such as Markov chain Monte Carlo. After a comprehensive exposition of our model, we test its performance on experiments on real and simulated data, where we show its superior performance with respect to existing alternatives in the literature.
