BayesCNS: A Unified Bayesian Approach to Address Cold Start and Non-Stationarity in Search Systems at Scale
Randy Ardywibowo, Rakesh Sunki, Lucy Kuo, Sankalp Nayak
TL;DR
BayesCNS tackles cold-start and non-stationarity in large-scale information retrieval by learning an expressive empirical Bayesian prior over user-item interactions via a neural network that outputs Gamma-Poisson parameters. This prior, parameterized as $p(x|z)$ with $x$ following a Gamma-Poisson mixture (equivalently a Negative Binomial with $r=oldsymbol{\\alpha}$ and $\rho=\boldsymbol{\beta}/(1+\boldsymbol{\beta})=\boldsymbol{\sigma}(\boldsymbol{\zeta})$), is updated online using Thompson sampling to balance exploration and exploitation in ranking. The prior network outputs $\boldsymbol{\alpha}$ and $\boldsymbol{\zeta}$ (via $\log\boldsymbol{\alpha}$) from contextual features $\boldsymbol{z}$, enabling an efficient, ranker-guided online learning loop that updates posterior parameters $\boldsymbol{\alpha}_t(\boldsymbol{z})$ and $\boldsymbol{\beta}_t(\boldsymbol{z})$ under non-stationarity with a tunable rate parameter $\gamma$. Experiments across stationary and non-stationary simulations, benchmark datasets, and a large-scale online A/B test demonstrate improved new-item interactions (10.60% increase) and modest but meaningful gains in overall success rate (1.05%), validating BayesCNS as a practical approach for scalable, dynamic IR systems.
Abstract
Information Retrieval (IR) systems used in search and recommendation platforms frequently employ Learning-to-Rank (LTR) models to rank items in response to user queries. These models heavily rely on features derived from user interactions, such as clicks and engagement data. This dependence introduces cold start issues for items lacking user engagement and poses challenges in adapting to non-stationary shifts in user behavior over time. We address both challenges holistically as an online learning problem and propose BayesCNS, a Bayesian approach designed to handle cold start and non-stationary distribution shifts in search systems at scale. BayesCNS achieves this by estimating prior distributions for user-item interactions, which are continuously updated with new user interactions gathered online. This online learning procedure is guided by a ranker model, enabling efficient exploration of relevant items using contextual information provided by the ranker. We successfully deployed BayesCNS in a large-scale search system and demonstrated its efficacy through comprehensive offline and online experiments. Notably, an online A/B experiment showed a 10.60% increase in new item interactions and a 1.05% improvement in overall success metrics over the existing production baseline.
