Epinet for Content Cold Start
Hong Jun Jeon, Songbin Liu, Yuantong Li, Jie Lyu, Hunter Song, Ji Liu, Peng Wu, Zheqing Zhu
TL;DR
This work tackles the exploration-exploitation challenge in content recommendation under content cold-start by employing Epinet-based approximate Thompson sampling to model epistemic uncertainty in neural models. The authors integrate an Epinet atop a base predictor to produce posterior samples efficiently, enabling online deployment in Facebook Reels during the retrieval stage. They demonstrate a 17% uplift in impressions and improvements in like rates and video completion, particularly for content with few prior impressions, indicating stronger exploration of new content. The study highlights practical benefits for scalable uncertainty quantification in large-scale online systems and suggests avenues for extending these ideas to ranking and reinforcement learning settings.
Abstract
The exploding popularity of online content and its user base poses an evermore challenging matching problem for modern recommendation systems. Unlike other frontiers of machine learning such as natural language, recommendation systems are responsible for collecting their own data. Simply exploiting current knowledge can lead to pernicious feedback loops but naive exploration can detract from user experience and lead to reduced engagement. This exploration-exploitation trade-off is exemplified in the classic multi-armed bandit problem for which algorithms such as upper confidence bounds (UCB) and Thompson sampling (TS) demonstrate effective performance. However, there have been many challenges to scaling these approaches to settings which do not exhibit a conjugate prior structure. Recent scalable approaches to uncertainty quantification via epinets have enabled efficient approximations of Thompson sampling even when the learning model is a complex neural network. In this paper, we demonstrate the first application of epinets to an online recommendation system. Our experiments demonstrate improvements in both user traffic and engagement efficiency on the Facebook Reels online video platform.
