Learning to Comparison-Shop
Jie Tang, Daochen Zha, Xin Liu, Huiji Gao, Liwei He, Stephanie Moyerman, Sanjeev Katariya
TL;DR
This paper tackles the mismatch between traditional search ranking and users' comparison-shopping behavior in online marketplaces by introducing Learning-to-Comparison-Shop (LTCS). LTCS jointly trains a lightweight pointwise initial ranker and a setwise transformer-based re-ranker to emulate the two-stage decision process of evaluation followed by comparison, using a behavior-aligned co-training objective. In production at Airbnb, LTCS yields offline NDCG gains and an online booking conversion-rate improvement (+0.6%), along with enhanced user efficiency during the search process. The work demonstrates that behavior-aligned, two-stage training can outperform strong baselines and offers a practical blueprint for deploying complex ranking systems in large-scale marketplaces.
Abstract
In online marketplaces like Airbnb, users frequently engage in comparison shopping before making purchase decisions. Despite the prevalence of this behavior, a significant disconnect persists between mainstream e-commerce search engines and users' comparison needs. Traditional ranking models often evaluate items in isolation, disregarding the context in which users compare multiple items on a search results page. While recent advances in deep learning have sought to improve ranking accuracy, diversity, and fairness by encoding listwise context, the challenge of aligning search rankings with user comparison shopping behavior remains inadequately addressed. In this paper, we propose a novel ranking architecture - Learning-to-Comparison-Shop (LTCS) System - that explicitly models and learns users' comparison shopping behaviors. Through extensive offline and online experiments, we demonstrate that our approach yields statistically significant gains in key business metrics - improving NDCG by 1.7% and boosting booking conversion rate by 0.6% in A/B testing - while also enhancing user experience. We also compare our model against state-of-the-art approaches and demonstrate that LTCS significantly outperforms them.
