Taxonomy-based Negative Sampling In Personalized Semantic Search for E-commerce
Uthman Jinadu, Siawpeng Er, Le Yu, Chen Liang, Bingxin Li, Yi Ding, Aleksandar Velkoski
TL;DR
The paper tackles semantic product retrieval in large-scale e-commerce by introducing a taxonomy-based hard-negative sampling (TB-HNS) strategy and a personalized semantic engine. It deploys a two-tower bi-encoder with a personalized variant that fuses user demographics and past purchases, trained with Multiple Negative Ranking Loss and TB-HNS to learn fine-grained distinctions among similar items. Offline results show recall gains over BM25, DistilledBERT, and ANCE, while online A/B tests reveal improvements in Conversion Rate, Add-to-Cart Rate, and Average Order Value, all within latency targets. The work demonstrates generalization to a public dataset, improves cold-start item retrieval, and offers practical deployment lessons for large-scale e-commerce search systems with reduced negative sampling overhead.
Abstract
Large retail outlets offer products that may be domain-specific, and this requires having a model that can understand subtle differences in similar items. Sampling techniques used to train these models are most of the time, computationally expensive or logistically challenging. These models also do not factor in users' previous purchase patterns or behavior, thereby retrieving irrelevant items for them. We present a semantic retrieval model for e-commerce search that embeds queries and products into a shared vector space and leverages a novel taxonomy-based hard-negative sampling(TB-HNS) strategy to mine contextually relevant yet challenging negatives. To further tailor retrievals, we incorporate user-level personalization by modeling each customer's past purchase history and behavior. In offline experiments, our approach outperforms BM25, ANCE and leading neural baselines on Recall@K, while live A/B testing shows substantial uplifts in conversion rate, add-to-cart rate, and average order value. We also demonstrate that our taxonomy-driven negatives reduce training overhead and accelerate convergence, and we share practical lessons from deploying this system at scale.
