Information Discovery in e-Commerce
Zhaochun Ren, Xiangnan He, Dawei Yin, Maarten de Rijke
TL;DR
This survey comprehensively maps Information Discovery in e-commerce, detailing how search, recommendation, QA, and conversational AI collaborate with user modeling and interface design to enable efficient product discovery. It presents a unified framework covering data modalities, evaluation metrics, and a two-stage pipeline for recommendation, while highlighting emerging directions such as graph-based user modeling, multi-modal and generative retrieval, and large language model integration. Its contributions include a systematic taxonomy of tasks, standardized notation for cross-model comparisons, and an extensive dataset inventory to support reproducible research. The work is significant for practitioners and researchers aiming to optimize GMV-driven e-commerce experiences through integrated IR and language technologies, with emphasis on fairness, explainability, and privacy in future deployments.
Abstract
Electronic commerce, or e-commerce, is the buying and selling of goods and services, or the transmitting of funds or data online. E-commerce platforms come in many kinds, with global players such as Amazon, Airbnb, Alibaba, eBay and platforms targeting specific geographic regions. Information retrieval has a natural role to play in e-commerce, especially in connecting people to goods and services. Information discovery in e-commerce concerns different types of search (e.g., exploratory search vs. lookup tasks), recommender systems, and natural language processing in e-commerce portals. The rise in popularity of e-commerce sites has made research on information discovery in e-commerce an increasingly active research area. This is witnessed by an increase in publications and dedicated workshops in this space. Methods for information discovery in e-commerce largely focus on improving the effectiveness of e-commerce search and recommender systems, on enriching and using knowledge graphs to support e-commerce, and on developing innovative question answering and bot-based solutions that help to connect people to goods and services. In this survey, an overview is given of the fundamental infrastructure, algorithms, and technical solutions for information discovery in e-commerce. The topics covered include user behavior and profiling, search, recommendation, and language technology in e-commerce.
