Table of Contents
Fetching ...

KuaiSearch: A Large-Scale E-Commerce Search Dataset for Recall, Ranking, and Relevance

Yupeng Li, Ben Chen, Mingyue Cheng, Zhiding Liu, Xuxin Zhang, Chenyi Lei, Wenwu Ou

TL;DR

KuaiSearch introduces the largest real-world e-commerce search dataset to date, built from authentic Kuaishou interactions to cover recall, ranking, and relevance with preserved user queries and plaintext product texts. It provides 331,930 users, 18.6 million items, and 2.5 million queries, including a 46,422-pair relevance subset, plus KuaiSearch-Lite for efficient benchmarking. Comprehensive analyses reveal long-tail distributions, diverse user histories, and richly annotated relevance, while benchmark experiments contrast lexical, embedding-based, and generative approaches across recall, ranking, and relevance tasks. The dataset aims to accelerate research on end-to-end, LLM-enhanced e-commerce search and to support realistic evaluation and deployment considerations in real-world platforms.

Abstract

E-commerce search serves as a central interface, connecting user demands with massive product inventories and plays a vital role in our daily lives. However, in real-world applications, it faces challenges, including highly ambiguous queries, noisy product texts with weak semantic order, and diverse user preferences, all of which make it difficult to accurately capture user intent and fine-grained product semantics. In recent years, significant advances in large language models (LLMs) for semantic representation and contextual reasoning have created new opportunities to address these challenges. Nevertheless, existing e-commerce search datasets still suffer from notable limitations: queries are often heuristically constructed, cold-start users and long-tail products are filtered out, query and product texts are anonymized, and most datasets cover only a single stage of the search pipeline. Collectively, these issues constrain research on LLM-based e-commerce search. To address these challenges, we construct and release KuaiSearch. To the best of our knowledge, it is the largest e-commerce search dataset currently available. KuaiSearch is built upon real user search interactions from the Kuaishou platform, preserving authentic user queries and natural-language product texts, covering cold-start users and long-tail products, and systematically spanning three key stages of the search pipeline: recall, ranking, and relevance judgment. We conduct a comprehensive analysis of KuaiSearch from multiple perspectives, including products, users, and queries, and establish benchmark experiments across several representative search tasks. Experimental results demonstrate that KuaiSearch provides a valuable foundation for research on real-world e-commerce search.

KuaiSearch: A Large-Scale E-Commerce Search Dataset for Recall, Ranking, and Relevance

TL;DR

KuaiSearch introduces the largest real-world e-commerce search dataset to date, built from authentic Kuaishou interactions to cover recall, ranking, and relevance with preserved user queries and plaintext product texts. It provides 331,930 users, 18.6 million items, and 2.5 million queries, including a 46,422-pair relevance subset, plus KuaiSearch-Lite for efficient benchmarking. Comprehensive analyses reveal long-tail distributions, diverse user histories, and richly annotated relevance, while benchmark experiments contrast lexical, embedding-based, and generative approaches across recall, ranking, and relevance tasks. The dataset aims to accelerate research on end-to-end, LLM-enhanced e-commerce search and to support realistic evaluation and deployment considerations in real-world platforms.

Abstract

E-commerce search serves as a central interface, connecting user demands with massive product inventories and plays a vital role in our daily lives. However, in real-world applications, it faces challenges, including highly ambiguous queries, noisy product texts with weak semantic order, and diverse user preferences, all of which make it difficult to accurately capture user intent and fine-grained product semantics. In recent years, significant advances in large language models (LLMs) for semantic representation and contextual reasoning have created new opportunities to address these challenges. Nevertheless, existing e-commerce search datasets still suffer from notable limitations: queries are often heuristically constructed, cold-start users and long-tail products are filtered out, query and product texts are anonymized, and most datasets cover only a single stage of the search pipeline. Collectively, these issues constrain research on LLM-based e-commerce search. To address these challenges, we construct and release KuaiSearch. To the best of our knowledge, it is the largest e-commerce search dataset currently available. KuaiSearch is built upon real user search interactions from the Kuaishou platform, preserving authentic user queries and natural-language product texts, covering cold-start users and long-tail products, and systematically spanning three key stages of the search pipeline: recall, ranking, and relevance judgment. We conduct a comprehensive analysis of KuaiSearch from multiple perspectives, including products, users, and queries, and establish benchmark experiments across several representative search tasks. Experimental results demonstrate that KuaiSearch provides a valuable foundation for research on real-world e-commerce search.
Paper Structure (22 sections, 6 figures, 8 tables)

This paper contains 22 sections, 6 figures, 8 tables.

Figures (6)

  • Figure 1: Homepage
  • Figure 2: Mall
  • Figure 3: Detail Page
  • Figure 5: Log-log distributions of product interaction frequencies and the number of searches per user.
  • Figure 6: Word cloud visualizations of the most frequent tokens in item titles and user queries (Top-200), reflecting the lexical characteristics of product descriptions and search intent in the dataset. Word clouds are generated from the original-language tokens to ensure faithful visualization of token frequency distributions without distortion from translation.
  • ...and 1 more figures