Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval

Zhirui Kuai; Zuxu Chen; Huimu Wang; Mingming Li; Dadong Miao; Binbin Wang; Xusong Chen; Li Kuang; Yuxing Han; Jiaxing Wang; Guoyu Tang; Lin Liu; Songlin Wang; Jingwei Zhuo

Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval

Zhirui Kuai, Zuxu Chen, Huimu Wang, Mingming Li, Dadong Miao, Binbin Wang, Xusong Chen, Li Kuang, Yuxing Han, Jiaxing Wang, Guoyu Tang, Lin Liu, Songlin Wang, Jingwei Zhuo

TL;DR

This paper analyses and addresses the "Hourglass" phenomenon in RQ-SID by identifying data sparsity and long-tailed distribution as the primary causes, and proposes effective solutions to mitigate this issue, thereby significantly enhancing the effectiveness of generative retrieval in real-world E-commerce applications.

Abstract

Generative retrieval (GR) has emerged as a transformative paradigm in search and recommender systems, leveraging numeric-based identifier representations to enhance efficiency and generalization. Notably, methods like TIGER employing Residual Quantization-based Semantic Identifiers (RQ-SID), have shown significant promise in e-commerce scenarios by effectively managing item IDs. However, a critical issue termed the "\textbf{Hourglass}" phenomenon, occurs in RQ-SID, where intermediate codebook tokens become overly concentrated, hindering the full utilization of generative retrieval methods. This paper analyses and addresses this problem by identifying data sparsity and long-tailed distribution as the primary causes. Through comprehensive experiments and detailed ablation studies, we analyze the impact of these factors on codebook utilization and data distribution. Our findings reveal that the "Hourglass" phenomenon substantially impacts the performance of RQ-SID in generative retrieval. We propose effective solutions to mitigate this issue, thereby significantly enhancing the effectiveness of generative retrieval in real-world E-commerce applications.

Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval

TL;DR

Abstract

Paper Structure (16 sections, 4 equations, 6 figures, 2 tables)

This paper contains 16 sections, 4 equations, 6 figures, 2 tables.

Introduction
Related Works
Preliminary
Residual Quantization
Generative Retrieval
Problem of GR based on RQ
Hourglass Phenomenon
Analysis of Residual Quantization
Impact on the GR
Methods and Experiments
Heuristic Method
Variable Length of SID
Experiments
Valid Ratio
Conclusion
...and 1 more sections

Figures (6)

Figure 1: The Hourglass Phenomenon of Semantic IDs
Figure 2: Distribution and Connections of Semantic IDs
Figure 3: Illustrating the Hourglass Phenomenon in Semantic IDs with Different Statistical Metrics
Figure 4: Hierarchical Residual Reduction and Dimensional Analysis Across Layers
Figure 5: Invalid IDs Ratio when generating Semantic IDs using Beam Search for various values of $k$
...and 1 more figures

Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval

TL;DR

Abstract

Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval

Authors

TL;DR

Abstract

Table of Contents

Figures (6)