Table of Contents
Fetching ...

GraphEx: A Graph-based Extraction Method for Advertiser Keyphrase Recommendation

Ashirbad Mishra, Soumik Dey, Marshall Wu, Jinyu Zhao, He Yu, Kaichen Ni, Binbin Li, Kamesh Madduri

TL;DR

GraphEx reframes keyphrase recommendation as an extraction-from-titles problem solved via per-leaf bipartite graphs, enabling permutation-based mapping of title tokens to a curated keyphrase universe. It delivers low-latency, GPU-free inference by constructing lightweight CSR graphs, enumerating candidate keyphrases through an efficient token-based expansion, and ranking via a Label Title Alignment score that favors concise, highly relevant phrases. The approach mitigates click-data biases by decoupling recommendations from ground-truth clicks and focuses on head keyphrases under budget constraints, achieving higher diversity and broader advertiser impact in production at eBay. Empirical results, supported by a bespoke AI-based evaluation framework, show GraphEx providing significant incremental gains in revenue and GMV, along with scalable daily model refreshes and favorable latency across billions of items.

Abstract

Online sellers and advertisers are recommended keyphrases for their listed products, which they bid on to enhance their sales. One popular paradigm that generates such recommendations is Extreme Multi-Label Classification (XMC), which involves tagging/mapping keyphrases to items. We outline the limitations of using traditional item-query based tagging or mapping techniques for keyphrase recommendations on E-Commerce platforms. We introduce GraphEx, an innovative graph-based approach that recommends keyphrases to sellers using extraction of token permutations from item titles. Additionally, we demonstrate that relying on traditional metrics such as precision/recall can be misleading in practical applications, thereby necessitating a combination of metrics to evaluate performance in real-world scenarios. These metrics are designed to assess the relevance of keyphrases to items and the potential for buyer outreach. GraphEx outperforms production models at eBay, achieving the objectives mentioned above. It supports near real-time inferencing in resource-constrained production environments and scales effectively for billions of items.

GraphEx: A Graph-based Extraction Method for Advertiser Keyphrase Recommendation

TL;DR

GraphEx reframes keyphrase recommendation as an extraction-from-titles problem solved via per-leaf bipartite graphs, enabling permutation-based mapping of title tokens to a curated keyphrase universe. It delivers low-latency, GPU-free inference by constructing lightweight CSR graphs, enumerating candidate keyphrases through an efficient token-based expansion, and ranking via a Label Title Alignment score that favors concise, highly relevant phrases. The approach mitigates click-data biases by decoupling recommendations from ground-truth clicks and focuses on head keyphrases under budget constraints, achieving higher diversity and broader advertiser impact in production at eBay. Empirical results, supported by a bespoke AI-based evaluation framework, show GraphEx providing significant incremental gains in revenue and GMV, along with scalable daily model refreshes and favorable latency across billions of items.

Abstract

Online sellers and advertisers are recommended keyphrases for their listed products, which they bid on to enhance their sales. One popular paradigm that generates such recommendations is Extreme Multi-Label Classification (XMC), which involves tagging/mapping keyphrases to items. We outline the limitations of using traditional item-query based tagging or mapping techniques for keyphrase recommendations on E-Commerce platforms. We introduce GraphEx, an innovative graph-based approach that recommends keyphrases to sellers using extraction of token permutations from item titles. Additionally, we demonstrate that relying on traditional metrics such as precision/recall can be misleading in practical applications, thereby necessitating a combination of metrics to evaluate performance in real-world scenarios. These metrics are designed to assess the relevance of keyphrases to items and the potential for buyer outreach. GraphEx outperforms production models at eBay, achieving the objectives mentioned above. It supports near real-time inferencing in resource-constrained production environments and scales effectively for billions of items.
Paper Structure (34 sections, 7 figures, 7 tables, 1 algorithm)

This paper contains 34 sections, 7 figures, 7 tables, 1 algorithm.

Figures (7)

  • Figure 1: Screenshot of our keyphrases for manual targeting in Promoted Listings PriorityTM for eBay Advertising.
  • Figure 2: Distribution of click-data in terms of items and the number of queries associated with them.
  • Figure 3: Illustration of GraphEx's construction phase. (i) a set of keyphrases with their search volume rank, (ii) shows the bipartite graph constructed from the set in (i).
  • Figure 4: The average counts of relevant head/tail and irrelevant keyphrases per item are shown for each model.
  • Figure 5: Venn diagram illustrating various recall sources and their associated keyphrases. The intersecting areas show keyphrases shared by multiple sources, whereas the crossed-out, non-overlapping sections represent keyphrases unique to each source, highlighting their distinct contributions.
  • ...and 2 more figures