Table of Contents
Fetching ...

OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search

Prabhat Agarwal, Minhazul Islam Sk, Nikil Pancha, Kurchi Subhra Hazra, Jiajing Xu, Chuck Rosenberg

TL;DR

OmniSearchSage introduces a unified, multi-task, multi-entity embedding framework that learns a single query representation aligned with pins and products. By enriching entity content with GenAI captions, board titles, and engaged queries, and by employing specialized encoders and a multi-task sampled softmax loss, the system achieves strong offline and online gains across retrieval, ranking, and ads tasks. The work demonstrates robust cross-language effectiveness, scalable serving with a cache-assisted architecture, and practical deployment across Pinterest search with notable improvements in Recall@10, organic fulfillment, and ads relevance. These findings underscore the value of end-to-end, end-user-focused embeddings that are compatible with existing production embeddings and capable of powering diverse search surfaces.

Abstract

In this paper, we present OmniSearchSage, a versatile and scalable system for understanding search queries, pins, and products for Pinterest search. We jointly learn a unified query embedding coupled with pin and product embeddings, leading to an improvement of $>8\%$ relevance, $>7\%$ engagement, and $>5\%$ ads CTR in Pinterest's production search system. The main contributors to these gains are improved content understanding, better multi-task learning, and real-time serving. We enrich our entity representations using diverse text derived from image captions from a generative LLM, historical engagement, and user-curated boards. Our multitask learning setup produces a single search query embedding in the same space as pin and product embeddings and compatible with pre-existing pin and product embeddings. We show the value of each feature through ablation studies, and show the effectiveness of a unified model compared to standalone counterparts. Finally, we share how these embeddings have been deployed across the Pinterest search stack, from retrieval to ranking, scaling to serve $300k$ requests per second at low latency. Our implementation of this work is available at https://github.com/pinterest/atg-research/tree/main/omnisearchsage.

OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search

TL;DR

OmniSearchSage introduces a unified, multi-task, multi-entity embedding framework that learns a single query representation aligned with pins and products. By enriching entity content with GenAI captions, board titles, and engaged queries, and by employing specialized encoders and a multi-task sampled softmax loss, the system achieves strong offline and online gains across retrieval, ranking, and ads tasks. The work demonstrates robust cross-language effectiveness, scalable serving with a cache-assisted architecture, and practical deployment across Pinterest search with notable improvements in Recall@10, organic fulfillment, and ads relevance. These findings underscore the value of end-to-end, end-user-focused embeddings that are compatible with existing production embeddings and capable of powering diverse search surfaces.

Abstract

In this paper, we present OmniSearchSage, a versatile and scalable system for understanding search queries, pins, and products for Pinterest search. We jointly learn a unified query embedding coupled with pin and product embeddings, leading to an improvement of relevance, engagement, and ads CTR in Pinterest's production search system. The main contributors to these gains are improved content understanding, better multi-task learning, and real-time serving. We enrich our entity representations using diverse text derived from image captions from a generative LLM, historical engagement, and user-curated boards. Our multitask learning setup produces a single search query embedding in the same space as pin and product embeddings and compatible with pre-existing pin and product embeddings. We show the value of each feature through ablation studies, and show the effectiveness of a unified model compared to standalone counterparts. Finally, we share how these embeddings have been deployed across the Pinterest search stack, from retrieval to ranking, scaling to serve requests per second at low latency. Our implementation of this work is available at https://github.com/pinterest/atg-research/tree/main/omnisearchsage.
Paper Structure (32 sections, 5 equations, 5 figures, 8 tables)

This paper contains 32 sections, 5 equations, 5 figures, 8 tables.

Figures (5)

  • Figure 1: Diagrammatic Representation of OmniSearchSage's Multi-Entity, Multi-Task Architecture.
  • Figure 2: Overview of the query encoder architecture. The encoder takes the output from the last layer associated with the 'CLS' token, projects it onto a 256-dimensional vector space, and finally L2-normalizes the output to generate the final embedding.
  • Figure 3: Schematic of the unified encoder model for pins and products, illustrating the use of three different tokenizers, a hash embedding table, and an MLP layer for combining text embeddings with other continuous features.
  • Figure 4: A simplified depiction of the search retrieval and ranking stack at Pinterest highlighting the integration points for OmniSearchSage embeddings.
  • Figure 5: Comparative display of pins retrieved in response to the query 'antique copper bathroom sink' from the token-based system and the OmniSearchSage-based system. Pins deemed relevant are outlined in green, while those considered irrelevant are encircled in red.