Warmer for Less: A Cost-Efficient Strategy for Cold-Start Recommendations at Pinterest

Saeed Ebrahimi; Weijie Jiang; Jaewon Yang; Olafur Gudmundsson; Yucheng Tu; Huizhong Duan

TL;DR

This work tackles the cold-start (CS) problem in Pinterest's large-scale recommender systems with a cost-efficient, plug-and-play framework. The framework combines three components: a residual path for non-historical features, a lightweight score-debiasing loss based on maximum mean discrepancy (MMD), and manifold mixup in the embedding space. Together they improve generalization to CS items without increasing serving cost. Offline and online experiments show consistent gains in fresh-content engagement, particularly for CS items, with limited parameter overhead and deployment at the scale of hundreds of millions of users. The results point to a practical path toward more balanced, generalizable recommendations in industrial-scale systems.
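To make the score-debiasing idea concrete, here is a minimal sketch of an MMD-style penalty between the predicted scores of CS items and non-CS items. It is an illustration under stated assumptions, not the authors' implementation: the RBF kernel, the `bandwidth` value, the biased estimator, the example scores, and the `lambda_mmd` weight mentioned in the final comment are all hypothetical choices that the text above does not specify.

```python
import math


def rbf_kernel(a: float, b: float, bandwidth: float = 0.1) -> float:
    """Gaussian (RBF) kernel between two scalar scores (kernel choice is an assumption)."""
    return math.exp(-((a - b) ** 2) / (2.0 * bandwidth ** 2))


def mmd_squared(xs, ys, bandwidth: float = 0.1) -> float:
    """Biased estimate of the squared MMD between two samples of scalar scores."""

    def mean_kernel(us, vs):
        # Average kernel value over all pairs drawn from the two samples.
        total = sum(rbf_kernel(u, v, bandwidth) for u in us for v in vs)
        return total / (len(us) * len(vs))

    return mean_kernel(xs, xs) + mean_kernel(ys, ys) - 2.0 * mean_kernel(xs, ys)


# Hypothetical predicted scores within one training batch.
cs_scores = [0.05, 0.08, 0.10, 0.07]      # cold-start items
warm_scores = [0.30, 0.35, 0.28, 0.40]    # non-cold-start items

penalty = mmd_squared(cs_scores, warm_scores)
# In training, such a penalty would be added to the task loss, for example:
# total_loss = task_loss + lambda_mmd * penalty   (lambda_mmd is a hypothetical weight)
```

The penalty is zero when the two score samples coincide and grows as the CS score distribution drifts away from the non-CS one, which is the behaviour a regularizer of this kind relies on.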

Abstract

Pinterest is a leading visual discovery platform where recommender systems (RecSys) are key to delivering relevant, engaging, and fresh content to our users. In this paper, we study the problem of improving RecSys model predictions for cold-start (CS) items, which appear infrequently in the training data. Although this problem is well-studied in academia, few studies have addressed its root causes effectively at the scale of a platform like Pinterest. By investigating live traffic data, we identified several challenges of the CS problem and developed a corresponding solution for each. First, industrial-scale RecSys models must operate under tight computational constraints. Since CS items are a minority, any related improvements must be highly cost-efficient. To address this, our solutions were designed to be lightweight, collectively increasing the total parameters by only 5%. Second, CS items are represented only by non-historical (e.g., content or attribute) features, which models often treat as less important. To elevate their significance, we introduce a residual connection for the non-historical features. Third, CS items tend to receive lower prediction scores than non-CS items, reducing their likelihood of being surfaced. We mitigate this by incorporating a score regularization term into the model. Fourth, the labels associated with CS items are sparse, making it difficult for the model to learn from them. We apply the manifold mixup technique to address this data sparsity. Implemented together, our methods increased fresh content engagement at Pinterest by 10% without negatively impacting overall engagement or cost, and have been deployed to serve over 570 million users on Pinterest.
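The residual connection and manifold mixup described above can be sketched in a few lines of plain Python. This is a hedged illustration only: the abstract does not say where the residual is attached or how mixup is parameterized, so the element-wise sum in `residual_path`, the Beta(`alpha`, `alpha`) mixing coefficient, the in-batch shuffling, and all function and variable names here are assumptions.

```python
import random


def residual_path(interaction_output, non_historical_embedding):
    """Add the non-historical embedding back onto a deeper layer's output.

    Assumes both vectors already share the same dimension; a real model
    would likely need a projection first (assumption, not from the source).
    """
    return [h + r for h, r in zip(interaction_output, non_historical_embedding)]


def manifold_mixup(hidden, labels, alpha: float = 0.2, rng=None):
    """Interpolate hidden representations and labels between shuffled batch pairs."""
    rng = rng or random.Random()
    lam = rng.betavariate(alpha, alpha)   # mixing coefficient in [0, 1]
    perm = list(range(len(hidden)))
    rng.shuffle(perm)                     # partner index for each example
    mixed_hidden = [
        [lam * a + (1.0 - lam) * b for a, b in zip(hidden[i], hidden[j])]
        for i, j in enumerate(perm)
    ]
    mixed_labels = [
        lam * labels[i] + (1.0 - lam) * labels[j] for i, j in enumerate(perm)
    ]
    return mixed_hidden, mixed_labels, lam


# Hypothetical batch of three 2-dimensional hidden vectors with binary labels.
batch_hidden = [[0.1, 0.9], [0.4, 0.2], [0.7, 0.5]]
batch_labels = [1.0, 0.0, 0.0]
mixed_hidden, mixed_labels, lam = manifold_mixup(
    batch_hidden, batch_labels, rng=random.Random(0)
)
```

Mixing at the level of hidden representations rather than raw inputs produces additional soft-labeled training points, which is one way to compensate for the sparse labels of CS items. The residual sum gives the non-historical features a direct route to the output so that they are not drowned out by historical signals.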
