LLM-Enhanced Reranking for Complementary Product Recommendation
Zekun Xu, Yudi Zhang
TL;DR
The paper tackles the accuracy-diversity dilemma in complementary product recommendation by introducing a model-agnostic reranking framework that leverages LLM prompting on top of any baseline retriever. It employs two agents—diversity and accuracy—prompted via structured inputs to reorder candidate items without retraining the underlying model. Across four public datasets, the approach yields notable improvements in both accuracy (Hit@K, NDCG@K) and diversity metrics, with the diversity agent driving broader item coverage and the accuracy agent enhancing precision at the cost of some diversity. The work highlights a practical, retraining-free path to more balanced recommendations and suggests future work on iterative, multi-agent collaboration.
Abstract
Complementary product recommendation, which aims to suggest items that are used together to enhance customer value, is a crucial yet challenging task in e-commerce. While existing graph neural network (GNN) approaches have made significant progress in capturing complex product relationships, they often struggle with the accuracy-diversity tradeoff, particularly for long-tail items. This paper introduces a model-agnostic approach that leverages Large Language Models (LLMs) to enhance the reranking of complementary product recommendations. Unlike previous works that use LLMs primarily for data preprocessing and graph augmentation, our method applies LLM-based prompting strategies directly to rerank candidate items retrieved from existing recommendation models, eliminating the need for model retraining. Through extensive experiments on public datasets, we demonstrate that our approach effectively balances accuracy and diversity in complementary product recommendations, with at least 50% lift in accuracy metrics and 2% lift in diversity metrics on average for the top recommended items across datasets.
