Denoising Neural Reranker for Recommender Systems

Wenyu Mao; Shuchang Liu; Hailan Yang; Xiaobei Wang; Xiaoyu Yang; Xu Gao; Xiang Li; Lantao Hu; Han Li; Kun Gai; An Zhang; Xiang Wang

Denoising Neural Reranker for Recommender Systems

Wenyu Mao, Shuchang Liu, Hailan Yang, Xiaobei Wang, Xiaoyu Yang, Xu Gao, Xiang Li, Lantao Hu, Han Li, Kun Gai, An Zhang, Xiang Wang

TL;DR

This paper tackles the underexplored use of retriever scores to inform the reranker in a two-stage recommender system by treating reranking as a denoising problem. It introduces Denoising Neural Reranker (DNR), an adversarial framework that jointly trains a denoising reranker $q_\theta$ and a learnable noise generator $f_\boldsymbol{\phi}$ to augment and denoise retriever scores, guided by three objectives: $\mathcal{L}_z$ (denoising), $\mathcal{L}_{adv}$ (adversarial exploration), and $\mathcal{L}_x$ (score distribution regularization). The approach is supported by theoretical insights on the limitations of direct optimization and is validated through extensive experiments on three public datasets and an industrial system, showing consistent improvements over strong baselines. The findings demonstrate practical impact by improving final exposure quality while maintaining efficiency, and they suggest promising future directions for integrating noise-aware denoising into broader multi-stage recommendation pipelines.

Abstract

For multi-stage recommenders in industry, a user request would first trigger a simple and efficient retriever module that selects and ranks a list of relevant items, then the recommender calls a slower but more sophisticated reranking model that refines the item list exposure to the user. To consistently optimize the two-stage retrieval reranking framework, most efforts have focused on learning reranker-aware retrievers. In contrast, there has been limited work on how to achieve a retriever-aware reranker. In this work, we provide evidence that the retriever scores from the previous stage are informative signals that have been underexplored. Specifically, we first empirically show that the reranking task under the two-stage framework is naturally a noise reduction problem on the retriever scores, and theoretically show the limitations of naive utilization techniques of the retriever scores. Following this notion, we derive an adversarial framework DNR that associates the denoising reranker with a carefully designed noise generation module. The resulting DNR solution extends the conventional score error minimization loss with three augmented objectives, including: 1) a denoising objective that aims to denoise the noisy retriever scores to align with the user feedback; 2) an adversarial retriever score generation objective that improves the exploration in the retriever score space; and 3) a distribution regularization term that aims to align the distribution of generated noisy retriever scores with the real ones. We conduct extensive experiments on three public datasets and an industrial recommender system, together with analytical support, to validate the effectiveness of the proposed DNR.

Denoising Neural Reranker for Recommender Systems

TL;DR

Abstract

Denoising Neural Reranker for Recommender Systems

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)