Efficient Pointwise-Pairwise Learning-to-Rank for News Recommendation
Nithish Kannen, Yao Ma, Gerrit J. J. van den Burg, Jean Baptiste Faddoul
TL;DR
This work addresses efficient learning-to-rank for news recommendation by integrating pointwise relevance with pairwise preferences in a single PLM framework called GLIMPSE. It employs multi-task fine-tuning to train a model for both Rel and Pref as text-generation tasks and introduces an RTL-based inference that refines an initial pointwise ranking with at most $m$ passes on the top-$k$ items, yielding $O(K)$ complexity per pass. A Markov-chain theoretical analysis establishes conditions under which RTL refinements improve expected ranking metrics, linking pairwise accuracy to gains in $\text{MRR}$ and related measures. Empirically, GLIMPSE outperforms state-of-the-art baselines on MIND and Adressa datasets, with notable improvements in AUC and DCG metrics while maintaining practical runtimes, demonstrating the approach's practical impact for scalable, high-quality news recommendation.
Abstract
News recommendation is a challenging task that involves personalization based on the interaction history and preferences of each user. Recent works have leveraged the power of pretrained language models (PLMs) to directly rank news items by using inference approaches that predominately fall into three categories: pointwise, pairwise, and listwise learning-to-rank. While pointwise methods offer linear inference complexity, they fail to capture crucial comparative information between items that is more effective for ranking tasks. Conversely, pairwise and listwise approaches excel at incorporating these comparisons but suffer from practical limitations: pairwise approaches are either computationally expensive or lack theoretical guarantees, and listwise methods often perform poorly in practice. In this paper, we propose a novel framework for PLM-based news recommendation that integrates both pointwise relevance prediction and pairwise comparisons in a scalable manner. We present a rigorous theoretical analysis of our framework, establishing conditions under which our approach guarantees improved performance. Extensive experiments show that our approach outperforms the state-of-the-art methods on the MIND and Adressa news recommendation datasets.
