Whole Page Unbiased Learning to Rank
Haitao Mao, Lixin Zou, Yujia Zheng, Jiliang Tang, Xiaokai Chu, Jiashu Zhao, Qian Wang, Dawei Yin
TL;DR
This paper introduces Whole-page Unbiased Learning to Rank (WP-ULTR) and the BAL algorithm to mitigate biases from all SERP features, not just ranking position. BAL automatically learns a user behavior model via causal discovery and performs bias mitigation through confounding-removal with reweighting and by constraining learning to rely on query-document relevance, using a BERT-based backbone. Experiments on the Baidu-ULTR dataset show BAL outperforms state-of-the-art PB-ULTR baselines across DCG and ERR metrics, especially for high-frequency queries, and ablation studies confirm the necessity of causal-discovery-driven modeling and comprehensive bias mitigation. The work advances practical unbiased learning for modern search systems by enabling automatic discovery of complex, multi-feature biases and providing a scalable, explainable framework with real-world impact.
Abstract
The page presentation biases in the information retrieval system, especially on the click behavior, is a well-known challenge that hinders improving ranking models' performance with implicit user feedback. Unbiased Learning to Rank~(ULTR) algorithms are then proposed to learn an unbiased ranking model with biased click data. However, most existing algorithms are specifically designed to mitigate position-related bias, e.g., trust bias, without considering biases induced by other features in search result page presentation(SERP), e.g. attractive bias induced by the multimedia. Unfortunately, those biases widely exist in industrial systems and may lead to an unsatisfactory search experience. Therefore, we introduce a new problem, i.e., whole-page Unbiased Learning to Rank(WP-ULTR), aiming to handle biases induced by whole-page SERP features simultaneously. It presents tremendous challenges: (1) a suitable user behavior model (user behavior hypothesis) can be hard to find; and (2) complex biases cannot be handled by existing algorithms. To address the above challenges, we propose a Bias Agnostic whole-page unbiased Learning to rank algorithm, named BAL, to automatically find the user behavior model with causal discovery and mitigate the biases induced by multiple SERP features with no specific design. Experimental results on a real-world dataset verify the effectiveness of the BAL.
