Mitigating the Participation Bias by Balancing Extreme Ratings
Yongkang Guo, Yuqing Kong, Jialiang Liu
TL;DR
This paper tackles participation bias in rating aggregation by formulating a robust, minimax objective to estimate the underlying mean of ratings when reports are unevenly observed. It introduces two aggregators tailored to the data availability: BEA for the case with known sample size $n$, which imputes unobserved ratings from extreme values and blends them with observed ratings; and PAA for unknown $n$, which asymptotically achieves the minimax optimum by averaging two polarized estimations derived from thresholded histograms. Theoretical results provide lower bounds and optimality claims for BEA and PAA, including asymptotic regret forms and finite-sample guarantees, complemented by visualization and experimental validation on real hotel-rating data. Empirically, BEA and PAA outperform simple averaging and spectral methods in worst-case scenarios, demonstrating practical robustness to participation bias with real-world applicability. The work advances robust aggregation under missing-not-at-random participation and lays groundwork for adaptive, multi-attribute extensions.
Abstract
Rating aggregation plays a crucial role in various fields, such as product recommendations, hotel rankings, and teaching evaluations. However, traditional averaging methods can be affected by participation bias, where some raters do not participate in the rating process, leading to potential distortions. In this paper, we consider a robust rating aggregation task under the participation bias. We assume that raters may not reveal their ratings with a certain probability depending on their individual ratings, resulting in partially observed samples. Our goal is to minimize the expected squared loss between the aggregated ratings and the average of all underlying ratings (possibly unobserved) in the worst-case scenario. We focus on two settings based on whether the sample size (i.e. the number of raters) is known. In the first setting, where the sample size is known, we propose an aggregator, named as the Balanced Extremes Aggregator. It estimates unrevealed ratings with a balanced combination of extreme ratings. When the sample size is unknown, we derive another aggregator, the Polarizing-Averaging Aggregator, which becomes optimal as the sample size grows to infinity. Numerical results demonstrate the superiority of our proposed aggregators in mitigating participation bias, compared to simple averaging and the spectral method. Furthermore, we validate the effectiveness of our aggregators on a real-world dataset.
