When Less Is More: Binary Feedback Can Outperform Ordinal Comparisons in Ranking Recovery
Shirong Xu, Jingnan Zhang, Junhui Wang
TL;DR
The paper challenges the assumption that ordinal paired comparisons always yield stronger ranking than binary ones, showing that binarization can lead to faster, exponential convergence in ranking recovery under a counting algorithm. It introduces a flexible generalized additive ordinal model with a strength link function $\phi$ and a pattern function $\psi_\gamma$, clarifying when binarization helps via a signal-to-noise analysis of the ordinal pattern. Theoretical results show a quantifiable gap in ranking accuracy between binary and ordinal data, controlled by the SNR of the ordinal pattern, and identify pattern forms that minimize SNR to maximize binarization gains. These insights are validated through extensive simulations and a MovieLens real-data application, with practical implications for designing feedback mechanisms and ranking algorithms that leverage binary responses for more robust preference recovery.
Abstract
Paired comparison data, where users evaluate items in pairs, play a central role in ranking and preference learning tasks. While ordinal comparison data intuitively offer richer information than binary comparisons, this paper challenges that conventional wisdom. We propose a general parametric framework for modeling ordinal paired comparisons without ties. The model adopts a generalized additive structure, featuring a link function that quantifies the preference difference between two items and a pattern function that governs the distribution over ordinal response levels. This framework encompasses classical binary comparison models as special cases, by treating binary responses as binarized versions of ordinal data. Within this framework, we show that binarizing ordinal data can significantly improve the accuracy of ranking recovery. Specifically, we prove that under the counting algorithm, the ranking error associated with binary comparisons exhibits a faster exponential convergence rate than that of ordinal data. Furthermore, we characterize a substantial performance gap between binary and ordinal data in terms of a signal-to-noise ratio (SNR) determined by the pattern function. We identify the pattern function that minimizes the SNR and maximizes the benefit of binarization. Extensive simulations and a real application on the MovieLens dataset further corroborate our theoretical findings.
