Metric Learning from Limited Pairwise Preference Comparisons
Zhi Wang, Geelon So, Ramya Korlakai Vinayak
TL;DR
This work investigates learning a shared Mahalanobis distance from limited pairwise preference data under the ideal point model. It proves an impossibility result for generic high-dimensional data with $m=o(d)$ per user, then presents a divide-and-conquer framework that leverages low-rank subspace structure (subspace-clusterable data) to recover the metric by first learning subspace metrics and then stitching them together, with exact and approximate recovery guarantees. The approach is validated empirically on synthetic data, showing robust metric recovery under binary/noisy responses and varying numbers of subspaces, users, and comparisons. The results highlight a practical pathway for aligning high-dimensional foundation-model representations with human preferences by exploiting low-dimensional structure to reduce query costs.
Abstract
We study metric learning from preference comparisons under the ideal point model, in which a user prefers an item over another if it is closer to their latent ideal item. These items are embedded into $\mathbb{R}^d$ equipped with an unknown Mahalanobis distance shared across users. While recent work shows that it is possible to simultaneously recover the metric and ideal items given $\mathcal{O}(d)$ pairwise comparisons per user, in practice we often have a limited budget of $o(d)$ comparisons. We study whether the metric can still be recovered, even though it is known that learning individual ideal items is now no longer possible. We show that in general, $o(d)$ comparisons reveal no information about the metric, even with infinitely many users. However, when comparisons are made over items that exhibit low-dimensional structure, each user can contribute to learning the metric restricted to a low-dimensional subspace so that the metric can be jointly identified. We present a divide-and-conquer approach that achieves this, and provide theoretical recovery guarantees and empirical validation.
