Probably Correct Optimal Stable Matching for Two-Sided Markets Under Uncertainty
Andreas Athanasopoulos, Anne-Marie George, Christos Dimitrakakis
TL;DR
This work tackles the problem of identifying the Probably Correct Optimal Stable Matching (PCOS) in centralized two-sided markets when the left side's preferences are unknown and feedback is noisy. It departs from regret-minimization by embracing pure exploration, introducing PCOS and several learning algorithms (Uniform Exploration, Elimination, Improved Elimination, Adaptive Sampling) with theoretical sample-complexity guarantees and empirical validation on synthetic data. The key contributions include algorithmic frameworks that learn stable matchings without restricting exploration to only currently stable actions, bounds on sample complexity such as $O\left(\sum_{(p,a) \in m_s^{\star}} \frac{\ln\left(KN/\delta \Delta_{p,a}\right)}{\Delta_{p,a}^2}\right)$ in its variants, and practical evidence that adaptive sampling yields superior performance. The results advance understanding of efficient information gathering in two-sided markets under uncertainty and offer practical avenues for fast, high-probability identification of the optimal stable matching in centralized settings.
Abstract
We consider a learning problem for the stable marriage model under unknown preferences for the left side of the market. We focus on the centralized case, where at each time step, an online platform matches the agents, and obtains a noisy evaluation reflecting their preferences. Our aim is to quickly identify the stable matching that is left-side optimal, rendering this a pure exploration problem with bandit feedback. We specifically aim to find Probably Correct Optimal Stable Matchings and present several bandit algorithms to do so. Our findings provide a foundational understanding of how to efficiently gather and utilize preference information to identify the optimal stable matching in two-sided markets under uncertainty. An experimental analysis on synthetic data complements theoretical results on sample complexities for the proposed methods.
