On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions
Omer Madmon, Idan Pipano, Itamar Reinman, Moshe Tennenholtz
TL;DR
This work addresses convergence of no-regret dynamics in information retrieval games where publishers compete over exposure under proportional ranking functions. It proves a tight equivalence: the activation function $g$ being concave is necessary and sufficient for the induced game to be concave and socially-concave, which in turn ensures convergence of any no-regret dynamics and, under bi-convex distance, unique Nash equilibria. The paper also provides a continuous embedding-based model and conducts extensive simulations using a state-of-the-art no-regret algorithm to explore welfare trade-offs between publishers and users as a function of $λ$, $n$, $s$, and $k$, showing clear trade-offs between welfare and convergence speed. These results offer practical guidance for ranking design in SEO-like systems while providing rigorous convergence guarantees under no-regret learning, and they outline key limitations and directions for future work.
Abstract
Publishers who publish their content on the web act strategically, in a behavior that can be modeled within the online learning framework. Regret, a central concept in machine learning, serves as a canonical measure for assessing the performance of learning agents within this framework. We prove that any proportional content ranking function with a concave activation function induces games in which no-regret learning dynamics converge. Moreover, for proportional ranking functions, we prove the equivalence of the concavity of the activation function, the social concavity of the induced games and the concavity of the induced games. We also study the empirical trade-offs between publishers' and users' welfare, under different choices of the activation function, using a state-of-the-art no-regret dynamics algorithm. Furthermore, we demonstrate how the choice of the ranking function and changes in the ecosystem structure affect these welfare measures, as well as the dynamics' convergence rate.
