Convergence and stationary distribution of Elo rating systems
Roberto Cortez, Hagop Tossounian
TL;DR
This work analyzes the Elo rating process as a Markov chain on the zero-sum subspace $\mathcal{Z}_N$, driven by random matches and a fixed $K$-factor. It proves the existence and uniqueness of a stationary distribution $\pi$ and establishes exponential convergence in the $2$-Wasserstein distance $W_2$ (and almost-sure convergence via a natural coupling). It further shows that $\pi$ has finite exponential moments, full support on $\mathcal{Z}_N$, and that $\pi$ converges to the true skill vector $\rho$ as $K\to0$ at rate $\sqrt{K}$, with implications for bias and estimation. Numerical simulations illustrate density shapes, bias phenomena, and the $\sqrt{K}$ scaling for small $K$, providing practical insight into Elo dynamics. Overall, the results furnish a rigorous foundation for the long-time behavior of Elo systems and point to several avenues for extending the model and its analysis.
Abstract
The Elo rating system is a popular and widely adopted method for measuring the relative skill levels of players or teams in various sports and competitions. It assigns players numerical ratings and dynamically updates them based on game results and a model parameter $K$, which determines the sensitivity of rating changes. Assuming random games, this leads to a Markov chain for the evolution of the ratings of the $N$ players in the league. Despite its widespread use, little is known about the long-term behavior of this process. Aiming to fill this gap, in this article we prove that the process converges to its unique equilibrium distribution at an exponential rate in the 2-Wasserstein distance and almost surely. Moreover, we show important properties of the stationary distribution, such as the finiteness of an exponential moment, full support, and convergence to the players' true skills as $K$ decreases, at a rate of $\sqrt{K}$. We also provide Monte Carlo simulations that illustrate some of these properties and offer new insights.
