An Analysis of Logit Learning with the r-Lambert Function

Rory Gavin; Ming Cao; Keith Paarporn

An Analysis of Logit Learning with the r-Lambert Function

Rory Gavin, Ming Cao, Keith Paarporn

TL;DR

The paper analyzes logit learning in two-strategy population games by linking logit fixed points to the $r$-Lambert function, yielding explicit expressions for fixed points across all $\beta \ge 0$. It shows that coordination games exhibit a pitchfork bifurcation (1 fixed point at low rationality to 3 at high rationality), while Prisoner’s Dilemma and anti-coordination have a single fixed point for all $\beta$, and that as $\beta\to\infty$ these fixed points converge to Nash equilibria. Stability analyses provide a universal condition $\dfrac{1}{k}W_r^2(kr) - W_r(kr) - 1 \le 0$ governing finite-$\beta$ stability, with high-$\beta$ dynamics matching best-response behavior. The results offer precise, actionable insights into controlling logit dynamics via the rationality parameter and have implications for engineered systems and socio-technical applications.

Abstract

The well-known replicator equation in evolutionary game theory describes how population-level behaviors change over time when individuals make decisions using simple imitation learning rules. In this paper, we study evolutionary dynamics based on a fundamentally different class of learning rules known as logit learning. Numerous previous studies on logit dynamics provide numerical evidence of bifurcations of multiple fixed points for several types of games. Our results here provide a more explicit analysis of the logit fixed points and their stability properties for the entire class of two-strategy population games -- by way of the $r$-Lambert function. We find that for Prisoner's Dilemma and anti-coordination games, there is only a single fixed point for all rationality levels. However, coordination games exhibit a pitchfork bifurcation: there is a single fixed point in a low-rationality regime, and three fixed points in a high-rationality regime. We provide an implicit characterization for the level of rationality where this bifurcation occurs. In all cases, the set of logit fixed points converges to the full set of Nash equilibria in the high rationality limit.

An Analysis of Logit Learning with the r-Lambert Function

TL;DR

The paper analyzes logit learning in two-strategy population games by linking logit fixed points to the

-Lambert function, yielding explicit expressions for fixed points across all

. It shows that coordination games exhibit a pitchfork bifurcation (1 fixed point at low rationality to 3 at high rationality), while Prisoner’s Dilemma and anti-coordination have a single fixed point for all

, and that as

these fixed points converge to Nash equilibria. Stability analyses provide a universal condition

governing finite-

stability, with high-

dynamics matching best-response behavior. The results offer precise, actionable insights into controlling logit dynamics via the rationality parameter and have implications for engineered systems and socio-technical applications.

Abstract

-Lambert function. We find that for Prisoner's Dilemma and anti-coordination games, there is only a single fixed point for all rationality levels. However, coordination games exhibit a pitchfork bifurcation: there is a single fixed point in a low-rationality regime, and three fixed points in a high-rationality regime. We provide an implicit characterization for the level of rationality where this bifurcation occurs. In all cases, the set of logit fixed points converges to the full set of Nash equilibria in the high rationality limit.

Paper Structure (17 sections, 10 theorems, 47 equations, 1 figure, 1 table)

This paper contains 17 sections, 10 theorems, 47 equations, 1 figure, 1 table.

Introduction
Preliminaries
The replicator equation
Problem Description
Analysis
Fixed Points Location in Terms of $r$-Lambert Function
Quantity of Fixed Points
Stability of Fixed Points
Simulations
Conclusions
Proof of Theorem \ref{['thm:AsympOfFPs']}
Proof of Theorem \ref{['thm:NoOfFPs']}
Proof of Theorem \ref{['thm:AsympFPStab']}
The Stationary Points of $h_0(\beta)$ and $h_{-1}(\beta)$
The Roots of $h_0(\beta)$ and $h_{-1}(\beta)$
...and 2 more sections

Key Result

Theorem 4.1

Figures (1)

Figure 1: Plots of the fixed points $x^*$ of \ref{['eq:logit_dyn']} as a function of $\beta$ for points in the parameter space $(\delta_{SP}, \delta_{RT}) = (\pm 1, \pm 2)$, $(\pm 2, \pm 1)$, and $(\pm 2, \pm 2)$. Dashed lines denote mixed Nash equilibria. This diagram plots stable values of $x^*$ in green and unstable ones in red, as calculated numerically using \ref{['eq:genFPstab']}. Gaps in the plots indicate the limits of the numerical methods used to compute the solutions to \ref{['eq:logit_dyn_sol']}.

Theorems & Definitions (19)

Definition 1
Theorem 4.1: Fixed Point Values as $\beta \to \infty$
proof
Theorem 4.2: The Number of Fixed Points
proof
Theorem 4.3: Fixed Point Stability as $\beta \to \infty$
proof
Theorem 4.4: Fixed Point Stability for Finite $\beta$
proof
Lemma 1.1
...and 9 more

An Analysis of Logit Learning with the r-Lambert Function

TL;DR

Abstract

An Analysis of Logit Learning with the r-Lambert Function

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (19)