Table of Contents
Fetching ...

On Mechanism Underlying Algorithmic Collusion

Zhang Xu, Wei Zhao

TL;DR

It is shown that in a general class of symmetric games, including Prisoner's Dilemma, Bertrand competition, and any (nonlinear) mixture of first and second price auction, only Nash Equilibrium is stochastically stable, and the tacit collusion is driven by failure to learn NE due to insufficient learning.

Abstract

Two issues of algorithmic collusion are addressed in this paper. First, we show that in a general class of symmetric games, including Prisoner's Dilemma, Bertrand competition, and any (nonlinear) mixture of first and second price auction, only (strict) Nash Equilibrium (NE) is stochastically stable. Therefore, the tacit collusion is driven by failure to learn NE due to insufficient learning, instead of learning some strategies to sustain collusive outcomes. Second, we study how algorithms adapt to collusion in real simulations with insufficient learning. Extensive explorations in early stages and discount factors inflates the Q-value, which interrupts the sequential and alternative price undercut and leads to bilateral rebound. The process is iterated, making the price curves like Edgeworth cycles. When both exploration rate and Q-value decrease, algorithms may bilaterally rebound to relatively high common price level by coincidence, and then get stuck. Finally, we accommodate our reasoning to simulation outcomes in the literature, including optimistic initialization, market design and algorithm design.

On Mechanism Underlying Algorithmic Collusion

TL;DR

It is shown that in a general class of symmetric games, including Prisoner's Dilemma, Bertrand competition, and any (nonlinear) mixture of first and second price auction, only Nash Equilibrium is stochastically stable, and the tacit collusion is driven by failure to learn NE due to insufficient learning.

Abstract

Two issues of algorithmic collusion are addressed in this paper. First, we show that in a general class of symmetric games, including Prisoner's Dilemma, Bertrand competition, and any (nonlinear) mixture of first and second price auction, only (strict) Nash Equilibrium (NE) is stochastically stable. Therefore, the tacit collusion is driven by failure to learn NE due to insufficient learning, instead of learning some strategies to sustain collusive outcomes. Second, we study how algorithms adapt to collusion in real simulations with insufficient learning. Extensive explorations in early stages and discount factors inflates the Q-value, which interrupts the sequential and alternative price undercut and leads to bilateral rebound. The process is iterated, making the price curves like Edgeworth cycles. When both exploration rate and Q-value decrease, algorithms may bilaterally rebound to relatively high common price level by coincidence, and then get stuck. Finally, we accommodate our reasoning to simulation outcomes in the literature, including optimistic initialization, market design and algorithm design.
Paper Structure (31 sections, 9 theorems, 22 equations, 13 figures, 1 table)

This paper contains 31 sections, 9 theorems, 22 equations, 13 figures, 1 table.

Key Result

Lemma 1

Fix any recurrent class $S$, then there exists $(\underline{q}^i,\underline{q}^{-i})$ such that,

Figures (13)

  • Figure 1: Q-value and Action Transition
  • Figure 2: Illustration of the Proof of Proposition \ref{['prp: absorbing']}
  • Figure 3: Average Convergent Price for a Grid of Values of $\alpha$ and $\beta$ (Decay)
  • Figure 4: Average Convergent Price as a Function of $\delta$ (Decay)
  • Figure 5: Average Convergent Price as a Function of $\delta$ (Constant)
  • ...and 8 more figures

Theorems & Definitions (19)

  • Example 1
  • Example 2
  • Example 3
  • Lemma 1
  • Proposition 1
  • Lemma 2
  • Theorem 1
  • Lemma 3
  • proof
  • Definition 1
  • ...and 9 more