On Mechanism Underlying Algorithmic Collusion

Zhang Xu; Wei Zhao

On Mechanism Underlying Algorithmic Collusion

Zhang Xu, Wei Zhao

TL;DR

It is shown that in a general class of symmetric games, including Prisoner's Dilemma, Bertrand competition, and any (nonlinear) mixture of first and second price auction, only Nash Equilibrium is stochastically stable, and the tacit collusion is driven by failure to learn NE due to insufficient learning.

Abstract

Two issues of algorithmic collusion are addressed in this paper. First, we show that in a general class of symmetric games, including Prisoner's Dilemma, Bertrand competition, and any (nonlinear) mixture of first and second price auction, only (strict) Nash Equilibrium (NE) is stochastically stable. Therefore, the tacit collusion is driven by failure to learn NE due to insufficient learning, instead of learning some strategies to sustain collusive outcomes. Second, we study how algorithms adapt to collusion in real simulations with insufficient learning. Extensive explorations in early stages and discount factors inflates the Q-value, which interrupts the sequential and alternative price undercut and leads to bilateral rebound. The process is iterated, making the price curves like Edgeworth cycles. When both exploration rate and Q-value decrease, algorithms may bilaterally rebound to relatively high common price level by coincidence, and then get stuck. Finally, we accommodate our reasoning to simulation outcomes in the literature, including optimistic initialization, market design and algorithm design.

On Mechanism Underlying Algorithmic Collusion

TL;DR

Abstract

Paper Structure (31 sections, 9 theorems, 22 equations, 13 figures, 1 table)

This paper contains 31 sections, 9 theorems, 22 equations, 13 figures, 1 table.

Introduction
Literature Review
Preliminaries
Stochastic Stability
Unperturbed Dynamics
Perturbed Dynamics
Sketch of the Proof
The Strict Partial Order
Simulation Results
Constant Exploration Rate
Channel underlying AI Collusion
Relation to the Theory
Applications
Optimistic Initialization
Market Design
...and 16 more sections

Key Result

Lemma 1

Fix any recurrent class $S$, then there exists $(\underline{q}^i,\underline{q}^{-i})$ such that,

Figures (13)

Figure 1: Q-value and Action Transition
Figure 2: Illustration of the Proof of Proposition \ref{['prp: absorbing']}
Figure 3: Average Convergent Price for a Grid of Values of $\alpha$ and $\beta$ (Decay)
Figure 4: Average Convergent Price as a Function of $\delta$ (Decay)
Figure 5: Average Convergent Price as a Function of $\delta$ (Constant)
...and 8 more figures

Theorems & Definitions (19)

Example 1
Example 2
Example 3
Lemma 1
Proposition 1
Lemma 2
Theorem 1
Lemma 3
proof
Definition 1
...and 9 more

On Mechanism Underlying Algorithmic Collusion

TL;DR

Abstract

On Mechanism Underlying Algorithmic Collusion

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (19)