A potentialization algorithm for games with applications to multi-agent learning in repeated games

Philipp Lakheshar; Sharwin Rezagholi

A potentialization algorithm for games with applications to multi-agent learning in repeated games

Philipp Lakheshar, Sharwin Rezagholi

TL;DR

An algorithm that assigns to any game in normal form an approximating game that admits an ordinal potential function guarantees convergence to stable agent behavior.

Abstract

We investigate an algorithm that assigns to any game in normal form an approximating game that admits an ordinal potential function. Due to the properties of potential games, the algorithm equips every game with a surrogate reward structure that allows efficient multi-agent learning. Numerical simulations using the replicator dynamics show that 'potentialization' guarantees convergence to stable agent behavior.

A potentialization algorithm for games with applications to multi-agent learning in repeated games

TL;DR

An algorithm that assigns to any game in normal form an approximating game that admits an ordinal potential function guarantees convergence to stable agent behavior.

Abstract

Paper Structure (9 sections, 2 theorems, 17 equations, 5 figures, 1 algorithm)

This paper contains 9 sections, 2 theorems, 17 equations, 5 figures, 1 algorithm.

Introduction
Games and potentials
Potentialization algorithm
Properties of the algorithmic result
Numerical experiments
Random generation of games.
Experiments.
Results.
Conclusion

Key Result

Theorem 5

A game admits an ordinal potential function if and only if its deviation graph is devoid of weak improvement cycles.

Figures (5)

Figure 1: Construction of the potential function.
Figure 2: Coordination game. The pure Nash equilibria of the original game are Pareto-ordered: $\alpha > \beta > \gamma$. The second and the third action of both players is weakly dominated. There is a cycle containing all vertices but $\alpha$. All pure equilibria are retained and two additional pure equilibria are created.
Figure 3: Game from Voorneveld and Nolde (1997). The vertices denoted by $\bullet$ form a cycle. All three equilibria are retained but the Pareto-best original equilibrium becomes preferable to both players.
Figure 4: Experimental results for $(10 \times 10)$-games ($\pm$ empirical standard deviation).
Figure 5: Experimental results for $(4 \times 4 \times 4)$-games ($\pm$ empirical standard deviation).

Theorems & Definitions (10)

Definition 1: Finite game in normal form
Definition 2: Deviation graph
Definition 3: Ordinal potential monderer1996potential
Definition 4: Weak improvement cycle
Theorem 5: Voorneveld and Nolde voorneveld1997
Definition 6: Strongly connected component
Definition 7: Condensation of weighted directed graphs
Proposition 8
proof
Definition 9: Replicator differential equation taylor1978

A potentialization algorithm for games with applications to multi-agent learning in repeated games

TL;DR

Abstract

A potentialization algorithm for games with applications to multi-agent learning in repeated games

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (10)