On Passivity, Reinforcement Learning and Higher-Order Learning in Multi-Agent Finite Games

Bolin Gao; Lacra Pavel

On Passivity, Reinforcement Learning and Higher-Order Learning in Multi-Agent Finite Games

Bolin Gao, Lacra Pavel

TL;DR

It is shown that convergence to a Nash distribution can be attained in a broader class of games than previously considered in the literature—namely, in games characterized by the monotonicity property of their (negative) payoff vectors.

Abstract

In this paper, we propose a passivity-based methodology for analysis and design of reinforcement learning in multi-agent finite games. Starting from a known exponentially-discounted reinforcement learning scheme, we show that convergence to a Nash distribution can be shown in the class of games characterized by the monotonicity property of their (negative) payoff. We further exploit passivity to propose a class of higher-order schemes that preserve convergence properties, can improve the speed of convergence and can even converge in cases whereby their first-order counterpart fail to converge. We demonstrate these properties through numerical simulations for several representative games.

On Passivity, Reinforcement Learning and Higher-Order Learning in Multi-Agent Finite Games

TL;DR

Abstract

On Passivity, Reinforcement Learning and Higher-Order Learning in Multi-Agent Finite Games

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (19)

Theorems & Definitions (31)