Scalable spectral representations for multi-agent reinforcement learning in network MDPs

Zhaolin Ren; Runyu Zhang; Bo Dai; Na Li

Scalable spectral representations for multi-agent reinforcement learning in network MDPs

Zhaolin Ren, Runyu Zhang, Bo Dai, Na Li

TL;DR

This work first derive scalable spectral local representations for network MDPs, which induces a network linear subspace for the local $Q$-function of each agent, and designs a scalable algorithmic framework for continuous state-action network MDPs, and provides end-to-end guarantees for the convergence of the algorithm.

Abstract

Network Markov Decision Processes (MDPs), a popular model for multi-agent control, pose a significant challenge to efficient learning due to the exponential growth of the global state-action space with the number of agents. In this work, utilizing the exponential decay property of network dynamics, we first derive scalable spectral local representations for network MDPs, which induces a network linear subspace for the local $Q$-function of each agent. Building on these local spectral representations, we design a scalable algorithmic framework for continuous state-action network MDPs, and provide end-to-end guarantees for the convergence of our algorithm. Empirically, we validate the effectiveness of our scalable representation-based approach on two benchmark problems, and demonstrate the advantages of our approach over generic function approximation approaches to representing the local $Q$-functions.

Scalable spectral representations for multi-agent reinforcement learning in network MDPs

TL;DR

This work first derive scalable spectral local representations for network MDPs, which induces a network linear subspace for the local

-function of each agent, and designs a scalable algorithmic framework for continuous state-action network MDPs, and provides end-to-end guarantees for the convergence of the algorithm.

Abstract

-function of each agent. Building on these local spectral representations, we design a scalable algorithmic framework for continuous state-action network MDPs, and provide end-to-end guarantees for the convergence of our algorithm. Empirically, we validate the effectiveness of our scalable representation-based approach on two benchmark problems, and demonstrate the advantages of our approach over generic function approximation approaches to representing the local

-functions.

Scalable spectral representations for multi-agent reinforcement learning in network MDPs

TL;DR

Abstract

Scalable spectral representations for multi-agent reinforcement learning in network MDPs

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (24)