Learning of Nash Equilibria in Risk-Averse Games

Zifan Wang; Yi Shen; Michael M. Zavlanos; Karl H. Johansson

Learning of Nash Equilibria in Risk-Averse Games

Zifan Wang, Yi Shen, Michael M. Zavlanos, Karl H. Johansson

TL;DR

A first-order risk-averse leaning algorithm, in which the CVaR gradient estimate depends on an estimate of the Value at Risk (VaR) value combined with the gradient of the stochastic cost function, is proposed.

Abstract

This paper considers risk-averse learning in convex games involving multiple agents that aim to minimize their individual risk of incurring significantly high costs. Specifically, the agents adopt the conditional value at risk (CVaR) as a risk measure with possibly different risk levels. To solve this problem, we propose a first-order risk-averse leaning algorithm, in which the CVaR gradient estimate depends on an estimate of the Value at Risk (VaR) value combined with the gradient of the stochastic cost function. Although estimation of the CVaR gradients using finitely many samples is generally biased, we show that the accumulated error of the CVaR gradient estimates is bounded with high probability. Moreover, assuming that the risk-averse game is strongly monotone, we show that the proposed algorithm converges to the risk-averse Nash equilibrium. We present numerical experiments on a Cournot game example to illustrate the performance of the proposed method.

Learning of Nash Equilibria in Risk-Averse Games

TL;DR

Abstract

Paper Structure (14 sections, 5 theorems, 33 equations, 1 figure, 1 algorithm)

This paper contains 14 sections, 5 theorems, 33 equations, 1 figure, 1 algorithm.

Introduction
Preliminaries
Convex Games
CVaR and VaR
Risk-Averse Convex Games
Problem Formulation
Strong Monotonicity Analysis
A Risk-Averse Learning Algorithm
Algorithm Description
Convergence Analysis
Numerical Result
Conclusion
Proof of Lemma \ref{['lemma:FO:sum:epsilon']}
Proof of Theorem \ref{['theorem:FO']}

Key Result

Lemma 1

Let Assumption assump:strong_monotone hold. Then, the risk-averse game def:risk-averse_game is $m$-strongly monotone.

Figures (1)

Figure 1: Error to the Nash equilibrium among Algorithm 1, unbiased first-order algorithm (Unbiased FO), and the zeroth-order algorithm with momentum (ZO momentum) in wang2022risk. The solid lines and shades are averages and standard deviations over 20 runs.

Theorems & Definitions (10)

Lemma 1
proof
Lemma 2
proof
Lemma 3
proof
Lemma 4
proof
Theorem 1
proof

Learning of Nash Equilibria in Risk-Averse Games

TL;DR

Abstract

Learning of Nash Equilibria in Risk-Averse Games

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (10)