Safe Exploitative Play with Untrusted Type Beliefs

Tongxin Li; Tinashe Handina; Shaolei Ren; Adam Wierman

TL;DR

We formally define a tradeoff between risk and opportunity by comparing the payoff obtained against the optimal payoff; the tradeoff is represented by the gap caused by trusting or distrusting the learned beliefs.

Abstract

The combination of Bayesian games and learning has a rich history. The central idea is to control a single agent in a system of multiple agents with unknown behaviors, given a set of types, each specifying a possible behavior for the other agents. The agent plans its own actions with respect to the types it believes most likely, so as to maximize its payoff. However, type beliefs are often learned from past actions and are likely to be incorrect. With this perspective in mind, we consider an agent in a game with type predictions of other components, and investigate the impact of incorrect beliefs on the agent's payoff. In particular, we formally define a tradeoff between risk and opportunity by comparing the payoff obtained against the optimal payoff; the tradeoff is represented by the gap caused by trusting or distrusting the learned beliefs. Our main results characterize this tradeoff by establishing upper and lower bounds on the Pareto front for both normal-form and stochastic Bayesian games, with numerical results provided.
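To make the trust/distrust gap concrete, the following is a minimal toy sketch, not the paper's formal definitions or algorithm. It assumes a two-action, two-type normal-form setting in which the opponent's type fully determines the agent's payoff. The payoff numbers, the action and type names, and the helpers `trusting_action`, `distrusting_action`, and `gap` are all hypothetical and introduced only for illustration.

```python
# Hypothetical toy payoffs: PAYOFF[action][opponent_type].
PAYOFF = {
    "exploit": {"passive": 5.0, "aggressive": -4.0},
    "hedge": {"passive": 3.0, "aggressive": 2.0},
}
TYPES = ["passive", "aggressive"]


def trusting_action(belief):
    """Fully trust the belief: maximize expected payoff under it."""
    return max(
        PAYOFF,
        key=lambda a: sum(belief[t] * PAYOFF[a][t] for t in TYPES),
    )


def distrusting_action():
    """Ignore the belief: maximize the worst-case payoff over all types."""
    return max(PAYOFF, key=lambda a: min(PAYOFF[a][t] for t in TYPES))


def gap(action, true_type):
    """Payoff lost relative to the best action against the true type."""
    best = max(PAYOFF[a][true_type] for a in PAYOFF)
    return best - PAYOFF[action][true_type]


# A learned (possibly wrong) belief that the opponent is probably passive.
belief = {"passive": 0.9, "aggressive": 0.1}
trust = trusting_action(belief)
safe = distrusting_action()

for true_type in TYPES:
    print(
        f"true type={true_type}: "
        f"trusting gap={gap(trust, true_type)}, "
        f"distrusting gap={gap(safe, true_type)}"
    )
```

In this toy instance, trusting the belief leaves no gap when the belief is right but a large gap when it is wrong, while distrusting it gives up some payoff when the belief is right in exchange for protection when it is wrong. This is only meant to illustrate the kind of risk-versus-opportunity tradeoff the abstract describes.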
