Fairness in Reinforcement Learning with Bisimulation Metrics

Sahand Rezaei-Shoshtari; Hanna Yurchyk; Scott Fujimoto; Doina Precup; David Meger

Fairness in Reinforcement Learning with Bisimulation Metrics

Sahand Rezaei-Shoshtari, Hanna Yurchyk, Scott Fujimoto, Doina Precup, David Meger

TL;DR

The paper tackles long-term group fairness in reinforcement learning by linking demographic parity to bisimulation metrics. It introduces Bisimulator, which unconstrainedly optimizes reward and observation dynamics guided by a group-conditioned bisimulation metric, leaving the underlying RL solver unchanged. The authors formalize group-conditioned pi-bisimulation, derive value-bound relations, and propose a practical algorithm that minimizes a joint bisimulation-based loss via quantile-matched state-group pairs, demonstrated on lending and college admissions benchmarks with PPO and DQN. This approach provides a scalable, solver-agnostic pathway to reduce disparities over time, while maintaining competitive performance in dynamic, sequential decision problems.

Abstract

Ensuring long-term fairness is crucial when developing automated decision making systems, specifically in dynamic and sequential environments. By maximizing their reward without consideration of fairness, AI agents can introduce disparities in their treatment of groups or individuals. In this paper, we establish the connection between bisimulation metrics and group fairness in reinforcement learning. We propose a novel approach that leverages bisimulation metrics to learn reward functions and observation dynamics, ensuring that learners treat groups fairly while reflecting the original problem. We demonstrate the effectiveness of our method in addressing disparities in sequential decision making problems through empirical evaluation on a standard fairness benchmark consisting of lending and college admission scenarios.

Fairness in Reinforcement Learning with Bisimulation Metrics

TL;DR

Abstract

Fairness in Reinforcement Learning with Bisimulation Metrics

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (15)