Active Learning for Fair and Stable Online Allocations

Riddhiman Bhattacharya; Thanh Nguyen; Will Wei Sun; Mohit Tawarmalani

Active Learning for Fair and Stable Online Allocations

Riddhiman Bhattacharya, Thanh Nguyen, Will Wei Sun, Mohit Tawarmalani

TL;DR

The approach hinges on an active-learning procedure that carefully selects the agent from whom to gather feedback, ensuring its effectiveness in the allocation process, and introduces a deliberate constraint on feedback, restricting it to a single agent or a limited number of agents per period.

Abstract

We explore an active learning approach for dynamic fair resource allocation problems. Unlike previous work that assumes full feedback from all agents on their allocations, we consider feedback from a select subset of agents at each epoch of the online resource allocation process. Despite this restriction, our proposed algorithms provide regret bounds that are sub-linear in number of time-periods for various measures that include fairness metrics commonly used in resource allocation problems and stability considerations in matching mechanisms. The key insight of our algorithms lies in adaptively identifying the most informative feedback using dueling upper and lower confidence bounds. With this strategy, we show that efficient decision-making does not require extensive feedback and produces efficient outcomes for a variety of problem classes.

Active Learning for Fair and Stable Online Allocations

TL;DR

Abstract

Paper Structure (21 sections, 237 equations, 6 figures, 4 algorithms)

This paper contains 21 sections, 237 equations, 6 figures, 4 algorithms.

Introduction
Related Works
Max-min Fairness for Unit Demand Agents
A Special Example: Top $K$-Arm Identification
Our Algorithm and Regret Analysis
Allocation of Bundles
Max-Min Fairness
Our Algorithm and Regret Analysis
Minimal Envy Fairness
Stable Assignment
Model
Learning Task and Regret Measures
Algorithm
Simulations
Conclusion
...and 6 more sections

Figures (6)

Figure 1: Outline of the four interconnected problems addressed in our paper.
Figure 2: Illustration of an allocation with bundle $\phi(j)$ being allocated to agent $j$.
Figure 3: Comparison between the proposed Dueling ULCB and the two benchmark methods.
Figure 4: Cumulative regrets of our Dueling ULCB algorithm with varying $K$ and varying $N$.
Figure 5: Cumulative regrets of the Dueling Max-Min ULCB algorithm with varying reward functions for both agents.
...and 1 more figures

Active Learning for Fair and Stable Online Allocations

TL;DR

Abstract

Active Learning for Fair and Stable Online Allocations

Authors

TL;DR

Abstract

Table of Contents

Figures (6)