Learning Fair and Preferable Allocations through Neural Network
Ryota Maruo, Koh Takeuchi, Hisashi Kashima
TL;DR
This work addresses learning EF1 (envy-free up to one good) allocations from implicit, expert-like rules in the fair division of indivisible goods. It introduces a differentiable relaxation of Round Robin (RR), $\mathrm{SoftRR}$, and a neural network, $\mathrm{NRR}$, that learns the agent order used by RR while preserving EF1 at inference. Key contributions include the first integration of supervised learning to recover implicit EF1 rules from examples, the $\mathrm{SoftRR}$ relaxation that enables backpropagation, and the $\mathrm{NRR}$ architecture that learns the agent ordering and produces the allocation. Experiments on synthetic data show that $\mathrm{NRR}$ outperforms baselines in allocation proximity and related metrics, demonstrating that implicit fair rules can be recovered and used under strict fairness guarantees. The approach offers a way to combine expert heuristics with differentiable learning for fair division.
Abstract
The fair allocation of indivisible resources is a fundamental problem. Existing research has developed various allocation mechanisms and algorithms to satisfy different fairness notions. For example, round robin (RR) was proposed to meet the fairness criterion known as envy-freeness up to one good (EF1). In real-world resource allocation problems, expert algorithms without mathematical formulations are used to find outcomes that users prefer. We therefore aim to design mechanisms that strictly satisfy desirable properties while replicating expert knowledge. This problem is challenging because such heuristic rules are often difficult to formalize mathematically, which complicates their integration into theoretical frameworks. Additionally, formal algorithms struggle to find preferable outcomes, and directly replicating implicit rules can result in unfair allocations because human decision-making can introduce biases. In this paper, we aim to learn implicit allocation mechanisms from examples while strictly satisfying fairness constraints. Specifically, we focus on learning EF1 allocation mechanisms through supervised learning on examples of reported valuations and the corresponding allocation outcomes produced by implicit rules. To address this, we develop neural RR (NRR), a novel neural network that parameterizes RR. NRR is built from a differentiable relaxation of RR and can be trained to learn the agent ordering used for RR. We conducted experiments on learning EF1 allocation mechanisms from examples, demonstrating that our method outperforms baselines in terms of the proximity of predicted allocations and other metrics.
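To make the two central notions concrete, the following is a minimal sketch of classical round robin and an EF1 check. It is not the paper's NRR or its differentiable relaxation. It assumes additive valuations given as a matrix (one row per agent, one column per good) and breaks ties by lowest good index; the function names `round_robin` and `is_ef1` are illustrative choices, not taken from the paper.

```python
from typing import List, Sequence


def round_robin(valuations: Sequence[Sequence[float]],
                order: Sequence[int]) -> List[List[int]]:
    """Allocate goods by round robin: agents take turns in `order`,
    each picking the remaining good they value most.
    Assumption: ties are broken by lowest good index."""
    n = len(valuations)
    remaining = set(range(len(valuations[0])))
    bundles: List[List[int]] = [[] for _ in range(n)]
    turn = 0
    while remaining:
        agent = order[turn % n]
        # sorted() makes tie-breaking deterministic (lowest index wins).
        best = max(sorted(remaining), key=lambda g: valuations[agent][g])
        bundles[agent].append(best)
        remaining.remove(best)
        turn += 1
    return bundles


def is_ef1(valuations: Sequence[Sequence[float]],
           bundles: Sequence[Sequence[int]]) -> bool:
    """Check EF1 under additive valuations: for every pair (i, j),
    agent i must not envy j once some single good is removed from j's bundle."""
    for i, v in enumerate(valuations):
        own = sum(v[g] for g in bundles[i])
        for j, other in enumerate(bundles):
            if i == j or not other:
                continue
            # Removing the good i values most in j's bundle is the best case for i.
            others = sum(v[g] for g in other) - max(v[g] for g in other)
            if own < others:
                return False
    return True


if __name__ == "__main__":
    vals = [[8, 5, 3, 1], [7, 6, 2, 4], [3, 9, 4, 2]]
    for order in ([0, 1, 2], [2, 0, 1]):
        alloc = round_robin(vals, order)
        print(order, alloc, is_ef1(vals, alloc))
```

Different agent orders give different allocations, and with additive valuations each of them satisfies EF1. This is why learning only the order, as the abstract describes, leaves the fairness guarantee intact.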
