Learning in Multi-Objective Public Goods Games with Non-Linear Utilities
Nicole Orzan, Erman Acar, Davide Grossi, Patrick Mannion, Roxana Rădulescu
TL;DR
The paper investigates learning in a multi-objective public goods setting where agents have non-linear, risk-based utilities that decouple collective and individual incentives. It introduces MO-EPGG, a MO-MARL framework that vectorizes rewards into $(r^C,r^I)$ and applies a non-linear utility on the collective component under the SER criterion, enabling analysis of risk attitudes (via $\beta$) and environmental uncertainty. Through analytical game-theoretic results (ESR/SER and NE) and MO-DQN experiments, it shows that risk-averse (low $\beta$) agents hinder cooperation, while risk-seeking (high $\beta$) agents promote cooperation in competitive or mixed-motive settings, with uncertainty amplifying these effects; heterogeneity can suppress cooperation in cooperative regimes. The work provides a principled, scalable framework to study incentive alignment, risk preferences, and uncertainty in multi-agent learning, with implications for designing cooperative AI in uncertain human-agent collaborations.
Abstract
Addressing the question of how to achieve optimal decision-making under risk and uncertainty is crucial for enhancing the capabilities of artificial agents that collaborate with or support humans. In this work, we address this question in the context of Public Goods Games. We study learning in a novel multi-objective version of the Public Goods Game where agents have different risk preferences, by means of multi-objective reinforcement learning. We introduce a parametric non-linear utility function to model risk preferences at the level of individual agents, over the collective and individual reward components of the game. We study the interplay between such preference modelling and environmental uncertainty on the incentive alignment level in the game. We demonstrate how different combinations of individual preferences and environmental uncertainties sustain the emergence of cooperative patterns in non-cooperative environments (i.e., where competitive strategies are dominant), while others sustain competitive patterns in cooperative environments (i.e., where cooperative strategies are dominant).
