Emergent Cooperation under Uncertain Incentive Alignment
Nicole Orzan, Erman Acar, Davide Grossi, Roxana Rădulescu
TL;DR
This work investigates how cooperation emerges among reinforcement learning agents when incentive alignment is uncertain, using the Extended Public Goods Game (EPGG) to span competitive, mixed-motive, and cooperative regimes. It analyzes reputation-based social norms, steering agents, and intrinsic rewards as mechanisms to promote cooperation under uncertainty, modeling observation noise on the incentive multiplier $f$ and evaluating both tabular Q-learning and Deep Q-Networks. The key findings show that uncertainty substantially reduces cooperation in cooperative and mixed-motive settings, but reputation with an effective social norm and intrinsic rewards can restore near-optimal cooperation, especially when steering agents are present. The results highlight the importance of combining social and intrinsic motivation signals to robustly foster cooperative behavior in multi-agent systems facing uncertain incentives, with implications for scalable, cooperative AI in real-world, sparse-interaction environments.
Abstract
Understanding the emergence of cooperation in systems of computational agents is crucial for the development of effective cooperative AI. Interaction among individuals in real-world settings are often sparse and occur within a broad spectrum of incentives, which often are only partially known. In this work, we explore how cooperation can arise among reinforcement learning agents in scenarios characterised by infrequent encounters, and where agents face uncertainty about the alignment of their incentives with those of others. To do so, we train the agents under a wide spectrum of environments ranging from fully competitive, to fully cooperative, to mixed-motives. Under this type of uncertainty we study the effects of mechanisms, such as reputation and intrinsic rewards, that have been proposed in the literature to foster cooperation in mixed-motives environments. Our findings show that uncertainty substantially lowers the agents' ability to engage in cooperative behaviour, when that would be the best course of action. In this scenario, the use of effective reputation mechanisms and intrinsic rewards boosts the agents' capability to act nearly-optimally in cooperative environments, while greatly enhancing cooperation in mixed-motive environments as well.
