Multi-agent reinforcement learning in the all-or-nothing public goods game on networks
Benedikt Valentin Meylahn
TL;DR
The paper studies trust formation in an all-or-nothing public goods game played on networks, using exponential moving average learning to infer neighbor contributions. It proves that, on any connected network, the adaptive learning dynamics converge in the long run to a pure-strategy consensus where all agents either always contribute or always defect, with metastable pre-limit behavior possible on complex networks. Simulations on random geometric graphs reveal that higher network density slows convergence and fosters defecting states, while regular graphs show quicker, non-metastable convergence; local network structure strongly shapes interim trust patterns. The findings imply that promoting global public goods may be more effective when leveraging small, well-connected local groups and considering network topology in designing interventions. Overall, the work links network structure, learning dynamics, and threshold public goods to explain how trust and contribution emerge and stabilize in complex systems.
Abstract
We study interpersonal trust by means of the all-or-nothing public goods game between agents on a network. The agents are endowed with the simple yet adaptive learning rule, exponential moving average, by which they estimate the behavior of their neighbors in the network. Theoretically we show that in the long-time limit this multi-agent reinforcement learning process always eventually results in indefinite contribution to the public good or indefinite defection (no agent contributing to the public good). However, by simulation of the pre-limit behavior, we see that on complex network structures there may be mixed states in which the process seems to stabilize before actual convergence to states in which agent beliefs and actions are all the same. In these metastable states the local network characteristics can determine whether agents have high or low trust in their neighbors. More generally it is found that more dense networks result in lower rates of contribution to the public good. This has implications for how one can spread global contribution toward a public good by enabling smaller local interactions.
