MEDIATE: Mutually Endorsed Distributed Incentive Acknowledgment Token Exchange
Philipp Altmann, Katharina Winter, Michael Kölle, Maximilian Zorn, Thomy Phan, Claudia Linnhoff-Popien
TL;DR
MEDIATE addresses the challenge of fostering cooperation in decentralized multi-agent systems under privacy constraints. It introduces incentivization tokens that are derived automatically from each agent's local value estimates and agreed upon through a privacy-preserving consensus mechanism. It extends the MATE framework with per-agent token derivation and a consensus based on additive secret sharing, so that tokens adapt to varying reward landscapes while preserving privacy. Empirical results across the Iterated Prisoner's Dilemma, CoinGame variants, and Harvest show that MEDIATE matches or improves on state-of-the-art peer incentivization (PI) approaches, with tokens converging within the first $1000$ epochs and coordinated token exchange yielding robust social welfare gains. The work provides a scalable, privacy-conscious protocol that enhances cooperative behavior in diverse social-dilemma environments and offers a foundation for future adversarial and large-scale evaluations.
Abstract
Recent advances in multi-agent systems (MAS) have shown that incorporating peer incentivization (PI) mechanisms vastly improves cooperation. Especially in social dilemmas, communication between agents helps to overcome sub-optimal Nash equilibria. However, incentivization tokens need to be carefully selected. Furthermore, real-world applications might impose stricter privacy requirements and limit exchange. Therefore, we extend the PI protocol for mutual acknowledgment token exchange (MATE) and provide additional analysis of the impact of the chosen tokens. Building upon those insights, we propose mutually endorsed distributed incentive acknowledgment token exchange (MEDIATE), an extended PI architecture employing automatic token derivation via decentralized consensus. Empirical results show stable agreement on appropriate tokens, yielding superior performance compared to static tokens and state-of-the-art approaches in different social dilemma environments with various reward distributions.
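The decentralized consensus step can be illustrated with additive secret sharing, the primitive named in the TL;DR. The following is a minimal sketch and not the authors' implementation. It assumes that the agreed token is the mean of the agents' locally derived tokens, that real-valued tokens are encoded as fixed-point integers, and that all agents are honest. The names `PRIME`, `SCALE`, `make_shares`, and `consensus_mean` are hypothetical and chosen only for this illustration.

```python
import secrets

PRIME = 2**61 - 1  # modulus for share arithmetic (assumed, not from the paper)
SCALE = 10**6      # fixed-point scale for real-valued tokens (assumed)


def encode(token: float) -> int:
    """Map a real-valued token to a fixed-point residue modulo PRIME."""
    return round(token * SCALE) % PRIME


def decode(value: int) -> float:
    """Map a residue back to a signed real value."""
    if value > PRIME // 2:
        value -= PRIME
    return value / SCALE


def make_shares(secret: int, n: int) -> list[int]:
    """Split `secret` into n additive shares that sum to it modulo PRIME."""
    shares = [secrets.randbelow(PRIME) for _ in range(n - 1)]
    shares.append((secret - sum(shares)) % PRIME)
    return shares


def consensus_mean(local_tokens: list[float]) -> float:
    """Compute the mean token from partial sums of shares only."""
    n = len(local_tokens)
    # shares[i][j] is the share that agent i sends to agent j.
    shares = [make_shares(encode(t), n) for t in local_tokens]
    # Each agent j adds up the shares it received and publishes only that sum.
    partial_sums = [sum(shares[i][j] for i in range(n)) % PRIME for j in range(n)]
    # The published partial sums reveal the total, but no individual token.
    total = sum(partial_sums) % PRIME
    return decode(total) / n
```

In this sketch, every individual share is uniformly random, so a single share carries no information about the sender's local token; only the sum of all published partial sums is meaningful. For example, `consensus_mean([1.0, 2.0, 3.0])` reconstructs the mean of the three local tokens without any agent disclosing its own value.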
