Learning to Negotiate via Voluntary Commitment

Shuhui Zhu; Baoxiang Wang; Sriram Ganapathi Subramanian; Pascal Poupart

Learning to Negotiate via Voluntary Commitment

Shuhui Zhu, Baoxiang Wang, Sriram Ganapathi Subramanian, Pascal Poupart

TL;DR

This work tackles commitment failures in mixed-motive multi-agent settings by introducing Markov Commitment Games (MCGs) and a learnable Differentiable Commitment Learning (DCL) framework. DCL learns a tripartite policy per agent—proposal, commitment, and action—through unbiased policy gradients $\nabla V^i_{\bm{\phi,\psi,\pi}}(s)$ while differentiating through other agents\' policies, guided by incentive-compatible constraints to favor mutually beneficial agreements. The approach yields faster convergence and higher social welfare than Independent PPO, Mediated MARL, and MOCA across Prisoner\'s Dilemma, Grid, and repeated/conflicting games, and scales to many players with robustness to irrational agents. These results demonstrate that self-interested agents can negotiate effective agreements without central altruism, with implications for scalable cooperative AI in dynamic environments.

Abstract

The partial alignment and conflict of autonomous agents lead to mixed-motive scenarios in many real-world applications. However, agents may fail to cooperate in practice even when cooperation yields a better outcome. One well known reason for this failure comes from non-credible commitments. To facilitate commitments among agents for better cooperation, we define Markov Commitment Games (MCGs), a variant of commitment games, where agents can voluntarily commit to their proposed future plans. Based on MCGs, we propose a learnable commitment protocol via policy gradients. We further propose incentive-compatible learning to accelerate convergence to equilibria with better social welfare. Experimental results in challenging mixed-motive tasks demonstrate faster empirical convergence and higher returns for our method compared with its counterparts. Our code is available at https://github.com/shuhui-zhu/DCL.

Learning to Negotiate via Voluntary Commitment

TL;DR

Abstract

Learning to Negotiate via Voluntary Commitment

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (8)