Table of Contents
Fetching ...

Cooperative Bargaining Games Without Utilities: Mediated Solutions from Direction Oracles

Kushagra Gupta, Surya Murthy, Mustafa O. Karabag, Ufuk Topcu, David Fridovich-Keil

TL;DR

This paper addresses cooperative bargaining when a mediator lacks access to agents' utilities and can only observe each agent's most preferred direction in the decision space. It introduces Direction-based Bargaining Solution (DiBS), an iterative method with updates $\mathbf{x}_{k+1} = \mathbf{x}_k + \alpha_k\sum_{i=1}^N \|\mathbf{x}_k-\mathbf{x}^{*,i}\|_2\,\mathcal{O}^{\mathcal{D}, i}_{\boldsymbol{\ell}}(\mathbf{x}_k)$ that advances toward Pareto-stationary outcomes using only direction oracles. Theoretical results prove that fixed points of DiBS are Pareto stationary, that, under strong convexity and smoothness, the method globally converges to the fixed-point set, and that DiBS enjoys invariance to strictly increasing monotone nonaffine transformations and symmetry (with independence of irrelevant alternatives under stronger conditions). Empirically, the authors validate DiBS on nonconvex multi-agent formation and mediated stock portfolio allocation, showing balanced outcomes and robustness to direction-oracle estimation via comparisons, with code available at the provided repository.

Abstract

Cooperative bargaining games are widely used to model resource allocation and conflict resolution. Traditional solutions assume the mediator can access agents utility function values and gradients. However, there is an increasing number of settings, such as human AI interactions, where utility values may be inaccessible or incomparable due to unknown, nonaffine transformations. To model such settings, we consider that the mediator has access only to agents most preferred directions, i.e., normalized utility gradients in the decision space. To this end, we propose a cooperative bargaining algorithm where a mediator has access to only the direction oracle of each agent. We prove that unlike popular approaches such as the Nash and Kalai Smorodinsky bargaining solutions, our approach is invariant to monotonic nonaffine transformations, and that under strong convexity and smoothness assumptions, this approach enjoys global asymptotic convergence to Pareto stationary solutions. Moreover, we show that the bargaining solutions found by our algorithm also satisfy the axioms of symmetry and (under slightly stronger conditions) independence of irrelevant alternatives, which are popular in the literature. Finally, we conduct experiments in two domains, multi agent formation assignment and mediated stock portfolio allocation, which validate these theoretic results. All code for our experiments can be found at https://github.com/suryakmurthy/dibs_bargaining.

Cooperative Bargaining Games Without Utilities: Mediated Solutions from Direction Oracles

TL;DR

This paper addresses cooperative bargaining when a mediator lacks access to agents' utilities and can only observe each agent's most preferred direction in the decision space. It introduces Direction-based Bargaining Solution (DiBS), an iterative method with updates that advances toward Pareto-stationary outcomes using only direction oracles. Theoretical results prove that fixed points of DiBS are Pareto stationary, that, under strong convexity and smoothness, the method globally converges to the fixed-point set, and that DiBS enjoys invariance to strictly increasing monotone nonaffine transformations and symmetry (with independence of irrelevant alternatives under stronger conditions). Empirically, the authors validate DiBS on nonconvex multi-agent formation and mediated stock portfolio allocation, showing balanced outcomes and robustness to direction-oracle estimation via comparisons, with code available at the provided repository.

Abstract

Cooperative bargaining games are widely used to model resource allocation and conflict resolution. Traditional solutions assume the mediator can access agents utility function values and gradients. However, there is an increasing number of settings, such as human AI interactions, where utility values may be inaccessible or incomparable due to unknown, nonaffine transformations. To model such settings, we consider that the mediator has access only to agents most preferred directions, i.e., normalized utility gradients in the decision space. To this end, we propose a cooperative bargaining algorithm where a mediator has access to only the direction oracle of each agent. We prove that unlike popular approaches such as the Nash and Kalai Smorodinsky bargaining solutions, our approach is invariant to monotonic nonaffine transformations, and that under strong convexity and smoothness assumptions, this approach enjoys global asymptotic convergence to Pareto stationary solutions. Moreover, we show that the bargaining solutions found by our algorithm also satisfy the axioms of symmetry and (under slightly stronger conditions) independence of irrelevant alternatives, which are popular in the literature. Finally, we conduct experiments in two domains, multi agent formation assignment and mediated stock portfolio allocation, which validate these theoretic results. All code for our experiments can be found at https://github.com/suryakmurthy/dibs_bargaining.

Paper Structure

This paper contains 41 sections, 4 theorems, 13 equations, 5 figures.

Key Result

Proposition 1

There does not exist any bargaining algorithm in which a mediator with access to only direction oracles $\mathcal{O}^{\mathcal{D}, i}_{\boldsymbol{\ell}}$ can find the Nash or the Kalai-Smorodinsky bargaining solutions for all problems satisfying Assumptions assumption: state space-assumption: costs

Figures (5)

  • Figure 1: Formations achieved by different bargaining solutions. While \ref{['eq: our method']} yields qualitatively similar outcomes to \ref{['eq: nbs']} and \ref{['eq: ksbs']} in the original setting, it is also robust to monotone nonaffine scalings. \ref{['eq: ksbs']} is solved in a single shot, with no iteration trajectories to plot (see \ref{['appendix: formation implementation details']})
  • Figure 2: Results for the portfolio management example, showing the $1.5^\mathrm{th}, 25^\mathrm{th}, 50^\mathrm{th}, 75^\mathrm{th}$, and $98.5^\mathrm{th}$ percentiles. \ref{['eq: our method']} offers promising performance even when the direction oracle is estimated through comparisons. Dots represent outliers; cf. \ref{['appendix: stock implementation details']} for further details.
  • Figure 3: Repeating the Mediated Portfolio Management experiment for $N=2$ agents.
  • Figure 4: Repeating the Mediated Portfolio Management experiment for $N=3$ agents.
  • Figure 5: Repeating the Mediated Portfolio Management experiment for $N=5$ agents.

Theorems & Definitions (9)

  • Definition 1: Pareto Stationarity
  • Definition 2: Direction Oracle
  • Proposition 1: Inadequacy of \ref{['eq: nbs']} and \ref{['eq: ksbs']} for the direction oracle
  • Definition 3: Fixed/Equilibrium Points
  • Definition 4: Global Asymptotic Convergence
  • Theorem 1: Convergence of \ref{['eq: our method']} to Pareto Stationary Points
  • Theorem 2: Bargaining axioms satisfied by \ref{['eq: our method']}
  • Proposition 2
  • proof