Engineering Social Optimality via Utility Shaping in Non-Cooperative Games under Incomplete Information and Imperfect Monitoring
David Smith, Jie Dong, Yizhou Yang
TL;DR
The paper tackles decentralized decision-making under incomplete information and imperfect public monitoring. It introduces a message-free blueprint that engineers social optimality by embedding shadow prices or KKT-aligned penalties into private utilities, turning the stage game into an exact potential game whose unique Nash equilibrium aligns with the planner’s welfare $W(p)$ (including constraints). A curvature condition based on a single-inflection compressed/stretched exponential response ensures strong monotonicity and contraction, enabling a Bayesian equilibrium characterized as a strongly-monotone SVI; with noisy updates and drift, explicit tracking bounds are derived. Two computational studies—multi-tier supply chains and a non-cooperative agentic-AI compute market—demonstrate near-centralized welfare, elimination of steady-state constraint violations, and faster convergence compared to baselines, even under discrete quantization. The proposed deployable rules provide a practical, scalable path to achieving social optima with limited messaging, broad applicability across demand response, cloud/edge scheduling, transportation pricing, and biosecurity/agriculture, while acknowledging scope, limitations, and guardrails for robust deployment.
Abstract
In this paper, we study decentralized decision-making where agents optimize private objectives under incomplete information and imperfect public monitoring, in a non-cooperative setting. By shaping utilities-embedding shadow prices or Karush-Kuhn-Tucker(KKT)-aligned penalties-we make the stage game an exact-potential game whose unique equilibrium equals the (possibly constrained) social optimum. We characterize the Bayesian equilibrium as a stochastic variational inequality; strong monotonicity follows from a single-inflection compressed/stretched-exponential response combined with convex pricing. We give tracking bounds for damped-gradient and best-response-with-hysteresis updates under a noisy public index, and corresponding steady-state error. The framework accommodates discrete and continuous action sets and composes with slower discrete assignment. Deployable rules include: embed prices/penalties; publish a single public index; tune steps, damping, and dual rates for contraction. Computational experiments cover (i) a multi-tier supply chain and (ii) a non-cooperative agentic-AI compute market of bidding bots. Relative to price-only baselines, utility shaping attains near-centralized welfare, eliminates steady-state constraint/capacity violations when feasible, and accelerates convergence; with quantization, discrete equilibria track continuous ones within the mesh. The blueprint is portable to demand response, cloud/edge scheduling, and transportation pricing and biosecurity/agriculture. Overall, utility shaping plus a public index implements the constrained social optimum with stable equilibria under noise and drift-an operations-research-friendly alternative to heavy messaging or full mechanism design.
