Nash Equilibrium Learning In Large Populations With First-Order Payoff Modifications
Matthew S. Hankins, Jair Certório, Tzuyu Jeng, Nuno C. Martins
TL;DR
This paper develops a unified framework for Nash equilibrium learning in large populations under first-order payoff modifications. It couples two nonstandard system-theoretic notions—counterclockwise dissipativity (CCW) and delta-passivity—to prove convergence for a broad class of payoff mechanisms and learning dynamics, including discontinuous best-response and continuous revisions. The analysis introduces an evolutionary differential inclusion (EDIM) blending best-response with continuous learning rules and proves that, under certain passivity conditions and boundedness assumptions, population trajectories converge to the Nash equilibrium set of an associated stationary game. A numerical example based on Braess's paradox demonstrates practical toll design that steers equilibria toward more efficient outcomes. The results extend prior work by enabling joint CCW/delta-passivity treatment of first-order payoff modifications and guiding design of payoff mechanisms in large populations.
Abstract
We establish Nash equilibrium learning in large populations of noncooperative, strategic agents. Our analysis considers the broadest class to date of payoff mechanisms with first-order modifications, capable of modeling bounded rationality and anticipatory effects, averaging, or Padé delay approximations. We propose a framework that, for the first time, combines two nonstandard system-theoretic passivity notions. Our results hold for discontinuous best response dynamics alongside continuous learning rules, significantly extending prior work.
