Counterclockwise Dissipativity, Potential Games and Evolutionary Nash Equilibrium Learning
Nuno C. Martins, Jair Certório, Matthew S. Hankins
TL;DR
The paper tackles evolutionary Nash equilibrium learning in large populations with potentially dynamic payoff mechanisms. It introduces counterclockwise dissipativity (CCW) as a unifying framework that accommodates both imitation-based rules and continuous delta-passive rules through conic combinations, linking CCW payoff mechanisms to potential games for memoryless cases. The authors prove that CCW payoff mechanisms guarantee convergence of the population state to the Nash equilibria set of the associated stationary game for any hybrid learning rule within a convex cone, and they provide a numerical example illustrating convergence to a symmetric Nash equilibrium. This work broadens the applicability of passivity-based analysis in multi-agent learning, offering design guidance for payoff mechanisms that ensure evolutionary Nash equilibrium learning even when the exact learning rule is uncertain or mixed. Overall, CCW dissipativity provides a robust, broadly applicable framework that encompasses delta-passive, imitator-based, and approximated best-response behaviors in population games.
Abstract
We use system-theoretic passivity methods to study evolutionary Nash equilibria learning in large populations of agents engaged in strategic, non-cooperative interactions. The agents follow learning rules (rules for short) that capture their strategic preferences and a payoff mechanism ascribes payoffs to the available strategies. The population's aggregate strategic profile is the state of an associated evolutionary dynamical system. Evolutionary Nash equilibrium learning refers to the convergence of this state to the Nash equilibria set of the payoff mechanism. Most approaches consider memoryless payoff mechanisms, such as potential games. Recently, methods using $δ$-passivity and equilibrium independent passivity (EIP) have introduced dynamic payoff mechanisms. However, $δ$-passivity does not hold when agents follow rules exhibiting ``imitation" behavior, such as in replicator dynamics. Conversely, EIP applies to the replicator dynamics but not to $δ$-passive rules. We address this gap using counterclockwise dissipativity (CCW). First, we prove that continuous memoryless payoff mechanisms are CCW if and only if they are potential games. Subsequently, under (possibly dynamic) CCW payoff mechanisms, we establish evolutionary Nash equilibrium learning for any rule within a convex cone spanned by imitation rules and continuous $δ$-passive rules.
