Learning Optimal Tax Design in Nonatomic Congestion Games
Qiwen Cui, Maryam Fazel, Simon S. Du
TL;DR
This work tackles learning an optimal tax to maximize social welfare in nonatomic congestion games under equilibrium feedback, where only Nash equilibria resulting from applied taxes are observable. It introduces a novel online framework that uses piecewise-linear approximations of the marginal-cost tax, enforces a strongly convex potential via an auxiliary linear term, and employs an exploratory tax to reveal information about unknown facilities. The authors prove that their method can achieve an $ε$-optimal tax with $O(F^2β/ε)$ samples, and they provide computationally efficient implementations for network congestion settings. The approach bridges learning under partial information with the structural properties of congestion games, offering a practical route to welfare-improving toll design in large-scale traffic and networked systems.
Abstract
In multiplayer games, self-interested behavior among the players can harm the social welfare. Tax mechanisms are a common method to alleviate this issue and induce socially optimal behavior. In this work, we take the initial step of learning the optimal tax that can maximize social welfare with limited feedback in congestion games. We propose a new type of feedback named \emph{equilibrium feedback}, where the tax designer can only observe the Nash equilibrium after deploying a tax plan. Existing algorithms are not applicable due to the exponentially large tax function space, nonexistence of the gradient, and nonconvexity of the objective. To tackle these challenges, we design a computationally efficient algorithm that leverages several novel components: (1) a piece-wise linear tax to approximate the optimal tax; (2) extra linear terms to guarantee a strongly convex potential function; (3) an efficient subroutine to find the exploratory tax that can provide critical information about the game. The algorithm can find an $ε$-optimal tax with $O(βF^2/ε)$ sample complexity, where $β$ is the smoothness of the cost function and $F$ is the number of facilities.
