Table of Contents
Fetching ...

Differentiable Bilevel Programming for Stackelberg Congestion Games

Jiayang Li, Jing Yu, Qianni Wang, Boyi Liu, Zhaoran Wang, Yu Marco Nie

TL;DR

This work addresses the computational intractability of large-scale Stackelberg congestion games by replacing the lower-level Wardrop equilibrium with a smooth imitative logit dynamic ($ILD$) that converges to $WE$. It then treats ILD as a differentiable program and introduces two local solvers, DolMD (double-loop mirror descent) and SilMD (single-loop with limited anticipation), to optimize the leader’s decision with gradient information obtained via automatic differentiation. The methods are validated on continuous network design problems (CNDP) and second-best congestion tolling problems (SCTP) across networks ranging up to nearly 100k origin-destination pairs, showing improved scalability, regularity, and competitive solution quality compared with sensitivity-analysis-based methods and heuristics. The results demonstrate that differentiable programming can provide scalable, reliable local solvers for large SCGs, potentially enabling practical deployment in transportation planning and infrastructure design with evolving behavioral dynamics.

Abstract

In a Stackelberg congestion game (SCG), a leader aims to maximize their own gain by anticipating and manipulating the equilibrium state at which the followers settle by playing a congestion game. Often formulated as bilevel programs, large-scale SCGs are well known for their intractability and complexity. Here, we attempt to tackle this computational challenge by marrying traditional methodologies with the latest differentiable programming techniques in machine learning. The core idea centers on replacing the lower-level equilibrium problem with a smooth evolution trajectory defined by the imitative logit dynamic (ILD), which we prove converges to the equilibrium of the congestion game under mild conditions. Building upon this theoretical foundation, we propose two new local search algorithms for SCGs. The first is a gradient descent algorithm that obtains the derivatives by unrolling ILD via differentiable programming. Thanks to the smoothness of ILD, the algorithm promises both efficiency and scalability. The second algorithm adds a heuristic twist by cutting short the followers' evolution trajectory. Behaviorally, this means that, instead of anticipating the followers' best response at equilibrium, the leader seeks to approximate that response by only looking ahead a limited number of steps. Our numerical experiments are carried out over various instances of classic SCG applications, ranging from toy benchmarks to large-scale real-world examples. The results show the proposed algorithms are reliable and scalable local solvers that deliver high-quality solutions with greater regularity and significantly less computational effort compared to the many incumbents included in our study.

Differentiable Bilevel Programming for Stackelberg Congestion Games

TL;DR

This work addresses the computational intractability of large-scale Stackelberg congestion games by replacing the lower-level Wardrop equilibrium with a smooth imitative logit dynamic () that converges to . It then treats ILD as a differentiable program and introduces two local solvers, DolMD (double-loop mirror descent) and SilMD (single-loop with limited anticipation), to optimize the leader’s decision with gradient information obtained via automatic differentiation. The methods are validated on continuous network design problems (CNDP) and second-best congestion tolling problems (SCTP) across networks ranging up to nearly 100k origin-destination pairs, showing improved scalability, regularity, and competitive solution quality compared with sensitivity-analysis-based methods and heuristics. The results demonstrate that differentiable programming can provide scalable, reliable local solvers for large SCGs, potentially enabling practical deployment in transportation planning and infrastructure design with evolving behavioral dynamics.

Abstract

In a Stackelberg congestion game (SCG), a leader aims to maximize their own gain by anticipating and manipulating the equilibrium state at which the followers settle by playing a congestion game. Often formulated as bilevel programs, large-scale SCGs are well known for their intractability and complexity. Here, we attempt to tackle this computational challenge by marrying traditional methodologies with the latest differentiable programming techniques in machine learning. The core idea centers on replacing the lower-level equilibrium problem with a smooth evolution trajectory defined by the imitative logit dynamic (ILD), which we prove converges to the equilibrium of the congestion game under mild conditions. Building upon this theoretical foundation, we propose two new local search algorithms for SCGs. The first is a gradient descent algorithm that obtains the derivatives by unrolling ILD via differentiable programming. Thanks to the smoothness of ILD, the algorithm promises both efficiency and scalability. The second algorithm adds a heuristic twist by cutting short the followers' evolution trajectory. Behaviorally, this means that, instead of anticipating the followers' best response at equilibrium, the leader seeks to approximate that response by only looking ahead a limited number of steps. Our numerical experiments are carried out over various instances of classic SCG applications, ranging from toy benchmarks to large-scale real-world examples. The results show the proposed algorithms are reliable and scalable local solvers that deliver high-quality solutions with greater regularity and significantly less computational effort compared to the many incumbents included in our study.
Paper Structure (43 sections, 21 theorems, 43 equations, 8 figures, 4 tables, 13 algorithms)

This paper contains 43 sections, 21 theorems, 43 equations, 8 figures, 4 tables, 13 algorithms.

Key Result

Proposition 3.2

A route choice ${\bm{p}}^* \in {\mathcal{P}}$ is a WE strategy if and only if

Figures (8)

  • Figure 1: A DiP representation of the leader's cost in an SCG.
  • Figure 1: Solutions obtained by CP, IOA, SO, DolMD (with $\varepsilon = 10^{-3}$), and SilMD (with $T = 1$) together with their objective values in solving CNDP on Braess.
  • Figure 2: Braess network.
  • Figure 3: Hearn network.
  • Figure 4: Optimality gaps of the solutions obtained by IOA, SO, SAB, DolMD, and SilMD in solving the CNDP problem on Braess.
  • ...and 3 more figures

Theorems & Definitions (25)

  • Definition 3.1
  • Proposition 3.2
  • Proposition 3.3
  • Proposition 3.4
  • Proposition 3.8
  • Proposition 3.9
  • Definition 4.1: Bregman divergence
  • Lemma 4.2
  • Proposition 4.3
  • Proposition 4.4
  • ...and 15 more