Strategizing Equitable Transit Evacuations: A Data-Driven Reinforcement Learning Approach
Fang Tang, Han Wang, Maria Laura Delle Monache
TL;DR
This work addresses equitable transit evacuation during natural disasters by formulating bus-based evacuation as a Markov Decision Process (MDP) and solving it with a controller based on Proximal Policy Optimization (PPO) that incorporates an equity penalty. The equity term uses a point-biserial index (r_pb) to prioritize equity-priority communities while minimizing total evacuation time, giving a data-driven balance between efficiency and fairness. Extensive simulations on a San Francisco Bay Area network, built from GTFS and OpenStreetMap data, show that the Equity-RL framework achieves substantially lower inequity (lower |r_pb|) and a more equitable service distribution than stochastic and rule-based baselines, with competitive evacuation times. The results suggest a scalable, real-time-capable approach to emergency management that can inform urban resilience planning and improve equitable access to evacuation resources.
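To make the equity term concrete, the sketch below computes a point-biserial correlation between a binary equity-priority flag and a continuous per-community service measure, then folds |r_pb| into a time-based reward. It uses the standard textbook definition of the point-biserial coefficient. The function names, the choice of service measure, the penalty weight, and the additive reward form are illustrative assumptions, not the authors' formulation.

```python
import math


def point_biserial(flags, values):
    """Point-biserial correlation between binary flags (0/1) and continuous values.

    Standard form: r_pb = (M1 - M0) / s_n * sqrt(n1 * n0 / n^2),
    where s_n is the population standard deviation of all values.
    """
    if len(flags) != len(values):
        raise ValueError("flags and values must have the same length")
    if any(f not in (0, 1) for f in flags):
        raise ValueError("flags must be 0 or 1")

    ones = [v for f, v in zip(flags, values) if f == 1]
    zeros = [v for f, v in zip(flags, values) if f == 0]
    n1, n0 = len(ones), len(zeros)
    n = n1 + n0
    if n1 == 0 or n0 == 0:
        raise ValueError("both groups must be non-empty")

    mean_all = sum(values) / n
    variance = sum((v - mean_all) ** 2 for v in values) / n
    if variance == 0:
        return 0.0  # no variation in service, so no measurable association

    mean1 = sum(ones) / n1
    mean0 = sum(zeros) / n0
    return (mean1 - mean0) / math.sqrt(variance) * math.sqrt(n1 * n0 / (n * n))


def equity_penalized_reward(evacuation_time, r_pb, weight=1.0):
    """Hypothetical reward: negative time cost minus a penalty on |r_pb|.

    The additive form and the weight are assumptions made for illustration.
    """
    return -evacuation_time - weight * abs(r_pb)


# Hypothetical data: 1 marks an equity-priority community,
# values are per-community waiting times in minutes.
flags = [1, 1, 0, 0]
waits = [30.0, 34.0, 12.0, 16.0]
r_pb = point_biserial(flags, waits)
reward = equity_penalized_reward(evacuation_time=45.0, r_pb=r_pb, weight=10.0)
```

A value of r_pb near zero means the service measure is not associated with equity-priority status, which is why the summary above reads lower |r_pb| as lower inequity.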
Abstract
As natural disasters become increasingly frequent, the need for efficient and equitable evacuation planning has become more critical. This paper proposes a data-driven, reinforcement learning-based framework to optimize bus-based evacuations with an emphasis on improving both efficiency and equity. We model the evacuation problem as a Markov Decision Process solved by reinforcement learning, using real-time transit data from the General Transit Feed Specification (GTFS) and transportation networks extracted from OpenStreetMap. The reinforcement learning agent dynamically reroutes buses from their scheduled locations to minimize passengers' total evacuation time while prioritizing equity-priority communities. Simulations on the San Francisco Bay Area transportation network indicate that the proposed framework achieves significant improvements in both evacuation efficiency and equitable service distribution compared to traditional rule-based and random strategies. These results highlight the potential of reinforcement learning to enhance system performance and urban resilience during emergency evacuations, offering a scalable solution for real-world applications in intelligent transportation systems.
