Learning Heuristics for Transit Network Design and Improvement with Deep Reinforcement Learning
Andrew Holliday, Ahmed El-Geneidy, Gregory Dudek
TL;DR
This paper introduces Neural Evolutionary Algorithm (NEA), a DRL-based approach that learns neural heuristics to guide transit network design within an evolutionary framework. By training a graph neural network policy with PPO on a construction MDP, NEA learns to compose and extend routes in a city graph, outperforming traditional evolutionary baselines on large benchmark cities and a real-world Laval case study. The results demonstrate state-of-the-art performance on challenging Mumford benchmarks and significant cost savings in Laval, illustrating the potential of neural heuristics to yield cost-efficient, rider-supporting transit networks. Limitations include training in a construction setting and the need for broader policy diversification; future work could extend neural heuristics to more operators and multi-objective metaheuristics for real-world deployment.
Abstract
Planning a network of public transit routes is a challenging optimization problem. Metaheuristic algorithms search through the space of possible transit networks by applying heuristics that randomly alter routes in a network. Existing algorithms almost exclusively use heuristics that modify the network in purely random ways. In this work, we explore whether we can obtain better transit networks using more intelligent heuristics, that modify networks according to a learned preference function instead of at random. We use reinforcement learning to train graph neural nets to act as heuristics. These neural heuristics yield improved results on benchmark synthetic cities with 70 nodes or more, and achieve new state-of-the-art results on the challenging Mumford benchmark. They also improve upon a simulation of the real transit network in the city of Laval, Canada, achieving cost savings of up to 19% over the city's existing transit network.
