Forecasting MBTA Transit Dynamics: A Performance Benchmarking of Statistical and Machine Learning Models

Sai Siddharth Nalamalpu; Kaining Yuan; Aiden Zhou; Eugene Pinsky

TL;DR

This study benchmarks a broad set of statistical and machine-learning approaches for forecasting MBTA subway ridership (gated station entries) and delays, emphasizing calendar features over weather. It systematically tests 11 models across multiple covariate combinations with bootstrap RMSE and SHAP analyses, and includes a novel Hawkes self-exciting point process to model delay events. Key findings show that day of week and seasonality provide stronger predictive signals than weather, with Random Forest, Gradient Boosting, and MLPs delivering the best day-ahead performance; the Hawkes model offers calibrated next-event forecasts but is less effective for daily counts. The work informs transit planning by identifying robust predictors and illustrating how different modeling paradigms contribute to reliability and passenger information, while outlining avenues for higher-resolution data and spatially informed extensions.

Abstract

The Massachusetts Bay Transportation Authority (MBTA) is the main public transit provider in Boston, operating multiple modes of transport, including trains, subways, and buses. However, the system often faces delays and fluctuations in ridership volume, which negatively affect efficiency and passenger satisfaction. To better understand this phenomenon, this paper compares the performance of existing and novel methods to determine the best approach for predicting gated station entries in the subway system (a proxy for subway usage) and the number of delays in the overall MBTA system. To do so, this research considers factors that tend to affect public transportation, such as day of week, season, pressure, wind speed, average temperature, and precipitation. This paper evaluates the performance of 10 statistical and machine learning models on predicting next-day subway usage. For predicting the number of delays per day, the set is extended to 11 models by introducing a self-exciting point process model, representing a unique application of a point-process framework for MBTA delay modeling. This research experiments with the selective inclusion of features to determine feature importance, measuring model accuracy via Root Mean Squared Error (RMSE). Notably, providing either day of week or season data yields a more substantial gain in predictive accuracy than providing weather data; in fact, providing weather data generally worsens performance, suggesting a tendency of models to overfit.
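The selective-inclusion procedure described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are synthetic (ridership is generated from a made-up day-of-week level plus noise, and the `rain` flag is deliberately uninformative), the "model" is a simple group-mean lookup rather than any of the paper's models, and the function names are hypothetical. It shows only the mechanics of fitting on different feature subsets and comparing held-out RMSE, not the paper's results.

```python
import math
import random
from statistics import mean


def rmse(actual, predicted):
    """Root Mean Squared Error between two equal-length sequences."""
    return math.sqrt(mean((a - p) ** 2 for a, p in zip(actual, predicted)))


def make_synthetic_days(n_days, seed=0):
    """Hypothetical data: entries depend on day of week only; 'rain' is pure noise."""
    rng = random.Random(seed)
    weekday_level = [100, 105, 105, 105, 100, 60, 50]  # made-up Mon..Sun levels
    rows = []
    for d in range(n_days):
        dow = d % 7
        rain = rng.random() < 0.3
        entries = weekday_level[dow] + rng.gauss(0, 5)
        rows.append({"dow": dow, "rain": rain, "entries": entries})
    return rows


def fit_group_means(train, features):
    """Return a predictor: mean target per feature combination, global mean as fallback."""
    groups = {}
    for row in train:
        key = tuple(row[f] for f in features)
        groups.setdefault(key, []).append(row["entries"])
    table = {key: mean(values) for key, values in groups.items()}
    fallback = mean(row["entries"] for row in train)
    return lambda row: table.get(tuple(row[f] for f in features), fallback)


def evaluate_feature_sets(rows, feature_sets, train_frac=0.8):
    """Chronological split, then held-out RMSE for each feature subset."""
    split = int(len(rows) * train_frac)
    train, test = rows[:split], rows[split:]
    actual = [row["entries"] for row in test]
    results = {}
    for features in feature_sets:
        predict = fit_group_means(train, features)
        results[features] = rmse(actual, [predict(row) for row in test])
    return results


scores = evaluate_feature_sets(
    make_synthetic_days(700),
    [(), ("rain",), ("dow",), ("dow", "rain")],
)
for features, score in scores.items():
    print(features or ("baseline",), round(score, 2))
```

The empty feature tuple acts as a global-mean baseline, so each subset's RMSE can be read as a gain or loss relative to using no covariates at all. The chronological split is a design choice for time-ordered data; the paper's actual validation scheme is not stated in this section.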
