Deciphering Air Travel Disruptions: A Machine Learning Approach

Aravinda Jatavallabha; Jacob Gerlach; Aadithya Naresh

Deciphering Air Travel Disruptions: A Machine Learning Approach

Aravinda Jatavallabha, Jacob Gerlach, Aadithya Naresh

TL;DR

Flight Delay Prediction investigates predicting individual delay components using ML, comparing time-series models (LSTM, BiLSTM, LSTM-CNN) against baseline regressors on the DOT BTS dataset (2019–2023). Predictions are evaluated with $MAE$ and $MSE$, highlighting modest gains from time-series approaches but limited accuracy for ARR_DELAY due to skewed distributions and pandemic-related disruptions. The study emphasizes model explainability through component-level predictions and reveals challenges in forecasting aviation delays with high reliability. This work informs aviation operations and planning by providing insights into the predictability of specific delay sources and the potential value of time-series modeling for proactive flight scheduling.

Abstract

This research investigates flight delay trends by examining factors such as departure time, airline, and airport. It employs regression machine learning methods to predict the contributions of various sources to delays. Time-series models, including LSTM, Hybrid LSTM, and Bi-LSTM, are compared with baseline regression models such as Multiple Regression, Decision Tree Regression, Random Forest Regression, and Neural Network. Despite considerable errors in the baseline models, the study aims to identify influential features in delay prediction, potentially informing flight planning strategies. Unlike previous work, this research focuses on regression tasks and explores the use of time-series models for predicting flight delays. It offers insights into aviation operations by independently analyzing each delay component (e.g., security, weather).

Deciphering Air Travel Disruptions: A Machine Learning Approach

TL;DR

and

, highlighting modest gains from time-series approaches but limited accuracy for ARR_DELAY due to skewed distributions and pandemic-related disruptions. The study emphasizes model explainability through component-level predictions and reveals challenges in forecasting aviation delays with high reliability. This work informs aviation operations and planning by providing insights into the predictability of specific delay sources and the potential value of time-series modeling for proactive flight scheduling.

Abstract

Paper Structure (29 sections, 9 equations, 11 figures, 6 tables)

This paper contains 29 sections, 9 equations, 11 figures, 6 tables.

Introduction
Statistical Analysis
Probabilistic Models
Machine Learning
Literature Survey
DATASET DESCRIPTION AND Preprocessing
Description
Pruning Data
Data Transformation
PROPOSED METHOD
EXPERIMENT
Multiple Regression
Decision Tree Regressor
Random Forest Regressor
XGBoost Regressor
...and 14 more sections

Figures (11)

Figure 1: Taxonomy of Flight Delay Prediction Problem
Figure 2: Flight record counts
Figure 3: Structure of Decision Tree Nodes: Root, Interior, and Leaf.
Figure 4: LSTM Unit Structure
Figure 5: LSTM architecture
...and 6 more figures

Deciphering Air Travel Disruptions: A Machine Learning Approach

TL;DR

Abstract

Deciphering Air Travel Disruptions: A Machine Learning Approach

Authors

TL;DR

Abstract

Table of Contents

Figures (11)