Hierarchical Delay Attribution Classification using Unstructured Text in Train Management Systems
Anton Borg, Per Lingvall, Martin Svensson
TL;DR
This work tackles automating Swedish railway delay attribution coding, a manually performed task across ~200 hierarchical codes with a 9-day finalization window. It adopts a TF-IDF representation of unstructured delay reports and compares Random Forest and SVM classifiers, augmented with conformal prediction, in both flat and hierarchical multi-level setups. Results show that hierarchical classification improves performance over flat approaches at Levels 2 and 3, with RF and SVM approaching, but not matching, the manual operator (TKL) performance, highlighting practical value as a decision-support tool and uncertainty estimator. The study demonstrates feasibility and offers a path toward more reliable, faster coding, while outlining directions for future enhancements such as transformer-based text representations and improved explainability to further close the gap with human experts.
Abstract
EU directives stipulate a systematic follow-up of train delays. In Sweden, the Swedish Transport Administration registers and assigns an appropriate delay attribution code. However, this delay attribution code is assigned manually, which is a complex task. In this paper, a machine learning-based decision support for assigning delay attribution codes based on event descriptions is investigated. The text is transformed using TF-IDF, and two models, Random Forest and Support Vector Machine, are evaluated against a random uniform classifier and the classification performance of the Swedish Transport Administration. Further, the problem is modeled as both a hierarchical and flat approach. The results indicate that a hierarchical approach performs better than a flat approach. Both approaches perform better than the random uniform classifier but perform worse than the manual classification.
