Volvo Discovery Challenge at ECML-PKDD 2024
Mahmoud Rahat, Peyman Sheikholharam Mashhadi, Sławomir Nowaczyk, Shamik Choudhury, Leo Petrin, Thorsteinn Rognvaldsson, Andreas Voskou, Carlo Metta, Claudio Savelli
TL;DR
The Volvo Discovery Challenge at ECML-PKDD 2024 tackled predictive maintenance by forecasting a three-class failure-risk label for a Volvo truck component using real-world fleet data, with training restricted to gen1 and testing across gen1/gen2. The paper presents dataset structure, evaluation protocol, and three winning methodologies—STab-based tabular modeling with ensemble and normalization, LSTM with pseudo-labeling and boosting, and Time-Transformer–driven two-stage risk classification with rich temporal feature extraction. Key contributions include a detailed account of the competition setup, performance patterns across generations, and actionable code resources from top teams for predictive maintenance research. The work demonstrates robust generalization to a new component generation and offers practical methodologies and benchmarks for applied maintenance analytics in industrial settings.
Abstract
This paper presents an overview of the Volvo Discovery Challenge, held during the ECML-PKDD 2024 conference. The challenge's goal was to predict the failure risk of an anonymized component in Volvo trucks using a newly published dataset. The test data included observations from two generations (gen1 and gen2) of the component, while the training data was provided only for gen1. The challenge attracted 52 data scientists from around the world who submitted a total of 791 entries. We provide a brief description of the problem definition, challenge setup, and statistics about the submissions. In the section on winning methodologies, the first, second, and third-place winners of the competition briefly describe their proposed methods and provide GitHub links to their implemented code. The shared code can be interesting as an advanced methodology for researchers in the predictive maintenance domain. The competition was hosted on the Codabench platform.
