On Vessel Location Forecasting and the Effect of Federated Learning
Andreas Tritsarolis, Nikos Pelekis, Konstantina Bereta, Dimitris Zissis, Yannis Theodoridis
TL;DR
This work addresses vessel location forecasting under data privacy constraints by developing Nautilus, an LSTM-based centralized VLF model, and FedNautilus, a federated extension. Nautilus delivers competitive, often superior, short-term predictions (up to 60 minutes) on multiple AIS datasets, while FedNautilus exposes challenges from client drift due to data heterogeneity but gains from personalization (PerFL). The study provides a comprehensive comparison between centralized and federated learning in maritime contexts, including a detailed communication-cost analysis that favors FL in distributed settings. Practical implications include privacy-preserving collaborative VLF with significant reductions in data movement and potential improvements from client-specific personalization. Future work suggests enriching the model with weather and itinerary data and exploring cross-device FL to further optimize the accuracy-cost trade-off.
Abstract
The wide spread of Automatic Identification System (AIS) has motivated several maritime analytics operations. Vessel Location Forecasting (VLF) is one of the most critical operations for maritime awareness. However, accurate VLF is a challenging problem due to the complexity and dynamic nature of maritime traffic conditions. Furthermore, as privacy concerns and restrictions have grown, training data has become increasingly fragmented, resulting in dispersed databases of several isolated data silos among different organizations, which in turn decreases the quality of learning models. In this paper, we propose an efficient VLF solution based on LSTM neural networks, in two variants, namely Nautilus and FedNautilus for the centralized and the federated learning approach, respectively. We also demonstrate the superiority of the centralized approach with respect to current state of the art and discuss the advantages and disadvantages of the federated against the centralized approach.
