Enhancing Maritime Trajectory Forecasting via H3 Index and Causal Language Modelling (CLM)
Nicolas Drapier, Aladine Chetouani, Aurélien Chateigner
TL;DR
The paper tackles global maritime trajectory forecasting using only GNSS data by discretising geographic coordinates with Uber's H3 index (resolution 10) and encoding them into a compact pseudo-octal token vocabulary. A causal language model (Mixtral8x7B) autoregressively predicts sequences of H3-based tokens, enabling long-range trajectory forecasting with a 2560-token sliding window and 220M parameters. Evaluated with Fréchet distance against Kalman filter baselines and the TrAISformer, the approach achieves substantial predictive accuracy and demonstrates global applicability, including a reported capacity to forecast up to 8 hours ahead with 30 minutes of context. The work identifies hallucinations as a characteristic of autoregressive trajectory generation and discusses potential mitigations via RL-based alignment and richer spatial tokens, while outlining future enhancements such as Fourier transforms to further improve modeling of maritime trajectories.
Abstract
The prediction of ship trajectories is a growing field of study in artificial intelligence. Traditional methods rely on the use of LSTM, GRU networks, and even Transformer architectures for the prediction of spatio-temporal series. This study proposes a viable alternative for predicting these trajectories using only GNSS positions. It considers this spatio-temporal problem as a natural language processing problem. The latitude/longitude coordinates of AIS messages are transformed into cell identifiers using the H3 index. Thanks to the pseudo-octal representation, it becomes easier for language models to learn the spatial hierarchy of the H3 index. The method is compared with a classical Kalman filter, widely used in the maritime domain, and introduces the Fréchet distance as the main evaluation metric. We show that it is possible to predict ship trajectories quite precisely up to 8 hours ahead with 30 minutes of context, using solely GNSS positions, without relying on any additional information such as speed, course, or external conditions - unlike many traditional methods. We demonstrate that this alternative works well enough to predict trajectories worldwide.
