Clustering Dynamics for Improved Speed Prediction Deriving from Topographical GPS Registrations
Sarah Almeida Carneiro, Giovanni Chierchia, Aurelie Pirayre, Laurent Najman
TL;DR
This work tackles speed prediction in data-sparse regions by exploiting topographical similarities and road-design features through a Temporally Orientated Speed Dictionary centered on topographically clustered links (CDS). It introduces an Off-training Region Table to build specialized dictionaries, an ILSTM-based per-link spatio-temporal representation, and a Random Ordered Past Point Association (ROPPA) mechanism within an RNN that jointly optimizes regression and classification losses. Experimental results across topographical and infrastructural feature sets show that CDS-derived features and ROPPA-RNN architectures yield competitive, often superior, speed predictions with reduced dependence on exact vehicle positioning. The approach enables extrapolation to missing regions and supports simulatable, data-efficient speed profiling for Intelligent Transportation Systems.
Abstract
A persistent challenge in the field of Intelligent Transportation Systems is to extract accurate traffic insights from geographic regions with scarce or no data coverage. To this end, we propose solutions for speed prediction using sparse GPS data points and their associated topographical and road design features. Our goal is to investigate whether we can use similarities in the terrain and infrastructure to train a machine learning model that can predict speed in regions where we lack transportation data. For this we create a Temporally Orientated Speed Dictionary Centered on Topographically Clustered Roads, which helps us to provide speed correlations to selected feature configurations. Our results show qualitative and quantitative improvement over new and standard regression methods. The presented framework provides a fresh perspective on devising strategies for missing data traffic analysis.
