Efflex: Efficient and Flexible Pipeline for Spatio-Temporal Trajectory Graph Modeling and Representation Learning
Ming Cheng, Ziyi Zhou, Bowen Zhang, Ziyu Wang, Jiaqi Gan, Ziang Ren, Weiqi Feng, Yi Lyu, Hefan Zhang, Xingjian Diao
TL;DR
Efflex addresses the challenge of learning dense representations from massive spatio-temporal trajectories by constructing graphs directly from trajectories using a multi-scale KNN scheme with feature fusion to form a weighted adjacency $S$, where $s_{ij} = \frac{e^{dist(T_i, T_j)}}{\sum_{T_k \in \mathcal{K}} e^{dist(T_i, T_k)}}$ with $dist$ drawn from Fréchet, Hausdorff, or DTW, and then learning embeddings via a lightweight Graph Convolutional Network ($Efflex-B$) or a parameter-rich node2vec ($Efflex-L$). On Porto and Geolife, Efflex achieves state-of-the-art representation learning across multiple distance measures, while Efflex-B provides up to 36x faster embedding extraction with competitive accuracy. The framework is designed for both real-time, edge deployments and large-scale, compute-heavy settings, and points to future extensions such as incorporating Large Language Models as encoders for richer graph features.
Abstract
In the landscape of spatio-temporal data analytics, effective trajectory representation learning is paramount. To bridge the gap of learning accurate representations with efficient and flexible mechanisms, we introduce Efflex, a comprehensive pipeline for transformative graph modeling and representation learning of the large-volume spatio-temporal trajectories. Efflex pioneers the incorporation of a multi-scale k-nearest neighbors (KNN) algorithm with feature fusion for graph construction, marking a leap in dimensionality reduction techniques by preserving essential data features. Moreover, the groundbreaking graph construction mechanism and the high-performance lightweight GCN increase embedding extraction speed by up to 36 times faster. We further offer Efflex in two versions, Efflex-L for scenarios demanding high accuracy, and Efflex-B for environments requiring swift data processing. Comprehensive experimentation with the Porto and Geolife datasets validates our approach, positioning Efflex as the state-of-the-art in the domain. Such enhancements in speed and accuracy highlight the versatility of Efflex, underscoring its wide-ranging potential for deployment in time-sensitive and computationally constrained applications.
