RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility
Haoyu He, Haozheng Luo, Yan Chen, Qi R. Wang
TL;DR
RHYTHM tackles human mobility prediction by unifying hierarchical temporal tokenization with frozen LLM reasoning to capture multi-scale daily and weekly rhythms. It tokenizes trajectories into daily-like segments, enriches segment tokens with precomputed semantic embeddings from prompts, and processes them with a frozen LLM backbone to produce accurate location predictions while reducing computational load. Key contributions include temporal tokenization, prompt-guided semantic context integration, and a parameter-efficient frozen-backbone design that yields improvements in accuracy (notably 2.4% overall and 5.0% on weekends) and training speed (about 24.6% faster). The approach demonstrates strong performance across three real-world mobility datasets and offers scalable deployment with various pretrained LLMs, highlighting practical impact for large-scale mobility forecasting.
Abstract
Predicting human mobility is inherently challenging due to complex long-range dependencies and multi-scale periodic behaviors. To address this, we introduce RHYTHM (Reasoning with Hierarchical Temporal Tokenization for Human Mobility), a unified framework that leverages large language models (LLMs) as general-purpose spatio-temporal predictors and trajectory reasoners. Methodologically, RHYTHM employs temporal tokenization to partition each trajectory into daily segments and encode them as discrete tokens with hierarchical attention that captures both daily and weekly dependencies, thereby quadratically reducing the sequence length while preserving cyclical information. Additionally, we enrich token representations by adding pre-computed prompt embeddings for trajectory segments and prediction targets via a frozen LLM, and feeding these combined embeddings back into the LLM backbone to capture complex interdependencies. Computationally, RHYTHM keeps the pretrained LLM backbone frozen, yielding faster training and lower memory usage. We evaluate our model against state-of-the-art methods using three real-world datasets. Notably, RHYTHM achieves a 2.4% improvement in overall accuracy, a 5.0% increase on weekends, and a 24.6% reduction in training time. Code is publicly available at https://github.com/he-h/rhythm.
