Pretrained Mobility Transformer: A Foundation Model for Human Mobility
Xinhua Wu, Haoyu He, Yanchao Wang, Qi Wang
TL;DR
This work presents Pretrained Mobility Transformer (PMT), a transformer-based foundation model trained on massive unlabeled location-based service trajectories to learn urban space representations. By tokenizing geographic areas as trainable spatial embeddings and integrating spatiotemporal encoding, PMT is pretrained with next-location prediction and mask imputation tasks to capture complex mobility patterns. Across three U.S. MSAs, PMT learns spatial embeddings that reflect geographic proximity and socio-demographic attributes, and larger PMT variants consistently outperform baselines on next-location prediction, trajectory imputation, and trajectory generation, suggesting scalable benefits from foundation-model pretraining in human mobility. The study highlights PMT's potential to inform urban planning and mobility analytics while acknowledging sampling bias and privacy considerations inherent to LBS data.
Abstract
Ubiquitous mobile devices generate vast amounts of location-based service (LBS) data that reveal in detail how individuals navigate and use urban spaces. In this study, we use these extensive, unlabeled sequences of user trajectories to develop a foundation model for understanding urban space and human mobility. We introduce the Pretrained Mobility Transformer (PMT), which uses the transformer architecture to process user trajectories autoregressively, converting geographical areas into tokens and embedding spatial and temporal information within these representations. Experiments conducted in three U.S. metropolitan areas over a two-month period demonstrate PMT's ability to capture underlying geographic and socio-demographic characteristics of regions. PMT also performs strongly across downstream tasks, including next-location prediction, trajectory imputation, and trajectory generation. These results support PMT's effectiveness in decoding complex patterns of human mobility, offering new insights into urban spatial functionality and individual mobility preferences.
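As an illustration of the data preparation implied by this description, the following minimal Python sketch turns a raw trajectory into area tokens with a temporal feature and builds training examples for the two pretraining objectives named in the TL;DR (next-location prediction and mask imputation). It is a sketch under stated assumptions, not the authors' implementation: the fixed latitude/longitude grid, the hour-of-week feature, the 15% masking rate, and all function names (`area_token`, `tokenize`, `next_location_pairs`, `mask_for_imputation`) are hypothetical choices made here for illustration.

```python
import math
import random

MASK = "<mask>"  # hypothetical placeholder token for hidden locations


def area_token(lat, lon, cell_deg=0.01):
    """Map a coordinate to a grid-cell token.

    Assumption: areas are cells of a regular lat/lon grid; the paper's
    actual area definition may differ.
    """
    return f"{math.floor(lat / cell_deg)}_{math.floor(lon / cell_deg)}"


def tokenize(trajectory, cell_deg=0.01):
    """Convert (lat, lon, unix_seconds) points to (area_token, hour_of_week).

    hour_of_week counts hours modulo 168 from the Unix epoch (UTC), so the
    weekly origin is arbitrary; it only needs to be consistent.
    """
    tokens = []
    for lat, lon, ts in trajectory:
        hour_of_week = (ts // 3600) % 168
        tokens.append((area_token(lat, lon, cell_deg), hour_of_week))
    return tokens


def next_location_pairs(tokens):
    """Autoregressive examples: predict the area at step t+1 from steps 0..t."""
    return [(tokens[: t + 1], tokens[t + 1][0]) for t in range(len(tokens) - 1)]


def mask_for_imputation(tokens, mask_prob=0.15, rng=None):
    """Hide some area tokens (timestamps stay visible) and record the targets."""
    rng = rng or random.Random(0)
    masked, targets = [], {}
    for i, (area, hour) in enumerate(tokens):
        if rng.random() < mask_prob:
            masked.append((MASK, hour))
            targets[i] = area  # position -> true area to reconstruct
        else:
            masked.append((area, hour))
    return masked, targets


if __name__ == "__main__":
    # Three made-up pings, one hour apart.
    traj = [(29.765, -95.365, 0), (29.765, -95.365, 3600), (29.775, -95.365, 7200)]
    toks = tokenize(traj)
    print(next_location_pairs(toks))
    print(mask_for_imputation(toks, mask_prob=0.5))
```

In a full model, each area token would index a trainable embedding table and the temporal feature would be encoded and combined with it before the transformer layers; those components are omitted here because the abstract does not specify them.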
