Large Language Models for Next Point-of-Interest Recommendation
Peibo Li, Maarten de Rijke, Hao Xue, Shuang Ao, Yang Song, Flora D. Salim
TL;DR
This work tackles next POI recommendation by leveraging pretrained LLMs to preserve and exploit rich contextual information in LBSN data. The authors introduce LLM4POI, a framework built on trajectory prompting, a novel key-query similarity mechanism, and supervised fine-tuning with parameter-efficient methods (LoRA), quantization (NF4), and long-context support (FlashAttention-2). Empirical results across three real-world datasets show substantial improvements over strong baselines, with notable gains for cold-start and short-trajectory scenarios and evidence of cross-dataset generalization. Limitations include computational efficiency and context-length constraints, which the authors address as directions for future work such as chain-of-thought reasoning and extending beyond single-item recommendations.
Abstract
The next Point of Interest (POI) recommendation task is to predict users' immediate next POI visit given their historical data. Location-Based Social Network (LBSN) data, which is often used for the next POI recommendation task, comes with challenges. One frequently disregarded challenge is how to effectively use the abundant contextual information present in LBSN data. Previous methods are limited by their numerical nature and fail to address this challenge. In this paper, we propose a framework that uses pretrained Large Language Models (LLMs) to tackle this challenge. Our framework allows us to preserve heterogeneous LBSN data in its original format, hence avoiding the loss of contextual information. Furthermore, our framework is capable of comprehending the inherent meaning of contextual information due to the inclusion of commonsense knowledge. In experiments, we test our framework on three real-world LBSN datasets. Our results show that the proposed framework outperforms the state-of-the-art models in all three datasets. Our analysis demonstrates the effectiveness of the proposed framework in using contextual information as well as alleviating the commonly encountered cold-start and short trajectory problems.
