LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation
Hao Jiang, Guoquan Wang, Donglin Zhou, Sheng Yu, Yang Zeng, Wencong Zeng, Kun Gai, Guorui Zhou
TL;DR
This paper tackles the geographic misalignment in text-based item tokenization for local-life recommendations by introducing LGSID, a two-stage framework that first aligns LLMs to geographic knowledge via a geography-aware reward model and a novel G-DPO algorithm, then compresses semantic representations through Hierarchical Geographic Item Tokenization. The RL-based alignment injects real-world spatial proximity into LLM embeddings, while the HGIT stage produces geography-aware discrete tokens with reconstruction-driven and entropy-regularized training. Empirical results on a large Kuaishou local-life dataset show consistent improvements in both discriminative (AUC gains up to +0.0417 on several backbones) and generative (NDCG/Hit gains up to ~+0.13) settings, supported by ablations, visualizations, and case studies that illustrate enhanced geographic awareness without sacrificing semantic fidelity. The method offers scalable, geography-conscious tokenization suitable for industrial deployment, improving delivery-distance relevance and exposure fairness in spatially constrained recommendations, with a tunable balance between semantic and geographic signals via the similarity weight in G-DPO.
Abstract
Recent advances in Large Language Models (LLMs) have enhanced text-based recommendation by enriching traditional ID-based methods with semantic generalization capabilities. Text-based methods typically encode item textual information via prompt design and generate discrete semantic IDs through item tokenization. However, in domain-specific tasks such as local-life services, simply injecting location information into prompts fails to capture fine-grained spatial characteristics and real-world distance awareness among items. To address this, we propose LGSID, an LLM-Aligned Geographic Item Tokenization Framework for Local-life Recommendation. This framework consists of two key components: (1) RL-based Geographic LLM Alignment, and (2) Hierarchical Geographic Item Tokenization. In the RL-based alignment module, we initially train a list-wise reward model to capture real-world spatial relationships among items. We then introduce a novel G-DPO algorithm that uses pre-trained reward model to inject generalized spatial knowledge and collaborative signals into LLMs while preserving their semantic understanding. Furthermore, we propose a hierarchical geographic item tokenization strategy, where primary tokens are derived from discrete spatial and content attributes, and residual tokens are refined using the aligned LLM's geographic representation vectors. Extensive experiments on real-world Kuaishou industry datasets show that LGSID consistently outperforms state-of-the-art discriminative and generative recommendation models. Ablation studies, visualizations, and case studies further validate its effectiveness.
