CHIME: A Compressive Framework for Holistic Interest Modeling
Yong Bai, Rui Xiang, Kaiyuan Li, Yongxiang Tang, Yanhua Cheng, Xialong Liu, Peng Jiang, Kun Gai
TL;DR
CHIME tackles scalable holistic long-term interest modeling by compressing full behavior sequences into compact histograms using an Interest Adaptation Module, an Interest Representation Module based on pretrained decoder-only LLMs, and an Interest Compression Module with residual vector quantization. It introduces holistic and immediate contrastive losses to align global and recent interests and demonstrates that pretrained LLM initialization improves performance as model depth grows. Experiments on MicroVideo, Tmall, and EBNeRD show consistent CTR/CVR gains and that CHIME can be integrated with existing ranking models and other long-term methods to reduce online computation. The work offers a practical, end-to-end, plug-and-play solution for industrial recommendation systems requiring scalable holistic user modeling.
Abstract
Modeling holistic user interests is important for improving recommendation systems but is challenged by high computational cost and difficulty in handling diverse information with full behavior context. Existing search-based methods might lose critical signals during behavior selection. To overcome these limitations, we propose CHIME: A Compressive Framework for Holistic Interest Modeling. It uses adapted large language models to encode complete user behaviors with heterogeneous inputs. We introduce multi-granular contrastive learning objectives to capture both persistent and transient interest patterns and apply residual vector quantization to generate compact embeddings. CHIME demonstrates superior ranking performance across diverse datasets, establishing a robust solution for scalable holistic interest modeling in recommendation systems.
