Field Matters: A lightweight LLM-enhanced Method for CTR Prediction
Yu Cui, Feng Liu, Jiawei Chen, Xingyu Lou, Changwang Zhang, Jun Wang, Yuegang Sun, Xiaohu Yang, Can Wang
TL;DR
This work tackles CTR prediction at industrial scale by suppressing the heavy costs of instance- or user/item-level LLM processing. It introduces LLaCTR, a field-level enhancement framework built on SSFT for distilling field semantics, FRE for aligning field and feature embeddings, and FIE for injecting field-aware cues into feature interactions. Across four real-world datasets and six backbones, LLaCTR delivers about 2.24% relative AUC gains while achieving 10–100x reductions in training time compared to prior LLM-enhanced methods, with ablations confirming the necessity of all three components. The approach provides a practical, scalable path to integrate semantic knowledge from LLMs into production-grade CTR systems.
Abstract
Click-through rate (CTR) prediction is a fundamental task in modern recommender systems. In recent years, the integration of large language models (LLMs) has been shown to effectively enhance the performance of traditional CTR methods. However, existing LLM-enhanced methods often require extensive processing of detailed textual descriptions for large-scale instances or user/item entities, leading to substantial computational overhead. To address this challenge, this work introduces LLaCTR, a novel and lightweight LLM-enhanced CTR method that employs a field-level enhancement paradigm. Specifically, LLaCTR first utilizes LLMs to distill crucial and lightweight semantic knowledge from small-scale feature fields through self-supervised field-feature fine-tuning. Subsequently, it leverages this field-level semantic knowledge to enhance both feature representation and feature interactions. In our experiments, we integrate LLaCTR with six representative CTR models across four datasets, demonstrating its superior performance in terms of both effectiveness and efficiency compared to existing LLM-enhanced methods. Our code is available at https://anonymous.4open.science/r/LLaCTR-EC46.
