Prompt Tuning as User Inherent Profile Inference Machine

Yusheng Lu; Zhaocheng Du; Xiangyang Li; Pengyue Jia; Yejing Wang; Weiwen Liu; Yichao Wang; Huifeng Guo; Ruiming Tang; Zhenhua Dong; Yongrui Duan; Xiangyu Zhao

TL;DR

The paper tackles the problem of inferring latent user profiles for recommender systems using large language models, addressing twisted causality, textual noise, and modality gaps. It introduces UserIP-Tuning, which uses soft prompts and EM-guided latent-profile inference, followed by a quantization module that maps embeddings to lightweight collaborative IDs stored in a feature bank. The approach demonstrates superior performance over strong baselines, transfers across models, and delivers practical benefits in industrial-scale deployments, including online A/B validation. The work also emphasizes explainability of inferred profiles and shows robust improvements in both accuracy and efficiency, making it suitable for real-world recommender systems.

Abstract

Large Language Models (LLMs) have exhibited significant promise in recommender systems by empowering user profiles with their extensive world knowledge and superior reasoning capabilities. However, LLMs face challenges like unstable instruction compliance, modality gaps, and high inference latency, leading to textual noise and limiting their effectiveness in recommender systems. To address these challenges, we propose UserIP-Tuning, which uses prompt-tuning to infer user profiles. It integrates the causal relationship between user profiles and behavior sequences into LLMs' prompts. It employs Expectation Maximization (EM) to infer the embedded latent profile, minimizing textual noise by fixing the prompt template. Furthermore, a profile quantization codebook bridges the modality gap by categorizing profile embeddings into collaborative IDs pre-stored for online deployment. This improves time efficiency and reduces memory usage. Experiments show that UserIP-Tuning outperforms state-of-the-art recommendation algorithms. An industry application confirms its effectiveness, robustness, and transferability. The presented solution has been deployed in Huawei AppGallery's Explore page since May 2025, serving 2 million daily active users, delivering significant improvements in real-world recommendation scenarios. The code is publicly available for replication at https://github.com/Applied-Machine-Learning-Lab/UserIP-Tuning.
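The quantization step described in the abstract, which maps profile embeddings to collaborative IDs that are pre-stored for online serving, can be illustrated with a nearest-codeword lookup. This is only a minimal sketch under stated assumptions: the paper's quantization module may be trained and structured differently, and the names `nearest_code` and `build_feature_bank`, the toy codebook, and the user keys are hypothetical and do not come from the released code.

```python
from typing import Dict, List, Sequence


def nearest_code(embedding: Sequence[float],
                 codebook: Sequence[Sequence[float]]) -> int:
    """Return the index of the codebook vector closest to `embedding`.

    Closeness is squared Euclidean distance (an assumption for this sketch).
    """
    def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(range(len(codebook)),
               key=lambda k: sq_dist(embedding, codebook[k]))


def build_feature_bank(profile_embeddings: Dict[str, Sequence[float]],
                       codebook: Sequence[Sequence[float]]) -> Dict[str, int]:
    """Quantize every user's profile embedding into a discrete ID.

    The resulting user -> ID table plays the role of a pre-stored feature
    bank: at serving time only the integer ID is looked up, so no LLM call
    is needed online.
    """
    return {user: nearest_code(vec, codebook)
            for user, vec in profile_embeddings.items()}


# Toy data (hypothetical): three codewords in a 2-D embedding space.
codebook: List[List[float]] = [[0.0, 0.0], [1.0, 1.0], [-1.0, 2.0]]
embeddings: Dict[str, List[float]] = {
    "user_a": [0.9, 1.2],
    "user_b": [-0.8, 1.7],
    "user_c": [0.1, -0.2],
}

feature_bank = build_feature_bank(embeddings, codebook)
print(feature_bank)
```

In this toy setup the downstream recommender would consume the integer ID as an ordinary categorical feature, which is what keeps online latency and memory low compared with storing or recomputing dense LLM embeddings.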

Paper Structure (19 sections, 9 equations, 8 figures, 7 tables, 1 algorithm)

Introduction
Framework
Preliminary and Setup
Framework Overview
UserIP Inference Module
UserIP Quantization Module
UserIP Feature Bank and Downstream Recommender Model
Experiment
Datasets, Evaluation Metrics, and Baselines
Implementation Details
Overall Performance Comparison
Transferability Study
Ablation Studies
Parameter Analysis
Industrial Application Study
...and 4 more sections

Figures (8)

Figure 1: Example of inferring user latent profiles with LLMs based on observable behaviors. Blue lines indicate informative profiles, while white lines represent noise in the RS task.
Figure 2: Users' latent profile and observed behavior.
Figure 3: Overview of the UserIP-Tuning framework. Here, two user latent profiles are illustrated: hobby and income background. UserIP-Tuning consists of a UserIP inference module, a UserIP quantization module, and a pre-stored UserIP feature bank.
Figure 4: Top: causal relationship between missing user profiles and behaviors. The curves indicate the causal direction. Note that latent profiles are independent of each other. Bottom: causal mask in the UserIP Inference Module. A blue (gray) square denotes that column $j$ will (will not) attend to row $i$.
Figure 5: Prompt template for the user's latent profiles.
...and 3 more figures
